You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Sofian ChayboutiSC

Sofian Chaybouti

Machine Learning Engineer

350 €/jour
4 projets
Paris, FR
3-7 ans

Délai de réponse moyen : 1h

À propos de Sofian

Ingénieur diplômé de l'ENSTA Paris et du Master MVA de l'ENS Paris-Saclay,
Je suis formé et experimenté en Machine Learning, Deep Learning, NLP et vision.
Je suis actuellement ingénieur de recherche au Noah's Ark Lab où je travaille sur des projets de recherche en deep learning, RL, bandits et optimisation.
  • Français

    Bilingue ou natif

  • Anglais

    Capacité professionnelle complète

En télétravail uniquement
Travaille majoritairement à distance

Expériences

  • Huawei
    Research Engineer
    TÉLÉCOMMUNICATIONS
    août 2021 - Aujourd'hui (4 ans et 10 mois)
    Boulogne-Billancourt, France
    Noah's Ark Lab
  • Crédit Agricole SA
    Data Scientist
    BANQUE & ASSURANCES
    novembre 2020 - juillet 2021 (8 mois)
    Montrouge, France
    DataLab, Team Semantica.

    - Aspect Based Sentiment Analysis for clients' feedback analysis :
    The goal of this project was to improve the model in production by using transformers models.
    A multitask model that achieves both aspect detection and polarity detection was implemented and put in production.

    - Project on financial contracts analysis from the investment bank CACIB :
    The goal of the project is to build a search engine on the contracts database.
    The contracts are in pdf format, have to be converted to images, ocerized and indexed in an Elasticsearch index.
    The search engine has many features : - Segmentation of the contract into paragraphs, lists, clauses
    - Clause classification that allows to extract specific clauses of interest using tf-idf approaches.
    - Extraction of relevant spans that helps rule whether the contract is transferable using techniques inspired from neural question answering and semantic similarity

    - Training of Multimodal Classification deep learning models of car insurance contracts using textual (ocr) and visual content.

    - Training of Information Extraction deep learning models from car insurance contracts using semantic segmentation and textual embeddings.


    Technological stack : pytorch, tensoflow, transformers, tesseract, elasticsearch, mlflow, gitlab CI/CD, poetry, docker, AWS S3, mlflow, etc.
    NLP Computer Vision Python Pytorch TensorFlow
  • Crédit Agricole SA
    NLP research intern
    BANQUE & ASSURANCES
    avril 2020 - octobre 2020 (6 mois)
    Montrouge, France
    DataLab, Team Semantica.

    Designed a French textual search engine with a span extraction module.
    Achieved state-of-the-art results on the Phrase-Indexed QA (PIQA) benchmark.

    Achieved state-of-the-art results on the squad-open benchmark.
    (preprint : https://arxiv.org/abs/2012.09766)
    NLP Pytorch Python Research Deep Learning Transfer Learning Multitask Learning Elasticsearch

Recommandations

Soyez le premier à recommander Sofian

Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.

Ces profils de freelance correspondent également à vos critères

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Formations

  • Master MVA
    École Normale Supérieure Paris-Saclay
    2020
  • Ingénieur
    ENSTA Paris
    2020

Compétences (12)

Catégories