Description

PhD student in NLP at Inria Paris (ALMAnaCH team). I work on pretraining and adapting LLMs.

I published CamemBERT-bio, a French biomedical NLP model with 100k+ downloads on Hugging Face.

During my PhD I contributed to GAPeron, a suite of open-source French LLMs (1.5B to 24B parameters) trained on 128+ AMD/NVIDIA GPUs.

What I can do:

- Pretraining LLMs from scratch (DeepSpeed, FSDP, distributed infrastructure)

- Large-scale data curation

- Deploying 7B-70B LLMs in production (vLLM, GPTQ/AWQ quantization)

- NER and information extraction

- Fine-tuning for specialized domains (biomedical, legal, etc.)

Graduate engineer from ECE Paris and CentraleSupélec, specialized in Artificial Intelligence.

Available evenings and weekends for short assignments.

Publications (2 first-author, 53 citations):

- CamemBERT-bio (LREC-COLING 2024)

- GAPeron LLMs (submitted to ACL 2026)

- Biomed-Enriched (submitted to ACL 2026)

- CamemBERT 2.0 (arXiv 2024)

Google Scholar: scholar.google.com/citations?user=f5hnXrcAAAAJ

Areas of expertise

Languages

French
Native or bilingual
English
Full professional proficiency

Workplace preferences

Remote only

Works mostly remotely

ViaDialog
R&D Consultant - NLP/NER
March 2025 - June 2025 (3 months)
Built French text anonymization system using NER for customer data. Designed LLM annotation pipeline (vLLM + constrained decoding) to generate synthetic training data; distilled to production NER model (0.82 F1).
Praxysanté
R&D Consultant - LLM Infrastructure
June 2023 - June 2024 (1 year)
Deployed open-source LLMs (7B-70B) for healthcare applications with quantization (GPTQ, AWQ) on high-end GPUs. Built production inference pipeline with vLLM and FastAPI.
Inria
PhD student
RESEARCH CENTERS
June 2022 - Present (4 years)
Paris, France
PhD thesis on LLM pretraining for clinical NLP, supervised by Laurent Romary and Eric de La Clergerie (ALMAnaCH team).

Key achievements:
- Trained a 7B decoder from scratch on 128 GPUs (8.7k GPU-hours), matching clinical SOTA with 2.5x fewer tokens
- Published CamemBERT-bio, a biomedical NLP model with 100k+ downloads on Hugging Face
- Core contributor to GAPeron, a suite of open French LLMs (1.5B-24B parameters)
- Research published at LREC-COLING 2024 and submitted to ACL 2026
NLP LLMs


Olivier

Monument SAS

Review left on 15/02/2022

Very satisfied with how the assignment went and with the result: attentiveness, responsiveness, objectives and deadlines met. I recommend Rian.

Deleted account

Review left on 14/11/2018

1) Very competitive rate for small associations like ours. 2) I got 2 small applications for my 2 different associations for the same price, €300 for both apps. 3) He guarantees bug fixes for about 3 months. 4) He provides the source code, which was essential for us. 5) Only regret: it was impossible to meet him, everything is done remotely. I highly recommend Mr Ryan.

A former user and 2 other people recommend Rian


Master's Degree in Engineering
ECE Paris École d'ingénieur
2022
• Maths (Linear Algebra, Probability, Calculus ...) • C/C++, Java, and C# programming • Algorithms and Data Structures, Graph Theory • Advanced Database and SQL
M2 - Intelligence Artificielle
CentraleSupélec
2022
• Machine Learning fundamentals and Deep Learning • Reinforcement Learning • Computer Vision and NLP • Explainable AI


Developing Android Apps - MOOC
Google - Udacity
2017
Java Android
Kotlin for Android Developers - MOOC
Google - Udacity
2018
Kotlin

Data Scientist

Rian T.

PhD LLM Training & Deployment | NLP | AI


Reviews

5.0

Quality

5.0

Timeliness

5.0

Communication

4.5
