Description

J'aide les équipes IA à maîtriser leurs systèmes : optimisation, benchmark, qualification.

Entraînement LLM, pipelines d'inférence, calcul scientifique, comparaison de stacks ou de hardware : la question centrale est toujours la même — est-ce que le système fait ce qu'il devrait, et comment le prouver chiffres en main ?

Mon approche combine deux expertises rares en France :

→ Optimisation GPU bas niveau : profilage PyTorch / Nsight, identification des régimes (math-bound, memory-bound, overhead-bound, SFU-bound), tuning FSDP / DeepSpeed / vLLM, parallélismes 3D, kernels Triton.

→ Benchmark et qualification système : conception et exécution de campagnes de tests sur infrastructure IA (training, inférence, storage, multi-nœuds), définition des métriques (TTFT, ITL, throughput, scaling efficiency), comparaisons de stacks (PyTorch vs JAX, vLLM vs TensorRT-LLM, FSDP vs DeepSpeed), évaluation LLM (Lighteval, MLPerf-style).

Mon parcours : 5 ans au CNRS/IDRIS sur le supercalculateur Jean Zay (H100/A100/V100, InfiniBand, Lustre, SLURM). Support expert sur des projets nationaux : pretraining BLOOM, OpenLLM France, Scribe, leaderboard INESIA, collaboration Hugging Face. 20 ans d'ingénierie dont 10+ en Deep Learning et HPC.

Ce sur quoi on peut travailler ensemble :

→ Audit performance GPU (5-10 jours) : diagnostic complet, identification des goulots, plan d'optimisation chiffré

→ Optimisation entraînement / inférence LLM (mission longue) : FSDP, DeepSpeed, 3D parallelism, vLLM, TensorRT-LLM

→ Campagne de benchmark / qualification système (mission longue) : conception du test plan, exécution sur cluster, rapports techniques pour audience NVIDIA / clients / décideurs

→ Évaluation LLM et IA de confiance : méthodes d'évaluation, hallucination analysis, benchmarks reproductibles, contribution à des leaderboards

→ Formation / transfert de compétences : Deep Learning distribué, profiling GPU, usage HPC

Langues

Français
Bilingue ou natif
Anglais
Capacité professionnelle complète

Préférences en matière de lieu de travail

Accepte de travailler sur site

Orsay (jusqu’à 50 km)

IDRIS
| AI/HPC Support Engineer
juin 2020 - décembre 2025 (5 ans et 6 mois)
91400 Orsay, France
• ● Provided advanced consulting and optimization support for AI workloads running on the Jean Zay supercomputer (850 compute nodes, SLURM scheduling, multi-generation GPU fleet: V100, A100, H100, B200; high-speed interconnects: Omni-Path 100, InfiniBand, NVLink).
• ● Assisted large-scale flagship projects running on Jean Zay, including BIG Science (2022) – BLOOM pretraining, OpenLLM (2024–2025), and Scribe (2025).
• ● Designed, led, and delivered the "Optimized Deep Learning on Jean Zay" training program (2022–2025), covering Dataloader optimization, GPU computing fundamentals, DDP, FSDP, 3D Parallelism (TP/PP/DP), and DeepSpeed.
• ● Active contributor and presenter on the FIDLE YouTube channel, a CNRS-backed academic reference for self-learning Deep Learning (CNRS, MIAI, EFELIA).
• ● Developed the National Coordination / INESIA Leaderboard for AI Summit 2025, in collaboration with Hugging Face teams.
• ● Authored and published a poster for the GENCI cluster user community and conference audience. (2025): "Democratizing LLMFine-Tuning"
• ● Contributed to the long-term sustainability of the ChallengeData platform operated by Stéphane Mallat, supporting data science competitions
Pytorch vllm LLM Training profiling HPC
ASTEK (SNCF Innovation Program)
Data Scientist / Research Engineer
septembre 2017 - février 2020 (2 ans et 5 mois)
• ● Processed and analyzed > 1 TB of geometric data from IRIS320 measurement trains.
• ● Built predictive ML models for track wear and equipment fault detection.
• ● Deployed ML pipelines for railway maintenance optimization.
Machine learning Big Data
BEKK Studio
Computer Vision & Mobile AI Developer
juillet 2015 - septembre 2017 (2 ans et 2 mois)
• ● Developed Android applications using Computer Vision, NLP, and Deep Learning.
• ● Created experimental reinforcement learning trading bots and video AI apps with TensorFlow Lite.
Computer Vision Deep Learning

Consulter toutes les expériences de Bertrand

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

Baptiste Duhen

Fullstack developer

4.6

(4)

Amed Hamou

Senior Lead Developer

(2)

Audrey Champion

Web developer

4.3

(3)

S’inscrire pour les voir

Engineering Degree
ISEP (Institut Supérieur d'Électronique de Paris)
2005
Engineering Degree
Deep Reinforcement Learning Engineer
2016
Deep Reinforcement Learning Engineer

Consultez la formation qu'a suivie Bertrand

Bertrand Cabot

AI/HPC Engineer

À propos de Bertrand

Expériences

Recommandations

Ces profils de freelance correspondent également à vos critères

Formations

Compétences

Catégories