Ă propos de Tuka
Data (Business) Scientist Freelance â Machine Learning & Automatisation
Anglais
Bilingue ou natif
Français
Bilingue ou natif
Expériences
- Better & StrongerData ScientistCONSEIL & AUDITfĂ©vrier 2025 - septembre 2025 (7 mois)Lyon, Franceđ Project 1:Classification of visitors to a hotel booking website in order to analyse potential purchasers.đŻ Context:TraïŹc data from a hotel groupâs website is collected and used to analyse user behaviour and optimise the site journey. For advertising purposes, the Marketing team leverages these data to build targeted ad audiences and promote the hotelâs oïŹerings. Classifying users therefore helps to understand distinct user segments and the common behaviours that drive them to purchase.â Missions:†Audited the existing classification model to determine why it failed to meet marketing requirements†Analysed the dataset to demonstrate overfitting in the current model†Advised the Data team on the scarcity of positive labels (58 million rows, only 0.00418% positive)†Identified correlations between user events to isolate the most relevant features†Built the first draft of the data pipeline, including data cleaning and splitting into training, test and production sets†Retrained the existing Boosted Tree Classifier; trained Logistic Regression and Random Forest models†Calculated feature importances and deployed the Random Forest model into production†Presented findings and delivered workshops to both the Marketing and Data teams
- Better & StrongerData ScientistCONSEIL & AUDITfĂ©vrier 2025 - septembre 2025 (7 mois)Lyon, Franceđ Project 2:Categorisation of usersâ purchase probabilities on the website using a clustering model.đŻ Context:Following on from the user classification project, the Marketing team wishes to categorise the probability segments as follows: very low purchase probability, low purchase probability, moderate purchase probability and high purchase probability. These segments will be used by the Marketing team to create personalised audiences to enhance the performance of their online advertising.â Missions:†Applied arbitrary marketing criteria to define audience segments†Analysed the distribution of model-generated probabilities†Employed a clustering model on the probability scores†Delivered the resulting audience segments into a dedicated table†Presented the audiences and accompanying analyses to the Marketing team
- ENDEL ENGIEData ScientistENERGIEjuin 2025 - juin 2025Paris, Franceđ Project:Creation of a prompt engineering repository to automate content creation.đŻ Context:For ENGIEâs Legal Research & Development team, I created a knowledge base to enable the business unit to automate content creation, retrieval and verification. I also trained the team to use LLMs and techniques such as zero-shot, one-shot and few-shot learning.â Missions:†Define what AI is and where LLMs sit within the field of AI†Explain how to maintain critical thinking when using an LLM†Advise on how to use LLMs as a legal researcher†Master techniques such as Chain-of-Thought, zero-shot, one-shot and few-shot learning†Demonstrate and embed best technical practices within the team†Explain how to detect biases in outputs and audit a prompt from its formulation to the responses obtained
Avis
Recommandations
Ces profils de freelance correspondent également à vos critÚres
Agatha Frydrych
Backend Java Software Engineer
4.7
(3)
2
Baptiste Duhen
Fullstack developer
4.6
(4)
5
Amed Hamou
Senior Lead Developer
4
(2)
7
Audrey Champion
Web developer
4.3
(3)
4
Formations
- Master Concepteur développeur en science des donnéesJedhaStatistiques, Data Science & Engineering
- Bachelor's degree, Python and Machine LearningEmil2022Bachelor's degree, Python and Machine Learning
Certifications
- Python et Machine LearningEmil2022
- SQL for data scienceedX2021