About Dorsaf
Arabic
Native or bilingual
English
Full professional proficiency
French
Native or bilingual
Experience
- OVHCloud, Data Engineer (High Tech)
January 2025 - Present (1 year and 5 months), Paris, France
General context: Manage data quality and data governance for enterprise datasets, implementing a full DQ framework and metadata governance layer.
• Implemented an end-to-end data quality framework using PySpark and Airflow (daily and monthly rules, scoring, reporting) and configured the alerting system.
• Built and automated DataHub pipelines: ingestion, transformers, and lineage extraction from Airflow, Spark and SQL operators.
• Defined and deployed dataset governance policies: domains, ownership, schema sensitivity tags and metadata standardization.
• Implemented data anonymisation rules (year-level masking for personal attributes).
• Integrated CI/CD for governance and quality workflows to ensure reproducibility across environments.
- Publicis Media, Data Engineer
September 2022 - December 2024 (2 years and 3 months), France
General context: Manage the architecture, governance, and quality of media data. Organize, consolidate, and monitor the ingestion system for various data sources.
• Developed data models in SQL, made the data available in BigQuery for the Data Analyst and Dashboarding teams, and scheduled stored procedures for incremental loads.
• Data platform: automated GCP resource provisioning and multi-source data pipelines using Terraform with a CI/CD pipeline.
• Created monitoring pipelines covering email alerting with Cloud Functions in Python, backup and archiving, and data observability.
• Automated DAGs in Snowflake using tasks.
• Developed an MLOps pipeline in Snowflake.
• Developed attribution and contribution models using Shapley values, logistic regression and Markov chains.
• Called APIs such as the Bing API and the Facebook API for specific business reports, for instance the auction insight report, and scheduled alerting for the nomenclature.
• R&D project on cookieless attribution and contribution using econometrics, namely the ARDL model.
• Keywords: Python, BigQuery, SQL, Snowflake, GCP, Terraform, Cloud Build, Cloud Scheduler, Workflows, Cloud Functions, Adverity
- Freelance, Medix & Talentoday
Remote, clients in the US and France
- Freelance, DNAAfrica, Data Engineer / ML Engineer / Data Scientist
December 2020 - February 2021 (2 months), Mallard Point Remote non-electric canoe-in only campsite, Eyota, MN, USA
General context: Development of a lead generation application using NLP and ML algorithms for the client.
• Implemented the backend architecture: scraping, storage in a Firebase NoSQL database, Flask and GCP APIs.
• Deployed the Python script on App Engine, configured resources (CPU, memory, readiness and liveness checks) and handled real-time maintenance.
• Automated ETL pipelines running on an App Engine instance using Flask APIs: scraping, preprocessing, scoring, storing in Firebase, and a daily feedback loop through Cloud Scheduler vs cron jobs.
• Created a Cloud Storage bucket for ML models serialized on a daily basis.
• Set up a user feedback loop from the interface.
• Developed variable extraction functions for models from Tweets and Google Alerts (country, industry, company, product...).
• Selected, trained, and evaluated the scoring models for the scraped news (CatBoost) and the classification models for the news category (NLP with TF-IDF and a Bi-LSTM with attention in PyTorch).
Education
- Engineering Degree in Statistics and Information Analysis, Higher School of Statistics and Information Analysis, ESSAIT, 2018
- & Physics, Preparatory Institute for Engineering Studies of El Manar