You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Dorsaf SdiriDS

Dorsaf Sdiri

Machine Learning et Data Engineer

550 €/jour
Paris, FR
3-7 ans

Délai de réponse moyen : 1h

À propos de Dorsaf

Passionnée par les technologies, l'automatisation, le fast prototyping et surtout l'innovation, j'ai impléménté plusieurs pipelines de dataops, mlops ETL pipelines et j'ai méné à bien end to end ML projetcs, j'ai une connaissance solide sur tout le scope Data ( statistiques, ML , Data engineering , ops )
  • Arabe

    Bilingue ou natif

  • Anglais

    Capacité professionnelle complète

  • Français

    Bilingue ou natif

Accepte de travailler sur site
Paris (jusqu’à 50 km)

Expériences

  • OVHCloud
    Data Engineer
    HIGH TECH
    janvier 2025 - Aujourd'hui (1 an et 5 mois)
    Paris, France
    General context
    Manage data quality and data governance for enterprise datasets, implementing a full DQ framework and metadata
    governance layer.
    • Implemented an end‑to‑end data quality framework using PySpark and Airflow (daily & monthly rules, scoring,
    reporting) and configuration of Alerting system.
    • Built and automated DataHub pipelines: ingestion, transformers, lineage extraction from Airflow, Spark and SQL
    operators.
    • Defined and deployed dataset governance policies: domains, ownership, schema sensitivity tags and metadata
    standardization.
    • Implemented data anonymisation rules (year‑level masking for personal attributes).
    • Integrated CI/CD for governance and quality workflows to ensure reproducibility across environments.

  • Publicis Media
    DATA ENGiNEER
    septembre 2022 - décembre 2024 (2 ans et 3 mois)
    France
    General context Manage the architecture, governance, and quality of media data. Organize, consolidate, and monitor the ingestion system of various data sources.
    • Development of Data models using SQL and make data available for Data Analysts/Dashboarding team in Big‑ query and scheduling the stored procedures for incremental loads.
    • Data Platform : Automating GCP ressources provisionning and Data pipelines of multi‑sources using Terraform with CI CD pipeline
    • Create monitoring pipelines from Alerting by mail using Cloud Functions in Python, Backup and Archiving to Data Observability.
    • Automating DAG in Snowflake using tasks.
    • Developement of MLops pipeline in Snowflake.
    • Development of attribution and contribution models using Shapley, logistic regression and markov chain.
    • Calling APIs such as Bings API and Facebook API for a specefic business report, in instance the auction insight report and scheduling an alerting for the nomenclature
    • R & D project for cookieless attribution and contribution using econometrics namely ARDL model.
    • Keywords Python, Bigquery, SQL, Snowflake, GCP , Terraform, Cloud Build, cloud scheduler, workflow, cloud functions, Adverity

    Freelance, Medix & Talentoday Remote, Client en US & France
  • Freelance, DNAAfrica
    DATA ENGiNEER/ML ENGiNEER/DATA SCiENTiST
    décembre 2020 - février 2021 (2 mois)
    Mallard Point Remote non-electric canoe-in only campsite, Eyota, MN, USA
    General context Development of a Lead Generation application using NLP and ML algorithms for the client.
    • Implementation of the backend architecture: scraping, storage in a Firebase NoSQL database, Flask and GCP APIs.
    • Deployment of the Python script in App Engine, configuration of resources (CPU, memory, readiness, liveliness) and real‑time maintenance.
    • Automating ETL pipelines running on an App Engine instance using Flask APIs: scraping, preprocessing, scoring, storing in Firebase and daily feedback loop through Cloud Scheduler vs cron jobs.
    • Creating a bucket in Cloud Storage for serialized ML models on a daily basis.
    • User feedback loop from the interface.
    • Developed variable extraction functionsformodelsfrom Tweets/Google Alerts (country, industry, company, prod‑ uct...)
    • Select, train, and evaluate the scoring models of the scrapped news (Catboost) and the classification models of the news category (NLP with TFIDF and Bi‑LSTM attention with Pytorch).

Recommandations

Soyez le premier à recommander Dorsaf

Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.

Ces profils de freelance correspondent également à vos critères

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Formations

  • ENGiNEERiNG DEGREE iN STATiSTiCS AND INFORMATiON ANALYSiS
    Higher School of Statistics and Information Analysis, ESSAIT
    2018
    ENGiNEERiNG DEGREE iN STATiSTiCS AND INFORMATiON ANALYSiS
  • & Physics
    Preparatory Institute for Engineering Studies of El Manar
    & Physics

Compétences (19)

Catégories