You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Hussein AwalaHA

Hussein Awala

Senior Data Engineer

1 000 €/jour
Châtillon, FR
3-7 ans

Délai de réponse moyen : 1h

À propos de Hussein

I'm a Senior Data Engineer in the Ad Network team at Voodoo and a Committer & PMC member at Apache Airflow.

I have worked on various types of projects, including:
- Building GDPR-compliant Lakehouses and analytics platforms using Apache Iceberg or Apache Hudi.
- Developing low-latency stream applications (stateful and stateless).
- Creating ML platforms on top of Kubernetes clusters, and serving ML models with FastAPI and GRPC.
- Setting up and deploying Spark On Kubernetes clusters with hundreds of jobs and thousands of daily job runs.
  • Français

    Bilingue ou natif

  • Arabe

    Bilingue ou natif

  • Anglais

    Capacité professionnelle complète

Accepte de travailler sur site
Châtillon (jusqu’à 50 km)

Expériences

  • Apache Airflow
    Committer & PMC member
    HIGH TECH
    avril 2023 - Aujourd'hui (3 ans et 2 mois)
    Paris, France
    - Active contributor; fix the reported bugs, introduce new features and improve the code quality and its performance.
    - Join the discussions and participate in deciding the future of the project.
    - Test and vote on the different releases, mentor the new contributors and help Airflow users to solve their problems.
    Airflow Python Kubernetes AWS Helm flask Github Actions GCP Vault SQL
  • Leboncoin
    Senior Data Engineer
    E-COMMERCE
    octobre 2021 - Aujourd'hui (4 ans et 7 mois)
    Paris, France
    - Developing low-latency stream applications using Java, Spring Cloud Stream, KStream, Kafka and K8S for fraud detection.
    - Develop a scalable in-house feature store to ingest the aggregate and ingest the company events (>1B/day) in a KV store (DynamoDB) using Kafka, Kstream, FastAPI, AsyncIO, Avro and Airflow.
    - Designing a new Lakehouse architecture to apply GDPR on the legacy datalake and optimize the data processing by optimizing the data files (compaction, z-order, indexing, ...) using Java Spark, HUDI, Airflow, Kafka, S3, Avro, Glue and K8S.
    - Improve the data platform: migrate Airflow from LocalExecutor to Celery, migrate spark jobs from EMR (YARN) to K8S, migrate Airflow operators to the new deferrable (async) mode to reduce the infra cost, migrate Spark to jdk11 after patching the Hive which doesn’t work with jdk11.
    - As a Sr. DE, I give data/infra courses (Airflow, Spark, Terraform, K8S, ...), I help data teams to design their projects and overcome challenges, and I lead the contribution to open source data projects.
    Airflow Spark Python Java Kubernetes AWS Kafka Hudi parquet MLflow Github Actions Terraform avro FastAPI
  • Data4Risk
    Data Engineer & Head of Data
    HIGH TECH
    février 2019 - octobre 2021 (2 ans et 8 mois)
    Paris, France
    - Designing and implementing stream applications and batch ETL using Docker, Pyspark, Kafka, Argo Workflows, mongoDB, MySQL, MinIO and Kubernetes on GCP and OVH cloud to collect and process weather data.
    - Designing and implementing a low-latency Lakehouse using PySpark streaming, Delta Lake, Hive and K8S, to support ACID transactions, update and Delete operations and time travel on the big data tables.
    - Leading the DS team: desing and train ML models and pipelines using Keras, Tensorlow, MLlib and other libraries to classify and process satellite images and deploy these models using MLflow and TF serving.
    - Designing and implementing a datalake for a financial data platform: stream Spark on k8s jobs to ingest Kafka events in the parquet datalake, and batch Spark on k8s ETL pipelines scheduled by Argo
    Python Spark kafak Delta Lake Kubernetes Argo Workflows MongoDB GCP OVH MLflow Gitlab CI Argo CD Terraform

Recommandations

Soyez le premier à recommander Hussein

Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.

Ces profils de freelance correspondent également à vos critères

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Formations

  • Master en Data Science
    Grenoble INP ENSIMAG
    2019
  • Licence en Informatique
    Université libanaise - Faculté des sciences
    2017

Compétences

Catégories