About Hussein
French
Native or bilingual
Arabic
Native or bilingual
English
Full professional proficiency
Experience
- Apache Airflow, Committer & PMC member, HIGH TECH, April 2023 - Present (3 years and 2 months), Paris, France
  - Active contributor: fix reported bugs, introduce new features, and improve code quality and performance.
  - Take part in discussions and in decisions on the future of the project.
  - Test and vote on releases, mentor new contributors, and help Airflow users solve their problems.
- Leboncoin, Senior Data Engineer, E-COMMERCE, October 2021 - Present (4 years and 7 months), Paris, France
  - Develop low-latency stream applications for fraud detection using Java, Spring Cloud Stream, KStream, Kafka and K8S.
  - Develop a scalable in-house feature store that aggregates and ingests the company events (>1B/day) into a KV store (DynamoDB) using Kafka, KStream, FastAPI, AsyncIO, Avro and Airflow.
  - Design a new Lakehouse architecture to apply GDPR to the legacy datalake and speed up data processing through data file optimization (compaction, z-order, indexing, ...) using Java Spark, Hudi, Airflow, Kafka, S3, Avro, Glue and K8S.
  - Improve the data platform: migrate Airflow from LocalExecutor to Celery, migrate Spark jobs from EMR (YARN) to K8S, migrate Airflow operators to the new deferrable (async) mode to reduce infrastructure cost, and migrate Spark to JDK 11 after patching Hive, which does not work with JDK 11.
  - As a Senior Data Engineer, give data and infrastructure courses (Airflow, Spark, Terraform, K8S, ...), help data teams design their projects and overcome challenges, and lead contributions to open source data projects.
- Data4Risk, Data Engineer & Head of Data, HIGH TECH, February 2019 - October 2021 (2 years and 8 months), Paris, France
  - Designed and implemented stream applications and batch ETL to collect and process weather data, using Docker, PySpark, Kafka, Argo Workflows, MongoDB, MySQL, MinIO and Kubernetes on GCP and OVH cloud.
  - Designed and implemented a low-latency Lakehouse using PySpark streaming, Delta Lake, Hive and K8S, supporting ACID transactions, update and delete operations, and time travel on big data tables.
  - Led the DS team: designed and trained ML models and pipelines using Keras, TensorFlow, MLlib and other libraries to classify and process satellite images, and deployed these models using MLflow and TF Serving.
  - Designed and implemented a datalake for a financial data platform: streaming Spark on K8S jobs to ingest Kafka events into the Parquet datalake, and batch Spark on K8S ETL pipelines scheduled by Argo
Education
- Master's degree in Data Science, Grenoble INP ENSIMAG, 2019
- Bachelor's degree (Licence) in Computer Science, Université libanaise - Faculté des sciences, 2017