À propos de Karis
Français
Bilingue ou natif
Anglais
Bilingue ou natif
Expériences
- Devoteam A CloudData Engineer @ Veolia Water Technologiesfévrier 2024 - Aujourd'hui (2 ans et 4 mois)Paris, FranceWithin the Datalake team, I designed and optimized several ETL pipelines on AWS architectures leveraging services such as AWS Kinesis, Glue Jobs, Lambda, SNS/SQS (fan-out pattern), LakeFormation, Step Functions, S3, and DynamoDB to process data in JSON, CSV, and Parquet formats.A major project in 2024, Daily Aggregations, involved redesigning a critical part of the Datalake, considered the source of truth. Across nine distinct architectures, I managed data ingestion, transformation, and aggregation (15-minute/hourly/daily) using Apache Spark, storing the results in partitioned tables within Glue Data Catalog and DynamoDB. These processes ran daily through various Step Functions and primarily fed our cold storage layer.As the DevOps referent, I was responsible for managing the development environment during Merge Requests on GitLab CI and actively contributed to deploying improvements using Terraform, CloudFormation, Makefile, Docker, and shell scripts integrated into GitLab CI.
- FortuneoMachine Learning Engineerseptembre 2022 - septembre 2023 (1 an)Paris, France(Data Science / Machine Learning Engineering)Development of Machine Learning models for propensity scoring and deployment (i.e., models to evaluate customer interest in various products/actions such as bank mobility, American Express cards, or savings accounts):
- Dataset creation from multiple sources (Data Warehouse, Data Mart, Open Data)
- PCA, Clustering
- R, Python, Random Forest, XGBoost, LGBM
- Descriptive & inferential statistics, KPI analysis, Data Quality (drift detection)
- KPI evaluation (Lift curve, Precision/Recall)
Pipeline & Production Deployment (Data Engineering), Data Management & MLOps (AWS Cloud environment):- ETL processes, Data organization, Machine Learning pipelines
- SQL, NoSQL (MongoDB), APIs, Unit testing
- AWS Lambda, AWS Athena, AWS Glue, AWS S3, AWS CloudFormation, AWS
- DynamoDB, AWS CodePipeline, AWS SageMaker Studio
- Python, Hadoop HDFS, Hive, GitLab, Data Warehouse, Data Mart
- CI/CD deployment on AWS (via CodePipeline)
- ALTENData Scientistnovembre 2021 - avril 2022 (5 mois)FranceSeveral Deep Learning projects for the development of a data exploitation platform. During this internship, I was able to strengthen my skills in data visualization and preprocessing, as well as in algorithm modeling, using the Python language and specifically the TensorFlow, Keras, Pandas, NumPy, Matplotlib, Seaborn, and OpenCV libraries. Among these projects were:
- COVID-19 detection from lung scans
- Classification of heartbeat signals
- Face mask detection
- Bank fraud detection
Recommandations
Soyez le premier à recommander Karis
Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.
Ces profils de freelance correspondent également à vos critères
Agatha Frydrych
Backend Java Software Engineer
4.7
(3)
2
Baptiste Duhen
Fullstack developer
4.6
(4)
5
Amed Hamou
Senior Lead Developer
4
(2)
7
Audrey Champion
Web developer
4.3
(3)
4
Formations
- Data & Artificial Intelligence, Ingénierie informatiqueEFREI Paris2023Data & Artificial Intelligence, Ingénierie informatique