You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Arnaud FrancoisAF

Arnaud Francois

Data Engineer

500 €/jour
Lyon, FR
3-7 ans

Délai de réponse moyen : 1h

À propos de Arnaud

Data Engineer - Transport & Mobility - GCP - MDS (Modern data stack)

6 years of experience in data engineering and analytics, including 2 years at WanData — a SaaS platform for local authorities and transport operators.

What I do:
- End-to-end data pipelines (raw → analytics/OBT) with the modern data stack: Airbyte · Airflow · dbt · BigQuery
- On-prem → GCP migration (Cloud Run, Functions, Pub/Sub, Dataflow/Beam)
- GDPR compliance: passenger data anonymization
- FastAPI services and dashboards for business teams
Tech stack:
GCP (Cloud Run, BigQuery, Pub/Sub, GCS, IAM, WIF) · dbt · Airflow · Airbyte · Terraform · Terragrunt · Github actions · Streamlit · Looker · FastAPI · Snowflake

What sets me apart:
- Migration to Modern Data Stack
- Google Professional Data Engineer + Astronomer Airflow certified
- Global Top 100 on the DataTalksClub Data Engineering Zoomcamp (Bruin/dlt + Redpanda + Streamlit)
  • Français

    Bilingue ou natif

  • Anglais

    Capacité professionnelle complète

  • Russe

    Notions

  • Espagnol

    Notions

Accepte de travailler sur site
Lyon (jusqu’à 50 km)

Expériences

  • Ubitransport
    Data Engineer / Data Analyst
    septembre 2021 - Aujourd'hui (4 ans et 9 mois)
    Lyon, France
    WanData Project
    - Context: WanData is an innovative SaaS platform, specifically designed to simplify the process of developing a data / AI strategy for local authorities and transport operators.
    - Summary:
    - KPI development: Implementation using GCS, Pandas, dbt, and BigQuery.
    - Endpoint Exposure : Creation and serving of services via FastAPI.
    - CI/CD Pipeline Setup : CI / CD using GitHub Actions, Artifact Registry, Podman.
    - GCP Infrastructure Setup : Infrastructure design and deployment (Cloud Run, GCS, IAM permissions, Secret Manager, VPC, Terraform, Datadog, etc)
    - Tool modernization : Migration of the dependency management tool (from Poetry to uv, from pylint to ruff).
    - Participation in the modern data stack migration : Transition to a modern architecture (Airbyte, dbt, shifting indicators (KPIs) to OBTs (One Big Table)).
    - PoC Gemini LLM agent using function calling. : Gemini / BigQuery / Jupyter / Streamlit

    DataViz Project
    - Summary: Analyzed trip launch rate and wild stops using data visualization.
    - Technologies Used: GCS (Datalake), Airflow (Pipelines), BigQuery (DW), Tableau online

    Modernization of Exports
    - Context: Modernization of an export system aimed at reducing resource costs, preventing queue blockages, and minimizing support tickets. The exports consist of Excel or CSV files containing data on subjects such as trip histories, sales histories and so on. The system operates in a Serverless environment, with resources provisioned at the time of export.
    - Technologies Used: Cloud Functions / Google Cloud Storage (GCS) / Python / Terraform / Cloud Build

    API calls migration (modernization)
    - Context: Transfer API calls data from the relational database to Pub/Sub and then an analytical database (BigQuery).
    - Technologies Used: Pub/Sub, Dataflow (Apache beam), BigQuery, Java

    Anonymization of usages
    - Context: Anonymization of passenger usages to be GDPR Compliant.
    - Technologies Used: Pandas, Airflow (Composer), Postgres
    bruin DBT Big Query Terraform Google Cloud Platform (GCP)
  • Ubitransport
    Exp. Innovation ALT
    septembre 2020 - janvier 2022 (1 an et 4 mois)
    Mâcon, France
    Next Trip Prediction Project
    - Context: Designed and implemented predictive models for trip planning.
    - Technologies Used: AI Platform, Google Cloud Storage (GCS), CSV, SKlearn, Weka, Jupyter Notebook. Usage Projection Project
    - Context: Created usage projection models for optimizing transportation operations.
    - Technologies Used: AI Platform, TensorFlow, LSTM, AutoKeras, Facebook Prophet (ARIMA/SARIMA). School Transportation Data Analysis
    - Context: Collaborated on a study (ANATEEP) involving the consolidation of data from approximately twenty different transport networks. The objective was to analyze and provide valuable insights into school transport usage, focusing on indicators such as punctuality, service duration, and user-centric school runs.
    - Technologies Used: Metabase, Snowflake (Data Warehouse), Talend (ETL).

Recommandations

Soyez le premier à recommander Arnaud

Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.

Ces profils de freelance correspondent également à vos critères

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Formations

  • Master's degree, Databases and A.I
    Université de Bourgogne
    2021
    Master's degree, Databases and A.I
  • Master's degree
    NORWEGIAN UNIVERSITY OF SCIENCE AND TECHNOLOGY (NTNU)
    2020
    Master's degree

Certifications

Compétences

Catégories