You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Guillaume GeoffroyGG

Guillaume Geoffroy

Data Engineer | Databricks | PySpark | CI/CD Azure

650 €/jour
Grenoble, FR
3-7 ans

Délai de réponse moyen : 1h

À propos de Guillaume

Data Engineer PySpark & Cloud | Scalable Data Pipelines

I help companies design, build, and optimize robust and scalable data pipelines in cloud environments.

Specialized in PySpark, Databricks, and Airflow, I support end-to-end data platform projects from architecture to production deployment.

Core expertise:
ETL/ELT pipelines (PySpark, Databricks, Airflow)
Cloud data platforms (Azure, AWS, GCP)
Data lakehouse architecture (Delta Lake)
CI/CD, Terraform, and DevOps practices
Data quality & pipeline monitoring
Spark performance optimization
GitHub Copilot for AI support



Experience:
5 years working on large-scale data projects in energy, aerospace, and industrial environments.

Focus:
Reliable, scalable, production-ready data systems with strong engineering standards.

Available remotely or in Switzerland / France.
  • Français

    Bilingue ou natif

  • Anglais

    Capacité professionnelle complète

  • Espagnol

    Capacité professionnelle limitée

Accepte de travailler sur site
Grenoble (jusqu’à 10 km), Lyon (jusqu’à 10 km), Paris (jusqu’à 10 km), Pau (jusqu’à 10 km)

Expériences

  • Engie - V.I.E
    Data Engineer
    ENERGIE
    juin 2025 - Aujourd'hui (1 an)
    Brussels, Belgium
    Designed and managed data pipelines for energy consumption and billing data.

    Developed a Python library to structure ETL workflows, including complex PySpark transformations on time-series data. Worked on VSCode with Github Copilot.

    Industrialized and managed Databricks jobs, with scheduling through Apache Airflow and data storage on S3 using Delta Lake format.

    Orchestrated CI/CD with Azure DevOps and IaC deployments with Terraform across Databricks environments (dev, preprod, prod).

    Integrated GitHub Copilot into the development workflow for code generation, refactoring, and pull request review support.

    Built a Data Quality framework within the library, implementing checks for duplicates, overlaps, and completeness. Used Docker Image for unit testing / functional testing.

    Performed data analysis and developed dashboards with Databricks.
    Databricks Python PySpark Azure DevOps Terraform
  • Terra Systema
    Data Scientist
    AGROALIMENTAIRE
    mai 2024 - juillet 2024 (2 mois)
    Molsheim, France
    Analyzed weather sensor data to anticipate late frost events.
    Led the project autonomously, coordinating with multiple stakeholders.

    Analyzed time-series data from weather sensors and developed solutions on Linux using Python (Pandas, Matplotlib, TensorFlow) and MySQL.

    Designed a Proof of Concept and built a Deep Learning model (CNN/LSTM) to
    estimate dew point at parcel level.
    TensorFlow Jupyter Notebook Python Linux autonomie
  • CS Group
    Data Engineer
    AÉRONAUTIQUE & AÉROSPATIALE
    juin 2021 - avril 2023 (1 an et 10 mois)
    Toulouse, France
    Predicted aircraft failures for Airbus and airline operators.
    Filtered, analyzed, and visualized multi-source aircraft sensor data, including model development and alert monitoring.

    Developed a Python library dedicated to model development, built on complex
    PySpark transformations.

    Industrialized Big Data models using internal DevOps tools within a continuous
    integration framework.

    Used the internal CodeWorkbook ETL for model prototyping and validation.
    DevOps Python Data Pipeline ETL Big Data

Recommandations

Soyez le premier à recommander Guillaume

Contribuez à la réussite de ce freelance en partageant votre expérience de collaboration avec lui.

Ces profils de freelance correspondent également à vos critères

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Formations

  • Final-year exchange semester
    Université Laval
    2020
    Specialization in Python Advanced and Machine Learning
  • Engineering degree
    SUPMICROTECH-ENSMM
    2020
    Computer Science

Compétences

Catégories