GuirassyFode/README.md

Hi there, I'm Fode Guirassy 👋

🚀 Data & AI Engineer | Azure DP-203 In Progress | Building Scalable Data Pipelines

👨‍💻 About Me

I'm a Data & AI Engineer passionate about designing and building end-to-end data solutions — from raw ingestion to AI-powered insights. I specialize in cloud-native data architectures, real-time streaming pipelines, and machine learning integrations using modern data stack technologies.

  • 🏗️ Building production-grade ETL/ELT pipelines on Azure, AWS & GCP
  • 🤖 Developing AI/ML-powered data workflows & RAG (Retrieval-Augmented Generation) systems
  • Engineering real-time streaming solutions with Apache Spark, Kafka & Flink
  • 🗄️ Designing dimensional data models (Star/Snowflake schema) for analytics
  • 📊 Optimizing data platforms for scalability, reliability, and performance

🛠️ Tech Stack

☁️ Cloud Platforms

Azure, AWS, GCP

🔧 Data Engineering

Apache Spark, Apache Kafka, Apache Airflow, dbt

💾 Databases & Storage

PostgreSQL, MySQL, Cassandra, Snowflake

🤖 AI / ML

Python, LangChain, OpenAI, Scikit-learn

🐳 DevOps & Infrastructure

Docker, Kubernetes, Git


🎯 Certifications & Learning

  • 📚 Microsoft Azure Data Engineer Associate (DP-203): In Progress

📌 Featured Projects

| Project | Description | Tech Stack |
| --- | --- | --- |
| 🔥 Apache Spark Portfolio | End-to-end Spark data engineering solutions with local vs. global sort optimizations | PySpark, Scala |
| ☁️ Azure Data Engineer (DP-203) | Azure-based ETL/ELT pipelines (preparation for the DP-203 certification) | Azure Data Factory, Synapse, ADLS |
| 🤖 AI Chat RAG Workflow | Retrieval-Augmented Generation pipeline for intelligent document Q&A | Python, LangChain, OpenAI |
| 📰 News Trend Data Pipeline | Real-time news trend ingestion and analytics pipeline | Python, Airflow, Kafka |
| 🗄️ Dimensional Modeling - NBA | Star schema dimensional model for NBA analytics | SQL, PostgreSQL |
| ☸️ Kubernetes Data Engineer | Containerized data pipeline deployment with Kubernetes | Kubernetes, Docker, Python |
| 📊 SQL Deep Dive | Advanced SQL techniques: window functions, CTEs, optimization | SQL, Jupyter Notebook |


📫 Let's Connect

I'm always open to discussing data engineering, AI/ML projects, cloud architecture, or opportunities in consulting and technology.

LinkedIn Email


"Turning raw data into actionable intelligence — one pipeline at a time."

Pinned

  1. azure-dp-203-data-engineer-azure (Public)

    Azure DP-203 Data Engineer certification prep: Azure Data Factory, Synapse Analytics, ADLS Gen2, Stream Analytics, Databricks & Delta Lake pipelines

    1

  2. kubernetesDataEngineer (Public)

    Kubernetes-orchestrated data engineering platform: containerized ETL pipelines, Helm charts, pod autoscaling & cloud-native data workflow deployment

    Python 1

  3. SQL-Deep-Dive- (Public)

    Advanced SQL mastery: window functions, CTEs, recursive queries, query optimization, indexing strategies & analytical patterns for data engineering interviews

    Jupyter Notebook 1

  4. Apache-Spark-Data-Engineering-Portfolio (Public)

    Production-grade PySpark data engineering solutions: ETL pipelines, sorting optimization, Spark SQL, Azure ADLS integration & dimensional modeling

  5. my-ai-chat-rag-workflow (Public)

    RAG-powered AI chat workflow using LangChain & OpenAI for intelligent document Q&A — retrieval-augmented generation pipeline with vector embeddings

  6. News_trend_data_pipeline (Public)

    End-to-end containerized data pipeline for real-time news trend ingestion, transformation, data quality checks & alerting using Docker and Apache Airflow