umerkhan95/README.md

Umer Khan

I build systems that sit between AI and the real world.

Most of my work starts with one question: what happens after the model gives you an answer? That space between a language model's output and something a business can actually use is where I spend my time. I write the infrastructure, the pipelines, the agents, and the glue code that turns raw intelligence into reliable software.

I work across the full stack. Python and FastAPI for the backend. Next.js and React on the frontend. Rust when performance is the priority. AWS when it needs to scale. I reach for whatever gets the job done well.

Based in Berlin. AWS Certified Cloud Solutions Architect. Building at the intersection of AI, data, and cloud infrastructure.

Where I Have Worked

Bloomers Berlin: Founding AI Engineer. Built AI-powered e-commerce systems and GDPR-compliant RAG pipelines from the ground up.

Comparado (Idealo Group): Data Engineer. Standardized ETL pipelines across teams, migrated business intelligence infrastructure to AWS, and ensured DSA and DMA compliance at scale.

CorAI (Bernstein Analytics): Cloud Architect. Designed and ran a system that processed over one million news articles daily with NLP categorization and sentiment analysis.

Teradata: Data Engineer. Built data pipelines for major telecom clients including Telenor Pakistan.

What I Build

AI Agents and Infrastructure: I build multi-agent systems that orchestrate LLMs for real tasks such as code validation, semantic search with vector databases, content generation, and automated quality checks. My work with LangGraph, CrewAI, and custom agent frameworks focuses on making AI reliable enough for production.

Cloud and Data Engineering: I design cloud-native data systems on AWS, including ETL pipelines, data warehousing, BI migrations, and processing architectures that handle millions of records. Everything is built with proper monitoring, compliance, and cost awareness.

E-Commerce Automation: I have built a range of tools for Shopify merchants, including AI chatbots for customer support, product content generators, app validation systems that check against 100+ compliance rules, and MCP servers that let agents interact with store data directly.

Healthcare Tech: Medical data is personal and complex. I have worked on health tracking platforms, blood work converters, and domain-specific language models for medical queries. Everything here is built with extra care for data privacy and accuracy.

A Few Things I Have Built

Gatekpr: A Rust-native platform that validates Shopify and WooCommerce apps using LLMs, vector search with Qdrant, and static code analysis. Deployed on Fly.io with OpenTelemetry tracing and Stripe billing.

AI Agent Management System: A full internal platform for managing fleets of AI agents. Built as a monorepo with Next.js, FastAPI, Redis, and Celery. Handles agent lifecycle, monitoring dashboards, and multi-tenant access control.

News Analysis System: An agent-to-agent architecture for journalists. Multiple AI agents work in parallel to research, verify, and synthesize news stories. Uses knowledge graphs with Neo4j and supports GDPR-compliant data handling.

Shopify MCP Server: A Model Context Protocol server that gives AI agents structured access to Shopify store data. Integrated with Crawl4AI for URL validation and SEO analysis.

Tech I Reach For

Languages: Python, Rust, TypeScript, Golang, Mojo

Agent Frameworks: LangGraph, LangChain, CrewAI, Model Context Protocol (MCP), OpenAI, Hugging Face, PyTorch

AWS: S3, EC2, RDS, SageMaker, Lambda, CloudWatch

Databases: PostgreSQL, Qdrant, Pinecone, ChromaDB, Milvus, Redis, Neo4j, Supabase

Infrastructure: FastAPI, Next.js, React, Airflow, Celery, Docker, Fly.io

Get in Touch

Website GitHub

Pinned

  1. qdrant/qdrant Public

    Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

    Rust 29.9k 2.1k

  2. crawl4ai Public

    Forked from unclecode/crawl4ai

    🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

    Python 1

  3. gatekpr-releases Public

    Release binaries for gatekpr - Shopify & WooCommerce app validator

  4. imagededup Public

    Forked from idealo/imagededup

    😎 Finding duplicate images made easy!

    Python

  5. EdgeLLM Public

    Forked from MuhammadMurtuzaZafar/ollama-api-gateway

    A secure, production-ready API gateway for Ollama with authentication, authorization, rate limiting, and comprehensive monitoring.

    Mojo

  6. shopextract Public

    Extract, compare, and monitor product data from any e-commerce store. Supports Shopify, WooCommerce, Magento, BigCommerce, Shopware, and generic stores. Zero config: just give a URL, get products.

    Python