AI/ML Engineer • Agentic Systems Specialist

Annas Mustafa

Building intelligent agentic systems and production-ready RAG pipelines with 3+ years of experience.

About Me
AI Engineer & Architect

I'm an AI/ML Engineer with over 3 years of experience specializing in building intelligent, agentic systems and production-ready RAG pipelines. I am currently pursuing my Master's in Artificial Intelligence at BTU Cottbus-Senftenberg, Germany.

My expertise lies in developing scalable AI solutions that connect advanced research with real-world applications. I'm passionate about creating autonomous AI systems that can think, reason, and act intelligently while maintaining reliability.

3+ Years Experience
20+ Pipelines Developed
70% Workload Reduction
3.4M+ Records Processed

Featured Projects

Production-Scale Text-to-SQL Agent

Advanced agentic pipeline translating natural language to SQL queries over large databases (350+ columns, 7M+ rows)

90% accuracy with reflexive loops and memory
LangGraph, OpenAI, FastAPI, SQL, Arize Phoenix
Real-Time Object Detection & Tracking

Production-grade YOLOv8 pipeline with 12 custom classes for real-time video analytics

92.7% mAP@0.5, 18ms latency per frame
YOLOv8, DeepSORT, PyTorch, OpenCV
LLMOps RAG Evaluation Framework

End-to-end framework for managing, evaluating, and tracing RAG pipelines using modern LLMOps practices

Automated testing with model versioning
LangChain, Arize Phoenix, Python, MLOps
AI Chef Assistant (HomeChef)

AI-powered meal planning with ingredient management, recipe generation, and grocery list automation

Intelligent recipe recommendations
Python, OpenAI, FastAPI, LLM
Medical Image Classification

CNN and ResNet-based medical image classification with transfer learning analysis

Transfer learning comparison study
PyTorch, CNN, ResNet, Transfer Learning
Intelligent Document Q&A System

RAG-based document analysis with multi-modal support (PDF, DOCX, images) and citation tracking

Hybrid retrieval with source highlighting
LangChain, FAISS, Pinecone, FastAPI, OCR
High-Accuracy Network Intrusion Detection (NIDS)

Developed and benchmarked machine learning models for binary and multi-class network intrusion detection (attack classes such as DoS and Probe), classifying network traffic as normal or anomalous.

Comparative study across multiple models, with feature engineering and model tuning aimed at high detection accuracy.
Python, Scikit-learn, Ensemble Methods, BiLSTM/Autoencoders (DL), Feature Engineering, NSL-KDD
Arabic NLP: Sentiment Analysis & NER

NLP project on Arabic social media text, covering sentiment analysis and named entity recognition (NER) on COVID-19 tweets.

Comparative performance evaluation across 9+ models, contrasting language models (GPT-4, AraBERT) with traditional ML (Random Forest).
Python, Hugging Face (Transformers), LLMs (GPT-4, AraBERT), NLTK, Scikit-learn, Sentiment Analysis, NER
Algorithmic Bias Detection and Mitigation (AIF360)

Analyzed and mitigated fairness concerns in predictive models (e.g., Student Performance/Loan Eligibility) across protected attributes like gender and race.

Used the AI Fairness 360 (AIF360) toolkit to detect bias and apply mitigation strategies (e.g., Reweighing, CDA) for fairer model outcomes.
Python, Machine Learning, AIF360, Fairness Metrics, Data Preprocessing, Ethical AI
Real-Time Voice AI Assistant

End-to-end voice agent with STT, LLM reasoning, and natural TTS for multi-turn conversations

92% transcription accuracy
Whisper, OpenAI, ElevenLabs, WebSockets
Automated MLOps Pipeline

Production ML pipeline with versioning, A/B testing, drift detection, and automated retraining

Cost optimization with caching strategies using Redis
MLflow, W&B, DVC, Docker, GitHub Actions

Work Experience

Stixor

AI/ML Developer

Feb 2025 – Sep 2025

📍 Islamabad, Pakistan

  • Led design and integration of agentic AI architectures for contract automation
  • Built RAG-based legal chatbot with model versioning and evaluation pipelines
  • Contributed to legal AI tech startup that secured $1M+ seed funding
  • Applied MLOps practices using MLflow and W&B for model tracking and deployment
  • Designed n8n workflows to automate document processing and client onboarding

Niblon

AI Engineer

Mar 2024 – Jan 2025

📍 Remote

  • Designed and deployed 10+ multi-agent systems with adaptive memory
  • Created ReAct-based AI agents using LangGraph for contextual orchestration
  • Built LLM-as-Judge pipelines improving accuracy by 25%
  • Designed custom evaluation frameworks for RAG systems enhancing reliability
  • Implemented reranking, hybrid retrieval, and caching strategies

Developers Den LLC

Associate ML Developer

Dec 2022 – Jan 2024

📍 Remote

  • Architected hybrid RAG systems with 35% improvement in answer relevance
  • Integrated Zapier workflows reducing manual task processing time by 60%
  • Developed a voice-enabled AI agent (WhisperAI) for processing spoken interactions
  • Optimized embedding generation reducing API costs by 45%

Technical Skills

LLM Ecosystems

OpenAI, Gemini, Claude, Azure AI, LangChain, LlamaIndex, LangGraph, CrewAI

Programming & Backend

Python, FastAPI (Sync/Async), SQL, C++, TypeScript, JavaScript, Next.js

RAG Engineering

FAISS, Pinecone, Azure Search, Embeddings, Chunking, Hybrid Retrieval (BM25), Indexing, Re-ranking, Caching, Streaming

Agentic AI & Tool Use

Reflexive Reasoning, Function Calling, Workflow Orchestration, A2A, MCP, n8n, Zapier

MLOps & DevOps

Docker, Git/GitHub Actions (CI/CD), GCP, AWS, Azure

Other Tools

n8n, MongoDB, PostgreSQL, Streamlit, LoRA, QLoRA, PEFT, W&B, Arize Phoenix, MLflow, Custom Evaluation Pipelines, Model Versioning & Governance

Certifications

Professional Certifications

  • DeepLearning.AI - LangChain for LLM Application Development

    Advanced LLM application architecture

  • DeepLearning.AI - Generative AI with LLMs

    Foundation for modern LLM techniques

Languages

English

C1 - Fluent

Urdu

Native

German

A2 - Actively Improving

Get In Touch

Location

Berlin, Germany
