Hello, I'm Sillah,
Machine Learning/Data Engineer
based in Oulu, Finland.

Sillah Babar

about.

I'm a Machine Learning Engineer and Data Engineer with a passion for building scalable AI systems and data pipelines. Currently pursuing my Masters in Computer Science and Engineering at the University of Oulu, Finland, with an AI major. I specialize in deploying production-ready ML systems, building data pipelines, and working with cloud technologies.

Education

Masters in Computer Science and Engineering

University of Oulu, Finland | 2025 - present

AI Major

Bachelor of Computer Science, CGPA-3.6

FAST NUCES | 2019 - 2023

Gold Medal, Rector's List (4.0 GPA), Dean's List

Experience

Machine Learning Engineer / Data Engineer

CIVIC IQ, Remote, USA | Jan 2025 – Present

Deployed Kestra as a pipeline orchestrator on Kubernetes cluster, configuring fault-tolerant data extraction jobs on pre-emptible pods and integrating Pub/Sub for asynchronous messaging. Built scalable AI pipelines using embeddings for vendor and product recommendations based on customer profiles and structured tagging. Developed a ChatGPT deep search connector to identify RFP signals and relevant documents. Led development of internal tool to generate RFP signals and automate sales outreach, integrating seamlessly with HubSpot. Created AI-driven features for leads recommendation and targeting based on client metadata. Built data pipelines using GPT-4.1, Claude, and DeepSeek to extract structured data from spend reports. Fine-tuned LLMs for improved accuracy in purchase and domain-specific data extraction tasks. Integrated CI/CD workflows with Grafana dashboards for real-time monitoring and alerting. Developed a graph-based knowledge system using embeddings for semantic chatbot document retrieval. Designed data validation and sanity checks for LLM-based data extraction workflows. Handling processing and loading of tables reaching up to 148M+ rows of structured data and creating sync pipelines for them.

Machine Learning Engineer

Maanz AI, Islamabad | July 2023 - Jan 2025

Worked extensively with Audi AG, Cariad, and AAI on self-driving car models, extracting data and creating KPIs in C++ for model evaluation. Maintained post-processing pipelines in Python for lidar-based calculations including interpolations, distance and angle estimations. Led cloud migration to AWS by setting up Kubernetes clusters, Job DSLs, seed jobs, and Jenkins for 120 Scala and C++ pipelines.

AI Software Developer

Hais.ai, Switzerland (Remote) | Jan 2024 - May 2024

Deployed chatbots using FastAPI, created RAG systems for banking and medical device regulatory compliance, and integrated MLOps practices.

skills.

Technical Skills

Python, Java, C++/C, MySQL, OpenCV, AWS, GCP, Azure, PostgreSQL, BigQuery, Kubernetes, Jenkins, Docker, Langchain, RAG, FastAPI, Flask

Soft Skills

Team Player, Strong Oral and Written Communication, Attention to Detail, Optimistic, Leadership

Other Skills

Trello, Git, Prolog, MATLAB, Academic Writing, Traditional and Digital Art

work.

Here are some of my notable projects and work experiences in machine learning, data engineering, and AI systems.

📊

Lip Reading Application in Urdu

Developed sentence and word-level models for Urdu lip reading to aid hearing-impaired people. Collected dataset of 20 speakers with 108 sentences each, achieving 63% accuracy on unseen speakers.

PythonDeep LearningComputer Vision
🛡️

Malware Classification using N-gram

Used N-grams on malware byte code to extract features for training. Selected best features using SelectKBest and tried various models to achieve accurate malware classification results.

Machine LearningFeature EngineeringPython
🤖

AI Pipeline Orchestration

Deployed Kestra as pipeline orchestrator on Kubernetes cluster, built scalable AI pipelines using embeddings for vendor recommendations, and integrated Pub/Sub for asynchronous messaging.

KubernetesGCPAI/ML
💬

RAG-based Chatbots

Created chatbots for banking sector and medical device regulatory compliance using Langchain, RAG, and deployed using FastAPI to Cloud Run and VMs.

RAGFastAPILangchain

contact.

Feel free to reach out if you'd like to collaborate, discuss opportunities, or just want to connect!