Zakeer Shaik — Software Engineer

## Skills

languages: Python · Advanced SQL · PySpark · Pandas · NumPy

databases: SQL Server · MySQL · Snowflake · Synapse Analytics

azure_cloud: Data Factory · Databricks · ADLS Gen2 · Microsoft Fabric · Delta Lake · OneLake · Lakehouse

ai_agentic: Agentic AI · RAG · Prompt Engineering · RLHF · Vector DBs · LangChain · LangGraph · CrewAI · Voice Agents

devops_tools: Docker · Git · CI/CD · Flask · FastAPI · Hugging Face · Streamlit · Gradio · Azure DevOps · Jira

## Experience

Software Engineer

Jan 2024 — Present

VisionQuest Solutions · Client: Wipro

Monitored ETL production pipelines in Azure Data Factory, maintaining ~99% successful daily load completion within SLA.
Refactored legacy SQL Server stored procedures into PySpark on Databricks, improving performance by ~25–30%.
Built and tested new ETL pipelines in Microsoft Fabric, ingesting legacy sources into Fabric Lakehouse and OneLake.
Built a Python utility to archive inactive ADLS Gen2 data to cold storage, cutting storage costs by ~15–20%.
Implemented logging, alerting, and data quality validations to improve early failure detection.

Jr. Data Scientist (Freelance)

Aug 2023 — Dec 2023

Outlier AI · Client: Scale AI

Wrote 400+ Python & SQL solutions to train generative AI models, with a focus on correct syntax and logic.
Designed complex prompts to test model handling of database queries and edge cases.
Ran A/B comparisons of model outputs and flagged unsafe responses to reduce hallucinations.

Data Science Intern

Feb 2023 — Aug 2023

AIVariant

Led a customer segmentation project using behavioural and demographic data.
Applied K-Means & Agglomerative clustering; built a KNN classifier reaching ~85% accuracy.

## Projects

RAG & Retrieval Engineering

Event-Driven Code RAG Slack Agent

Built a code-aware RAG agent that answers natural-language questions about a GitHub repo in Slack. Hybrid retrieval fuses dense (Qdrant) and sparse (TF-IDF BM25) results with Reciprocal Rank Fusion (RRF), reranks them, and returns file/line citations.
A GitHub webhook (HMAC-SHA256 verified) auto-rebuilds the vector index on every push so answers always reflect current code; measured hit rate 1.00 and MRR 0.955.
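
The rank-fusion step in this project can be illustrated with a minimal sketch of Reciprocal Rank Fusion. The function name, the constant `k=60` (a commonly used default), and the file names are assumptions for illustration, not taken from the project's code.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into one ranked list.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is an assumed default.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results from a dense and a sparse retriever.
dense = ["auth.py", "db.py", "utils.py"]
sparse = ["db.py", "routes.py", "auth.py"]
fused = reciprocal_rank_fusion([dense, sparse])
print(fused)
```

Documents that rank well in both lists rise to the top, while documents seen by only one retriever still stay in the candidate set for the reranker.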

stack: Python · Qdrant · Jina embeddings · tree-sitter · scikit-learn · Slack Bolt · Flask · Groq · Docker
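
The webhook verification mentioned for this project can be sketched as follows. GitHub sends an `X-Hub-Signature-256` header of the form `sha256=<hexdigest>` computed over the raw request body. The function name, secret, and payload below are placeholders for illustration, not the project's actual code.

```python
import hashlib
import hmac


def is_valid_signature(secret: bytes, payload: bytes, signature_header: str) -> bool:
    """Check a GitHub-style 'sha256=<hexdigest>' signature over the raw body."""
    expected = "sha256=" + hmac.new(secret, payload, hashlib.sha256).hexdigest()
    # compare_digest runs in constant time, which avoids timing side channels.
    return hmac.compare_digest(expected.encode(), signature_header.encode())


secret = b"example-secret"            # placeholder, not a real secret
body = b'{"ref": "refs/heads/main"}'  # placeholder push payload
good = "sha256=" + hmac.new(secret, body, hashlib.sha256).hexdigest()

print(is_valid_signature(secret, body, good))         # matching signature
print(is_valid_signature(secret, body + b" ", good))  # tampered body
```

The check has to run on the raw bytes of the request body, before any JSON parsing, because re-serialising the payload can change the digest.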

★ Winner · Kong API Summit 2025

Customer Support Agent with AI Gateway

Architected a cost-optimised AI support system with dynamic query routing via Kong AI Gateway — simple queries to Llama-3.3-70b, complex ones to GPT-OSS-120b.
Added real-time sentiment-based CRM escalation, hitting <$0.10 per query and <2s response time.
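
The routing idea can be illustrated with a small sketch. In the project the routing runs through Kong AI Gateway and its actual rules are not stated here, so the word-count threshold, the keyword list, and the function name below are purely hypothetical stand-ins written in application-side Python.

```python
SIMPLE_MODEL = "Llama-3.3-70b"
COMPLEX_MODEL = "GPT-OSS-120b"

# Hypothetical signals of a complex query; the real routing rules are not stated.
COMPLEX_HINTS = ("refund", "legal", "escalate", "integration", "outage")


def choose_model(query: str, max_simple_words: int = 20) -> str:
    """Pick a model name from query length and keyword hints (assumed heuristic)."""
    text = query.lower()
    is_long = len(text.split()) > max_simple_words
    has_hint = any(hint in text for hint in COMPLEX_HINTS)
    return COMPLEX_MODEL if (is_long or has_hint) else SIMPLE_MODEL


print(choose_model("What are your opening hours?"))
print(choose_model("I need a refund for a failed integration"))
```

Sending only the harder queries to the larger model is what keeps the average cost per query low.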

stack: Python · FastAPI · Streamlit · Kong Gateway · Groq · ChromaDB · LangChain · Docker

Agentic DevSecOps Auditor

Elastic MCP PR Reviewer

Built an autonomous agent that vectorised 5,000 historical PRs for kNN semantic search to detect recurring security vulnerabilities in live PRs.
Used a GitHub MCP server to read live PR diffs and inject secure code fixes directly into Pull Requests without human intervention.
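
The kNN semantic search step can be illustrated with a toy in-memory version. The project itself uses Elasticsearch with SentenceTransformers embeddings. The three-dimensional vectors, the PR ids, and the function names below are made up for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query_vec, index, k=2):
    """Return the k (score, id) pairs most similar to query_vec."""
    scored = [(cosine(query_vec, vec), pr_id) for pr_id, vec in index.items()]
    return sorted(scored, reverse=True)[:k]


# Toy embeddings of historical PRs; ids and values are hypothetical.
index = {
    "pr-sql-injection": [0.9, 0.1, 0.0],
    "pr-hardcoded-key": [0.1, 0.9, 0.1],
    "pr-docs-typo": [0.0, 0.1, 0.9],
}
hits = top_k([0.8, 0.2, 0.0], index)
print([pr_id for _, pr_id in hits])
```

A live PR diff whose embedding lands close to a past vulnerable PR is a candidate for the same class of issue.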

stack: Python · Elasticsearch · SentenceTransformers · Elastic Agent Builder · MCP · Node.js · GitHub API

LLM Agent Observability SDK

AgentLens

Built an open-source SDK that auto-instruments CrewAI and LangGraph agents with one line of code, streaming OpenTelemetry spans to Splunk.
Added two-layer prompt-injection and token-anomaly detection using classical ML, running entirely on-prem with no external LLM calls.
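
A token-anomaly check of this kind can be sketched with a z-score threshold. This is a simple statistical stand-in chosen for illustration. The models actually used in AgentLens are not specified here, and the function name, threshold, and token counts below are assumptions.

```python
import statistics


def is_token_anomaly(history, current, threshold=3.0):
    """Flag current if it lies more than `threshold` std devs from the history mean."""
    mean = statistics.fmean(history)
    stdev = statistics.pstdev(history)
    if stdev == 0:
        # With no spread in the history, any different value counts as anomalous.
        return current != mean
    return abs(current - mean) / stdev > threshold


baseline = [510, 495, 502, 488, 505, 500]  # made-up token counts per agent call

print(is_token_anomaly(baseline, 505))   # close to the usual range
print(is_token_anomaly(baseline, 4000))  # far outside the usual range
```

A check like this needs no model calls at all, which fits the on-prem constraint described above.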

stack: Python · OpenTelemetry · OpenInference · Splunk AI Toolkit · CrewAI · LangGraph · Groq

## Education

M.C.A — Computer Applications

Annamacharya Institute of Tech & Science · 2022

B.Sc — Computer Science & Statistics

Yogi Vemana University · 2020

## Certifications

HackerRank — Software Engineer Certificate

Advanced NLP with Python for Machine Learning

Excelr — Data Science Certification

## Achievements & Community

Winner · Best Agentic Project, Kong API Summit 2025
1st Place · Data Engineering Hackathon Leaderboard
Co-Organizer · Kong API Events, Hyderabad
Technical Volunteer · DevRel Squad Hyderabad (LangDB.AI, FireDucks, Firebolt)
Volunteer · GitTogether GitHub Copilot Event
