
Zakeer Shaik

Software Engineer | AI + Data

2+ years bridging scalable Azure data pipelines and intelligent agentic systems — RAG, LLM evaluation, and production-grade AI.

Bangalore, India


languages: Python · Advanced SQL · PySpark · Pandas · NumPy
databases: SQL Server · MySQL · Snowflake · Synapse Analytics
azure_cloud: Data Factory · Databricks · ADLS Gen2 · Microsoft Fabric · Delta Lake · OneLake · Lakehouse
ai_agentic: Agentic AI · RAG · Prompt Engineering · RLHF · Vector DBs · LangChain · LangGraph · CrewAI · Voice Agents
devops_tools: Docker · Git · CI/CD · Flask · FastAPI · Hugging Face · Streamlit · Gradio · Azure DevOps · Jira


Software Engineer

Jan 2024 — Present
VisionQuest Solutions · Client: Wipro
  • Monitored ETL production pipelines in Azure Data Factory, maintaining ~99% successful daily load completion within SLA.
  • Refactored legacy SQL Server stored procedures into PySpark on Databricks, improving performance by ~25–30%.
  • Built and tested new ETL pipelines in Microsoft Fabric, ingesting legacy sources into Fabric Lakehouse and OneLake.
  • Built a Python utility to archive inactive ADLS Gen2 data to cold storage, cutting storage costs by ~15–20%.
  • Implemented logging, alerting, and data quality validations to improve early failure detection.

Jr. Data Scientist (Freelance)

Aug 2023 — Dec 2023
Outlier AI · Client: Scale AI
  • Wrote 400+ Python & SQL solutions to train generative AI models, with a focus on correct syntax and logic.
  • Designed complex prompts to test model handling of database queries and edge cases.
  • Ran A/B comparisons of model outputs and flagged unsafe responses to reduce hallucinations.

Data Science Intern

Feb 2023 — Aug 2023
AIVariant
  • Led a customer segmentation project using behavioural and demographic data.
  • Applied K-Means & Agglomerative clustering; built a KNN classifier reaching ~85% accuracy.


RAG & Retrieval Engineering

Event-Driven Code RAG Slack Agent

  • Built a code-aware RAG agent answering natural-language questions about a GitHub repo in Slack via hybrid retrieval — dense Qdrant + TF-IDF BM25 fused with RRF, then reranked — returning file/line citations.
  • A GitHub webhook (HMAC-SHA256 verified) auto-rebuilds the vector index on every push so answers always reflect current code; measured hit rate 1.00 and MRR 0.955.
stack: Python · Qdrant · Jina embeddings · tree-sitter · scikit-learn · Slack Bolt · Flask · Groq · Docker
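
Sketch: the two mechanisms named above, reciprocal rank fusion (RRF) and HMAC-SHA256 webhook verification, can be illustrated in a few lines of standard-library Python. This is not the repo's code. The function names, the `k=60` constant, and the sample file/line IDs are assumptions made for illustration. The `sha256=` prefix matches the format GitHub uses in its `X-Hub-Signature-256` header.

```python
import hashlib
import hmac
from collections import defaultdict


def rrf_fuse(rankings, k=60):
    """Fuse ranked lists of document IDs with Reciprocal Rank Fusion.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is a common default and an assumption here.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


def verify_github_signature(secret: bytes, body: bytes, header: str) -> bool:
    """Check an X-Hub-Signature-256 header value against the raw request body."""
    expected = "sha256=" + hmac.new(secret, body, hashlib.sha256).hexdigest()
    # compare_digest runs in constant time, which avoids timing side channels.
    return hmac.compare_digest(expected, header)


# Hypothetical retrieval results (file:line IDs), best match first.
dense_hits = ["auth.py:10", "db.py:42", "utils.py:7"]
lexical_hits = ["db.py:42", "routes.py:3", "auth.py:10"]
fused = rrf_fuse([dense_hits, lexical_hits])
print(fused)
```

A chunk that ranks well in both lists (here `db.py:42`) rises to the top even though neither retriever's raw scores are compared directly, which is the main reason to fuse by rank.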

★ Winner · Kong API Summit 2025

Customer Support Agent with AI Gateway

  • Architected a cost-optimised AI support system with dynamic query routing via Kong AI Gateway — simple queries to Llama-3.3-70b, complex ones to GPT-OSS-120b.
  • Added real-time sentiment-based CRM escalation, hitting <$0.10 per query and <2s response time.
stack: Python · FastAPI · Streamlit · Kong Gateway · Groq · ChromaDB · LangChain · Docker
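
Sketch: one way to picture the routing decision is a small classifier that picks a model per query. The project does this through Kong AI Gateway, whose configuration is not shown here, so the heuristic below (word count plus a keyword list) and the names `choose_model` and `COMPLEX_HINTS` are purely hypothetical stand-ins. Only the two model names come from the description above.

```python
SIMPLE_MODEL = "Llama-3.3-70b"   # named in the project description
COMPLEX_MODEL = "GPT-OSS-120b"   # named in the project description

# Hypothetical signals of a complex query; the real routing rules may differ.
COMPLEX_HINTS = ("refund", "legal", "integrate", "error", "why")


def choose_model(query: str, max_simple_words: int = 12) -> str:
    """Send short queries with no complexity hints to the smaller model."""
    # Lowercase and strip trailing punctuation so "error?" matches "error".
    words = [w.strip(".,?!") for w in query.lower().split()]
    has_hint = any(hint in words for hint in COMPLEX_HINTS)
    if len(words) > max_simple_words or has_hint:
        return COMPLEX_MODEL
    return SIMPLE_MODEL


print(choose_model("What are your opening hours?"))
print(choose_model("Why does my API key return an error after I rotate it?"))
```

Keeping the decision in one pure function makes the cost trade-off easy to test: the threshold and hint list can be tuned against logged queries without touching the serving path.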

Agentic DevSecOps Auditor

Elastic MCP PR Reviewer

  • Built an autonomous agent that vectorised 5,000 historical PRs for kNN semantic search to detect recurring security vulnerabilities in live PRs.
  • Used a GitHub MCP server to read live PR diffs and inject secure code fixes directly into Pull Requests without human intervention.
stack: Python · Elasticsearch · SentenceTransformers · Elastic Agent Builder · MCP · Node.js · GitHub API
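
Sketch: the kNN semantic search idea, reduced to a brute-force in-memory version. The project runs this search in Elasticsearch over real embeddings. The toy 3-dimensional vectors, the PR labels, and the function names below are hypothetical and only illustrate how cosine similarity ranks historical PRs against a new diff.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def knn(query_vec, index, k=2):
    """Return the k (pr_id, similarity) pairs most similar to query_vec."""
    scored = [(pr_id, cosine(query_vec, vec)) for pr_id, vec in index.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]


# Toy embeddings; real ones would come from a sentence-embedding model.
historical_prs = {
    "pr-101 (SQL injection fix)": [0.9, 0.1, 0.0],
    "pr-102 (README typo)": [0.0, 0.2, 0.9],
    "pr-103 (unsanitised query)": [0.8, 0.3, 0.1],
}
neighbours = knn([1.0, 0.2, 0.0], historical_prs)
for pr_id, score in neighbours:
    print(f"{pr_id}: {score:.3f}")
```

Brute force is linear in the number of stored vectors, which is why a dedicated vector index becomes worthwhile at thousands of PRs.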

LLM Agent Observability SDK

AgentLens

  • Built an open-source SDK that auto-instruments CrewAI and LangGraph agents with one line of code, streaming OpenTelemetry spans to Splunk.
  • Added two-layer prompt-injection and token-anomaly detection using classical ML, running entirely on-prem with no external LLM calls.
stack: Python · OpenTelemetry · OpenInference · Splunk AI Toolkit · CrewAI · LangGraph · Groq
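
Sketch: token-anomaly detection without any LLM call can be as simple as a z-score check over recent token counts. This is a plain statistical stand-in, not AgentLens's actual detector. The function name, the threshold of 3 standard deviations, and the sample counts are assumptions.

```python
import statistics


def is_token_anomaly(history, current, threshold=3.0):
    """Flag `current` if it lies more than `threshold` sample standard
    deviations away from the mean of `history`."""
    if len(history) < 2:
        return False  # not enough data to estimate spread
    mean = statistics.fmean(history)
    stdev = statistics.stdev(history)
    if stdev == 0:
        return current != mean
    return abs(current - mean) / stdev > threshold


recent_token_counts = [510, 495, 530, 480, 505]  # hypothetical per-call usage
print(is_token_anomaly(recent_token_counts, 520))
print(is_token_anomaly(recent_token_counts, 4000))
```

A check like this runs locally in microseconds, so it can sit inline on every span without adding a network hop.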


## Education
M.C.A — Computer Applications
Annamacharya Institute of Tech & Science · 2022
B.Sc — Computer Science & Statistics
Yogi Vemana University · 2020
## Certifications
HackerRank — Software Engineer Certificate
Advanced NLP with Python for Machine Learning
Excelr — Data Science Certification
## Achievements & Community
  • Winner: Best Agentic Project, Kong API Summit 2025
  • 1st Place: Data Engineering Hackathon Leaderboard
  • Co-Organizer: Kong API Events, Hyderabad
  • Technical Volunteer: DevRel Squad Hyderabad (LangDB.AI, FireDucks, Firebolt)
  • Volunteer: GitTogether GitHub Copilot Event


Get in Touch

Open to Data Engineer & AI Engineer roles.