Skip to content
View pranshu1921's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report pranshu1921

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pranshu1921/README.md

Hi, I'm Pranshu Kumar Premi 👋

I build production-grade ML and GenAI systems (RAG, fraud detection, MLOps) across AWS and Azure, focused on shipping reliable models that improve real business workflows.

My work spans classical ML, LLM-based systems, data pipelines, and MLOps, with experience delivering solutions in energy, telecommunications, and technology environments—optimizing for latency, quality, and operational efficiency in production.

🎯 Target roles: Data Scientist (Product), ML Engineer, AI Engineer
🎯 Currently focused on:

  • Production LLM applications (RAG, agentic systems, evaluation frameworks)
  • Scalable ML pipelines and advanced feature engineering
  • AI system design for data-intensive products

📍 Location: Raleigh, NC | Open to Remote & Hybrid roles
📌 Status: Actively seeking new opportunities


🏆 Credentials & Highlights

  • 🎓 Master’s in Data Analytics – Northeastern University

  • 💼 3.5+ years building production ML systems across consulting and product teams

  • 🚀 Deployed 9 production ML pipelines on AWS SageMaker supporting business-critical workflows

  • 🏅 Azure Data Scientist Associate

  • 🏅 HashiCorp Certified: Terraform Associate

  • 📜 IBM Data Science Professional Certificate

  • 📜 Google Data Analytics Professional Certificate


🚀 Featured Projects

These repositories reflect how I design and ship real-world AI systems, not notebook demos.


🔹 ShopAssist RAG: AI-Powered E-Commerce Assistant

Production RAG system for customer support over 150K+ product, review, and policy documents.

  • LangChain-based retrieval with vector search and source-grounded answers
  • Sub-10ms cached responses; cold queries typically under ~1.5s in local benchmarks
  • Evaluation harness with 15+ domain-specific test queries
  • Dockerized FastAPI backend with Streamlit UI for interactive exploration

Why this matters: Demonstrates how to build reliable, cost-aware RAG systems that can augment or replace human support workflows while remaining debuggable and transparent.

🔗 View Repository: https://github.com/pranshu1921/shopassist-rag
(Includes architecture diagram and reproducible benchmarks.)


🔹 Agentic RAG Document Search Platform

Enterprise-style RAG system using agentic reasoning over custom knowledge bases.

  • LangGraph orchestration for multi-step reasoning and tool selection
  • FAISS vector store for efficient semantic search
  • ReAct-style agent with deterministic source citation
  • Modular ingestion for URLs and local documents (PDF/TXT/DOCX)

Why this matters: Mirrors how modern AI teams build trustworthy internal copilots and knowledge systems that can be inspected, debugged, and extended.

🔗 View Repository: https://github.com/pranshu1921/Agentic-RAG-Document-Search-System
(Includes architecture overview and evaluation hooks.)


🔹 Machine Learning Production Pipeline

End-to-end ML system emphasizing maintainability and production correctness.

  • Data ingestion → feature engineering → training → evaluation → inference
  • Clear separation of data, training, and serving layers
  • Reproducible experiments with versioning and automated tests
  • Dockerized inference service with CI/CD-ready structure

Why this matters: Shows how to take a model from notebook to production in a way teams can operate, extend, and trust.

🔗 View Repository: https://github.com/pranshu1921/ml-production-pipeline


🔹 Applied ML Case Studies: Churn & Fraud Detection

Business-first ML case studies focused on decisions, not just accuracy.

  • Churn modeling with cost-benefit thresholding and retention strategies
  • Fraud detection under class imbalance with precision-recall tradeoffs
  • End-to-end pipelines from raw data to actionable recommendations

Why this matters: Demonstrates how to translate ambiguous business problems into ML systems that drive real decisions.

🔗 View Repository: https://github.com/pranshu1921/applied-ml-case-studies


🧠 What I Work On

Applied Machine Learning
Designing predictive models and experiments that solve concrete business problems, from churn and fraud to operational forecasting.

Generative AI & LLM Systems
Building production RAG and agentic workflows with an emphasis on evaluation, reliability, and source-grounded answers.

Production ML Systems
Taking models from notebook to production through deployment, monitoring, CI/CD, and performance tuning.

Data Engineering for ML
Designing SQL-first pipelines, feature stores, and data quality checks that make ML systems reliable at scale.

I gravitate toward work where models become real systems that teams can trust and operate in production.


🛠️ Tech Stack

ML & AI (Primary Stack)

  • Classical ML: Scikit-learn (custom transformers, pipeline optimization), XGBoost, LightGBM
  • Deep Learning: PyTorch (NLP, neural models), TensorFlow
  • LLM Systems: LangChain, LangGraph, OpenAI API, prompt engineering
  • Vector Search: FAISS, ChromaDB, embeddings (OpenAI, HuggingFace)
  • Evaluation: Custom metrics, RAGAS framework, A/B testing

Data & Analytics

  • Languages: Python, SQL (complex queries, optimization), R
  • Libraries: Pandas, NumPy, Polars
  • Feature Engineering: Domain-specific transformations, time series features
  • Data Quality: Great Expectations, custom validation frameworks

MLOps & Infrastructure

  • Cloud: AWS (SageMaker, Glue, Lambda, S3, Athena), Azure (OpenAI, AI Search, DevOps)
  • Orchestration: Apache Airflow, AWS Step Functions
  • Infrastructure as Code: Terraform (multi-account setups, state management)
  • Containerization: Docker, Docker Compose
  • CI/CD: GitHub Actions, automated testing
  • Monitoring: Model performance tracking, data drift detection

Visualization & BI

  • Tools: Tableau, Power BI, Streamlit
  • Libraries: Matplotlib, Seaborn, Plotly

📈 How I Think About AI

I believe strong AI engineers:

  • Start with problem framing, not models
  • Treat data pipelines as first-class systems
  • Optimize for maintainability over cleverness
  • Measure success using business impact
  • Build systems that explain themselves
  • Plan explicitly for failure modes

🔬 Currently Exploring

  • Advanced RAG architectures (hybrid search, re-ranking, query decomposition)
  • LLM evaluation frameworks (RAGAS, domain-specific metrics)
  • Fine-tuning and adapting open-source models (Llama 3, Mistral)
  • Production monitoring for LLM systems (cost, latency, hallucination detection)

🤝 Let’s Connect

Seeking:

  • Data Scientist, ML Engineer, and AI Engineer roles (Remote or Hybrid)
  • Contract or consulting projects in AI/ML system design

Happy to discuss:

  • RAG system architecture and evaluation
  • MLOps best practices and scaling challenges
  • Bridging the gap between research and production

Reach me at:


If you find something useful here, feel free to star a repo or reach out.

Pinned Loading

  1. Agentic-RAG-Document-Search-System Agentic-RAG-Document-Search-System Public

    Agentic RAG system using LangGraph + FAISS with source-grounded answers and a production-style architecture

    Python

  2. applied-ml-case-studies applied-ml-case-studies Public

    Business-first, production-style ML case studies with reproducible pipelines, artifacts, and KPI-driven decisioning.

    Python

  3. ml-production-pipeline ml-production-pipeline Public

    End-to-end, production-style machine learning pipeline with training, evaluation, artifact management, and a FastAPI inference service.

    Python

  4. Gemini_Chatbot Gemini_Chatbot Public

    Python

  5. shopassist-rag shopassist-rag Public

    Production-ready RAG system for e-commerce: AI assistant that answers customer questions using semantic search across 150K+ products, reviews, and policies. Built with LangChain, ChromaDB, OpenAI, …

    Python