A complete, hands-on guide to becoming an AI Engineer.
This repository is designed to help you learn AI from first principles, build real neural networks, and understand modern LLM systems end-to-end. You'll progress through math, PyTorch, deep learning, transformers, RAG, and OCR — with clean, intuitive Jupyter notebooks guiding you at every step.
Whether you're a beginner or an engineer levelling up, this repo gives you the clarity, structure, and intuition needed to build real AI systems.
If you learn something useful, a star is appreciated.
- Math functions, derivatives, vectors, and gradients
- Matrix operations and linear algebra
- Probability and statistics
- Creating and manipulating tensors
- Matrix multiplication, transposing, and reshaping
- Indexing, slicing, and concatenating tensors
- Special tensor creation functions
- Building neurons, layers, and networks from scratch
- Normalization techniques (RMSNorm)
- Activation functions
- Optimizers (Adam, Muon) and learning rate decay
- Attention and self-attention mechanisms
- Multi-head attention
- Decoder-only transformer architecture
- Building RAG pipelines end to end
- Indexing, retrieval, chunking strategies
- Integrations with embedding models and vector stores
- OCR pipeline and utilities
- Preprocessing images and extracting text
Recommended reading to deepen your understanding (not included):
AI Engineeringby Chip HuyenHands-On Machine Learning with Scikit-Learn, Keras, and TensorFlowby Aurélien GéronDeep Learningby Ian Goodfellow, Yoshua Bengio, and Aaron CourvilleThe Elements of Statistical Learningby Trevor Hastie, Robert Tibshirani, and Jerome FriedmanNeural Networks and Deep Learningby Michael NielsenSQL Cookbookby Anthony Molinaro
For more books in AI/ML, I have created another repo for this Check Here. I will be adding lot more in coming days/months. If you are interested to read book, go check this repo out.
For a recommended step-by-step progression through the materials, see the Learning Path:
Start_here/learning_path.md
Install dependencies with:
pip install -r requirements.txtSome subfolders (for example 5.RAG/ and 6.OCR/) include their own requirements.txt with additional dependencies.
Recommended workflow:
-
Open Jupyter in the project root:
jupyter lab # or jupyter notebook -
Work through notebooks in order:
1.Math/2.PyTorch/3.Neural-Network(NN)/4.Transformer/
-
Folder to run separately:
5.RAG/6.OCR/
-
Resources
-
Basic ML Model Implementation (Supervised + Un-supervised + RL)
1.Linear Regression2.Logistic Regression3.Decision Tree Model4.Naive Bayes Classification
| Tool | Category | Link |
|---|---|---|
| Scikit-learn | Traditional ML | https://scikit-learn.org/stable/ |
| XGBoost | Gradient Boosting | https://xgboost.ai/ |
| LightGBM | Gradient Boosting | https://lightgbm.readthedocs.io/en/stable/ |
| CatBoost | Gradient Boosting | https://catboost.ai/ |
Contributions are welcome!
Please ensure:
- Notebooks are clean (Restart & Run All before committing)
- Existing structure & naming conventions are followed
- PRs are focused, readable, and documented
- In folders like RAG and OCR, please maintain the cleaned structure part
- If you want to add something new folders, make it proper structure way.
- This project is licensed under the MIT License. See
LICENSEfor details.
