Skip to content
View amaljithkuttamath's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report amaljithkuttamath

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
amaljithkuttamath/README.md

Amaljith Kuttamath

amaljithkuttamath

AI Engineer building tools to understand what language models are doing internally, and catching them when they fail.

Working on mechanistic interpretability, LLM evaluation, and trust/safety tooling. Most of my projects are small, sharp, and designed to answer one question well.


Current focus

Trust Bench - Interpretability toolkit that probes SAE features, activations, and circuits in Llama 3.1 8B. Three probes (feature survey, hallucination, cross-lingual), statistical analysis, publication-quality viz, and a CLI.

Interpretability research

Project What it does
sae-explorer Found a single SAE feature (#10543) that fires on "and" across six languages in Gemma 2 2B. Zero false positives.
superposition-viz Reproduces Anthropic's Toy Models of Superposition. Found phase transition at 0.7 sparsity.
activation-atlas Layer-by-layer UMAP projections showing how neural networks organize learned representations.
scaling-laws Train transformers from 100K to 10M params, fit power laws, plot the curves. Do they hold at toy scale?
loss-landscape 3D surface plots of loss landscapes around trained weights. Sharpness comparison across training configs.

LLM evaluation

Project What it does
calibration-probe Measure how well LLMs know what they know. Reliability diagrams and ECE across prompting strategies.
attention-bench Benchmark MHA vs GQA vs MQA vs Sliding Window. Train small transformers, compare perplexity and throughput.

Rust CLI tools

Project What it does
crux Terminal dashboard for AI coding tool token usage.
tokenizer-arena Compare how different LLM tokenizers handle the same text. Color-coded token boundaries.
gguf-inspect Inspect GGUF model files from the terminal. Architecture, quantization, tensors, memory estimates.

Writing

Recent posts on amaljithkuttamath.github.io/work:

Tech

Python Rust PyTorch Hugging Face TransformerLens SAELens

Connect

LinkedIn Twitter Substack Email Website

Pinned Loading

  1. Mp3Metadata Mp3Metadata Public

    Do you have a lot of mp3 files lying around without its name and metadata, well this tool is for you! Go ahead and spin this up.Add cover, name, artist name and other data to your music.

    Python

  2. falcon falcon Public

    LLM-powered chat application using Google Gemma via Ollama, LangChain, and Chainlit

    Python 1

  3. job-resume-matcher job-resume-matcher Public

    AI-powered resume-to-job matching using NLP, TF-IDF, and cosine similarity with keyword suggestions

    Python 1