Skip to content
View rb125's full-sized avatar

Block or report rb125

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
rb125/README.md

Rahul Baxi

Embodied AI Startup Founder/ Independent AI researcher working on behavioral evaluation, epistemic robustness, and comprehension-gated intelligence systems.

Research Program

My work studies how large language models fail under compression, adversarial pressure, and ethical stress.

Rather than optimizing for surface accuracy, I investigate structural robustness:

  • Do models preserve constraints under ambiguity?
  • Can they reject plausible falsehoods?
  • Do they adapt behaviorally under pressure?
  • Can capability growth be gated by demonstrated understanding?

The research arc progresses across four evaluation frameworks:

CDCT → DDFT → AGT → Comprehension-Gated Capability Growth


Papers

CDCT — Compression-Decay Comprehension Test (2025)

Separates constraint compliance from semantic accuracy under prompt compression.
arXiv: [https://arxiv.org/abs/2512.17920]

DDFT — Drill-Down and Fabricate Test (2025)

Stress-tests epistemic robustness via progressive compression and adversarial fabrication.
Under peer review. arXiv: [https://arxiv.org/abs/2512.23850]

AGT — Action-Gating Test (2026)

Behavioral diagnostic distinguishing performative ethical reasoning from genuine adaptability.
Zenodo: [https://zenodo.org/records/18282166]

Before the First Cause (2025)

Recursive causality in Indian philosophical literature.
Under peer review, Zenodo: [https://zenodo.org/records/17905985]


Current Focus

  • Behavioral Evaluation Theory
  • Agentic verification systems
  • Comprehension-gated training architectures
  • Robustness under compression and adversarial drift

Contact: rbaxi@alumni.cmu.edu

Pinned Loading

  1. esp32-face-detection esp32-face-detection Public

    Face detection with esp32 and opencv

    Python

  2. FlavorGraph FlavorGraph Public

    Forked from lamypark/FlavorGraph

    Python

  3. OpenDevin OpenDevin Public

    Forked from OpenHands/OpenHands

    TypeScript