Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI, and any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates the full workflow end to end: environment setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc GRPO log diagnostics, evaluation, and export. Part of the Gaslamp AI platform.
transformer lora fine-tuning sft dpo huggingface apple-silicon rlhf qlora unsloth grpo claude-code gaslamp
Updated Mar 24, 2026 · Python