Tag
#agent
2 posts
Professional ai ml 24 min
Agent Core v5.4: When Data Augmentation Backfires
We 6.5x'd our training data and made the model safer but less capable. Here's the root cause, the numbers, and the v5.5 fix plan.
lora fine-tuning tool-calling
Professional ai ml 22 min
Agent Core v5.3: What We Learned Training a Tool-Calling LoRA on Real Data
Concrete training results comparing v5.2 and v5.3 of Agent Core — a universal tool-calling LoRA for 8B models. What improved, what regressed, and why SFT has a ceiling.
lora fine-tuning tool-calling