Joche Ojeda

Xari
How CLI Tools Can Drastically Reduce Token Consumption ⚡

After realizing that traditional tool schemas were burning tokens, I started looking for a lighter approach. CLI-style tools turned out to be a practical solution. By replacing verbose JSON schemas with simple commands, I reduced prompt size, improved tool selection, and made agents cheaper, faster, and easier to scale.
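To make the size difference concrete, here is a minimal sketch that describes one tool in both styles and compares how much text each adds to a prompt. The `get_weather` tool, its parameters, and the CLI usage string are hypothetical examples, not taken from any real agent framework, and character count is only a rough stand-in for tokens.

```python
import json

# Hypothetical tool, described as a typical JSON function schema.
json_schema_tool = {
    "name": "get_weather",
    "description": "Get the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {
            "city": {"type": "string", "description": "City name."},
            "units": {
                "type": "string",
                "enum": ["metric", "imperial"],
                "description": "Unit system.",
            },
        },
        "required": ["city"],
    },
}

# The same hypothetical tool as a one-line CLI usage string.
cli_tool = "get_weather <city> [--units metric|imperial]  # current weather for a city"


def prompt_chars(tool) -> int:
    """Characters a tool description adds to the prompt (rough proxy for tokens)."""
    return len(tool if isinstance(tool, str) else json.dumps(tool))


schema_chars = prompt_chars(json_schema_tool)
cli_chars = prompt_chars(cli_tool)
print(f"JSON schema: {schema_chars} chars, CLI line: {cli_chars} chars")
```

For an actual measurement, swap `prompt_chars` for the tokenizer of the model you are targeting; the exact ratio will depend on the tool and the tokenizer.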

From “Hello” to Quota Exceeded: The Day My Agent Broke 💥

After building an agent with 50 tools, I expected powerful automation. Instead, users hit quota limits after saying “Hello.” The problem wasn’t usage; it was hidden context cost. Every tool definition was injected into every request, so tokens were spent before the model did any real work. This experience revealed why tool design and loading strategy matter for scalable, efficient AI agents.
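The arithmetic behind that hidden cost can be sketched as follows. The figure of 300 tokens per tool definition is an illustrative assumption, not a measurement from the agent described here, and the keyword-based `select_tools` helper is a hypothetical, deliberately naive loading strategy shown only to contrast with injecting everything.

```python
def fixed_overhead(num_tools: int, tokens_per_tool: int) -> int:
    """Tokens spent on tool definitions before the user's message is even read."""
    return num_tools * tokens_per_tool


def select_tools(message: str, tools: dict[str, set[str]]) -> list[str]:
    """Return only the tools whose keywords appear in the message (naive matching)."""
    words = set(message.lower().split())
    return sorted(name for name, keywords in tools.items() if words & keywords)


# Assumed numbers: 50 tools at roughly 300 tokens each.
overhead_all = fixed_overhead(num_tools=50, tokens_per_tool=300)

# Hypothetical tool registry mapping tool names to trigger keywords.
registry = {
    "get_weather": {"weather", "forecast"},
    "send_email": {"email", "mail"},
}

needed_for_hello = select_tools("Hello", registry)
overhead_hello = fixed_overhead(len(needed_for_hello), tokens_per_tool=300)

print(f"All tools injected: {overhead_all} tokens per request")
print(f"Tools loaded for 'Hello': {needed_for_hello}, overhead: {overhead_hello} tokens")
```

Under these assumptions, a greeting that needs no tools still pays for all 50 definitions when everything is injected up front, while selective loading pays for none.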