AI coding agents from OpenAI, Anthropic, and Google are powerful but problematic. They work by orchestrating multiple LLMs, but face huge “context” limits and can burn through tokens 15x faster than a chat. Understanding their architecture is key to using them without creating a mess.