Engineering notes, model evals, and deep dives from the CowAgent team.
How deepseek-v4-flash behaves inside CowAgent's full agent loop across six end-to-end tasks: planning, coding, memory, browser, knowledge base, and very long documents.