Skip to content
All Posts
Agentic AI

Agentic AI & LLM Engineering

Deep dives into agentic AI, LLM evaluation, synthetic data generation, model selection, and production AI engineering best practices.

9 of 1162 articles

Agentic AI
7 min read

Testing and evals for Claude Cowork agents that ship

Build an eval loop that gates Claude Cowork releases — task-level metrics, deterministic and LLM-judge graders, regression suites, and CI that blocks regressions.

Agentic AI
6 min read

Security hardening for Claude Cowork agentic AI systems

A practical playbook for hardening Claude Cowork agentic AI — sandboxing, least privilege, secrets isolation, and layered prompt-injection defense.

Agentic AI
6 min read

Cutting Claude Cowork token costs: caching and batching

Keep agentic Claude Cowork runs cheap and fast with prompt caching, batching, context trimming, and per-step model routing without sacrificing quality.

Agentic AI
7 min read

Debugging Claude Cowork agents: loops and bad tool calls

Fix the real failure modes of Claude Cowork agents — infinite loops, wrong tool selection, and hallucinated arguments — with grounded, practical techniques.

Agentic AI
7 min read

Prompt and context design for Claude Cowork agents

What to put in a Claude Cowork agent's context, what to leave out, and why — practical context engineering for reliable agentic knowledge work.

Agentic AI
7 min read

Wiring MCP servers into Claude Cowork: the full guide

Connect MCP servers to Claude Cowork the right way — auth scopes, strict schemas, structured error handling, and idempotency keys for safe agentic workflows.

Agentic AI
6 min read

Claude Cowork patterns: prompts, tools, and context

Reusable patterns for Claude Cowork agents — design prompts as contracts, shape tool interfaces, and engineer context so workflows stay reliable at scale.

Agentic AI
7 min read

Build a Claude Cowork workflow: a step-by-step guide

A concrete, engineer-followable guide to building a real Claude Cowork workflow — scope the task, wire connectors, write a skill, and test the agentic loop.

Showing 9 of 1162