Software Tech Radar

Real-world agent engineering: scaling, governance, security, maintenance

Real-world agent engineering: scaling, governance, security, maintenance

Key Questions

What are common failure rates and issues in production AI agents?

Production agents often face failure rates between 41-87% along with memory issues, prompting focus on governance, scaling, and maintenance practices.

What frameworks and tools support agent observability and scaling?

Tools like AgentLens, MCP, Honeycomb, LLMops, and Databricks are used alongside practices such as 12-factor agents, sandbox architectures, and formal verification gates.

How are enterprises addressing AI pilot scaling challenges?

Approaches include ex-SAP production checklists, OpenClaw best practices, hybrid memory systems, and bottom-up adoption with human-in-the-loop patterns like LangGraph.

What security measures are recommended for AI agents?

Zero-trust architectures for MCP workflows from Versa, Palo Alto AI security tools, and five control layers help secure agent infrastructure.

How does AI observability help in production environments?

AI observability makes systems transparent and measurable, supporting enterprise scaling as discussed in Google Enterprise and financial services contexts.

What workshops and assessments aid agent readiness?

Bedrock AgentCore workshops, AI readiness assessments, and n8n/LLMOps best practices support moving agents from pilots to production.

How are financial rails and governance being implemented for agents?

Catena Labs provides agent financial rails while multi-layer controls and observability ensure compliance in scaled deployments.

What role does human-in-the-loop play in agent systems?

Human-in-the-loop patterns, such as those in LangGraph, help mitigate failures and improve reliability in long-running agent workflows.

Failures 41-87% + memory issues; AgentLens/MCP/Honeycomb/LLMops/Databricks; new: ex-SAP production checklists, OpenClaw prod best practices, Gemini 3.5 production failure, Bedrock AgentCore workshops, Palo Alto AI security, n8n/LLMOps best practices, hybrid memory, formal verification gates, Catena Labs agent financial rails, 5 control layers, AI readiness assessments, Google Enterprise scaling/observability/Model Armor, Versa zero-trust MCP, bottom-up adoption, human-in-loop LangGraph, AI observability, 12-factor agents, sandbox architectures.

Sources (35)
Updated May 24, 2026