AI Agent 工程化专题：架构、工具调用、评估与恢复

本系列共 16 篇文章

如果你不只是想做一个 Agent Demo，而是想把 Agent 做到可维护、可评估、可恢复，这个系列适合作为主入口。阅读顺序会从安全边界和架构开始，逐步进入记忆、工具调用、多 Agent、评估和生产恢复。

Agent 架构工具调用评估基准监控恢复

Agent Sandbox 搭建指南：安全运行 AI 代码的完整方案

详解 AI Agent 沙盒环境搭建方案，从 gVisor 到 Firecracker 技术对比，提供本地开发到 Kubernetes 集群的完整部署指南

2026年3月23日AI与智能

Easton editorial illustration: one shielded sandbox containing an agent code cube

AI Agent 开发实战：架构设计与实现指南

深入解析 AI Agent 架构设计：ReAct、Plan-and-Execute、Multi-Agent 三大模式对比，五种多代理编排模式详解，Claude Agent SDK 实战代码示例，助你从理论到实践一网打尽。

2026年3月21日AI与智能

Easton editorial illustration: agent rollout and rollback rail

Agent 记忆系统设计：从会话到长期记忆

从零构建 Agent 记忆系统：四种记忆类型选型、五阶段流水线实现、Mem0/Zep/LangMem 框架对比与生产级成本优化策略

2026年4月23日AI与智能

Easton editorial illustration: supervisor dispatch desk

AI Agent 记忆管理：长期记忆与知识治理实战

深度解析 AI Agent 记忆系统：三种记忆类型、四层认知架构、六大框架对比选型。从 Mem0 到 Letta，从向量库到知识图谱，解决 Agent 失忆与上下文腐烂问题。

2026年4月13日AI与智能

Easton editorial illustration: one central memory library linking recent notes to durable knowledge shelves

Agent 工具调用实战：让 AI 调用外部 API 和服务

从 Function Calling 到 MCP，详解 Claude 和 OpenAI 的工具调用机制，提供完整代码示例和最佳实践，助你打造具备 API 调用能力的 AI Agent

2026年3月21日AI与智能

Easton editorial illustration: tool-socket control board

Computer-Use Agent：让 AI 操作你的电脑

深入解析 Claude Computer Use 技术，从原理到实践完整指南。包含 Docker 部署、代码示例、竞品分析和安全最佳实践，助你掌握 AI 桌面自动化的前沿技术

2026年3月22日AI与智能

Easton editorial illustration: durable queue station

多智能体协作实战：4 种架构模式选择指南

掌握多智能体协作系统的 4 种核心架构模式，从 Subagents 到 Router，附带 LangGraph 代码实现和生产级性能优化建议。

2026年3月25日AI与智能

Easton editorial illustration: permission gate hub

AI Agent 工具链设计：从单一工具到工具生态的演进指南

详解 AI Agent 工具链设计，从 MCP 协议到主流框架选型，覆盖 LangChain、CrewAI、AutoGen 对比与企业落地实践，助你构建可扩展的工具生态

2026年4月30日AI与智能

Easton editorial illustration: agent toolchain assembly desk, model socket, MCP adapter, framework chassis, enterprise control rail

LangGraph 状态管理实战：Checkpoint、Thread State 与失败恢复

2026 LangGraph 状态管理实战指南：解释 checkpoint、thread state、failure recovery、AutoGen 对比和监控方案，帮助你设计可恢复的生产级 Agent 架构。

2026年4月24日AI与智能

Easton editorial illustration: one central state ledger with three controlled graph branches

LangGraph 多 Agent 协作实战：Supervisor 模式与任务分发

深入解析 LangGraph Supervisor 模式架构原理，通过 Research + Writing 团队实战案例，掌握多 Agent 任务分发与协作的核心技巧，包含完整可运行代码示例

2026年5月12日AI与智能

Easton editorial illustration: Supervisor baton, research brief card, research station, writing station, synthesis tray

LangGraph vs AutoGen 状态追踪对比：checkpoint机制、超时恢复与选型决策

从 checkpoint、超时恢复和分布式状态等 12 个维度比较 LangGraph 与 AutoGen，结合代码与选型树判断哪套 Agent 框架更适合生产项目。

2026年5月26日AI与智能

Easton editorial illustration: central durable-state core, LangGraph snapshot vault, AutoGen conversation relay, recovery return path

LLM 结构化输出：JSON Schema 强制与工具调用可靠性保障

生产级 LLM 结构化输出完整指南：从 JSON Schema 强制验证到工具调用可靠性保障。对比 OpenAI/Claude/Gemini 三大厂商实现方案，提供 Python/TypeScript 实战代码模板，构建三层可靠性架构确保 100% 格式合规。

2026年5月6日AI与智能

Easton editorial illustration: central JSON Schema gate, three incoming provider-output cards, one validated tool-call object

Agent 评估基准实战：从 AgentBench 到 DeepEval 的性能测试指南

详解 Agent 评估基准与性能测试框架，对比 AgentBench、WebArena、τ-Bench 等五大基准，介绍 DeepEval 组件级评估方法，提供完整代码示例。

2026年5月3日AI与智能

Easton editorial illustration: three-level agent evaluation scoreboard, benchmark token tray, component-level DeepEval probe

Agent 规划能力怎么测？推理深度、任务分解与自我纠错的评估实战

Agent 规划能力怎么测？本文详解推理深度、任务分解、自我纠错的评测方法论，对比 AgentBench、ToolBench、ACPBench 等主流 benchmark，提供实战评测指南。

2026年5月7日AI与智能

Easton editorial illustration: agent planning test rig scoring decomposition depth, correction, and completion across benchmark trays

AI Agent 监控告警与失败恢复：从日志到状态机的设计实践

AI Agent 上线后失败无从排查？本文从日志到状态机的完整设计实践，教你构建生产级监控告警体系，让每个失败可观测、可恢复

2026年5月27日AI与智能

Easton editorial illustration: large Agent state recorder, coral failure beacon, checkpoint rewind handle, recovery status strip

DeepAgents 架构解析：规划工具、子代理与文件系统

深度解析 DeepAgents 四大支柱架构：Planning Tools、Sub-agents、File System 和 System Prompts，对比 LangGraph、AutoGen 等框架，提供实战代码示例和最佳实践

2026年4月26日AI与智能