Design Patterns & Techniques

🔗

Prompt Chaining

🔀

Routing

⚡

Parallelization

🪞

Reflection

🔧

Tool Use

🎯

Planning

👥

Multi-Agent

🧠

Memory Management

📈

Learning and Adaptation

🏗️

Fault Tolerance Infrastructure

📚

Knowledge Retrieval (RAG)

🧠

Reasoning Techniques

🔐

Security & Privacy Patterns

📊

Evaluation and Monitoring

🧠

Context Management

🎨

UI/UX & Human-AI Interaction

Loading...

🏗️

12-Factor Agent Methodology(12FA)

Production-ready methodology adapting 12-factor app principles for scalable, maintainable agent systems with comprehensive monitoring and evaluation.

Complexity: highEvaluation and Monitoring

🎯 30-Second Overview

Pattern: Production-ready methodology adapting 12-factor app principles for scalable, maintainable agent systems

Why: Moves agents beyond 70-80% prototype reliability to production-grade systems with human-in-the-loop workflows

Key Insight: Most successful agents aren't the most 'agentic' — they're well-engineered software systems leveraging LLMs for controlled transformations

⚡ Quick Implementation

1Design:Start with JSON extraction as foundation

2Structure:Own prompts, context windows, and control flow

3Architect:Build stateless, focused agents with explicit error handling

4Integrate:Add human-in-the-loop and multi-channel support

5Deploy:Monitor observability and iterate at the bleeding edge

Example: agent = Agent(prompt=owned, context=managed, flow=explicit, state=stateless, humans=first_class)

The 12 Factors in Detail

JSON Extraction as Foundation

Core LLM superpower: converting natural language to structured data. The ability to take a string and turn it into JSON.

Own Your Prompts

Production quality requires hand-crafted prompts, not abstractions. Own your prompts to tweak token order and system/user roles as models change.

Manage Context Windows Explicitly

Don't blindly append; actively manage what the LLM sees. Own the context window, squeezing traces and error summaries for self-healing.

Tools Are Just JSON and Code

Demystify "tool use" as simple routing. Treat tools as structured JSON outputs validated through switch statements.

Own Your Control Flow

Agents = prompt + switch + context + loop. Keep control-flow in code with explicit OODA loops and convergence heuristics.

Stateless Agent Design

Enable pause/resume and horizontal scaling. Persist execution state for idempotent restarts.

Separate Business from Execution State

Different lifecycles, different needs. Expose launch/pause/resume endpoints for safe replay runs.

Contact Humans as First-Class Operations

Not an edge case, but core functionality. Route high-stakes steps to humans as first-class tool calls.

Meet Users Where They Are

Email, Slack, Discord — multi-channel by design. Trigger agents from wherever users already work.

Small, Focused Agents Beat Monoliths

3-10 steps max for reliability. Build small, single-purpose agents instead of chatty monoliths.

Explicit Error Handling

Process errors intelligently, not blindly. Compact errors into the next prompt to close the feedback loop.

Find the Bleeding Edge

Engineer reliability where models almost succeed. Find what's at the boundary of model capability and make it reliable.

📋 Do's & Don'ts

✅Hand-craft prompts for production quality, avoid prompt abstractions

✅Actively manage context windows with traces and error summaries

✅Build stateless agents that enable pause/resume functionality

✅Route high-stakes operations to humans as first-class tool calls

✅Keep agents small and focused (3-10 steps max)

❌Blindly append to context windows without management

❌Build chatty monolithic agents instead of focused ones

❌Nest prompts - use explicit control flow in code instead

❌Ignore error handling - compact errors into next prompt

❌Treat human interaction as edge case rather than core feature

🚦 When to Use

Use When

• Building production-ready AI agents for enterprise
• Scaling beyond 70-80% prototype functionality
• Need reliable, maintainable agent systems
• Require human-in-the-loop workflows
• Building multi-channel agent experiences

Avoid When

• Simple demos or proof-of-concept projects
• Single-use or throwaway agent tasks
• Research experiments without production requirements
• Cases where 70-80% reliability is sufficient
• Purely automated workflows without human oversight needs

📊 Key Metrics

Production Reliability

>90% success rate in production environments

Agent Response Time

P95 latency for agent task completion

Human Escalation Rate

% of tasks requiring human intervention

Context Window Utilization

Efficiency of context management

Error Recovery Success

% of errors resolved through self-healing

Multi-Channel Adoption

Usage across different user interfaces

💡 Top Use Cases

Enterprise Customer Support: Human-escalated agent handling complex support tickets across multiple channels

Financial Operations: Stateless agents processing transactions with explicit error handling and audit trails

Content Management: Small, focused agents for content creation, review, and publication workflows

DevOps Automation: Agents managing deployment pipelines with human approval gates for production releases

Sales Enablement: Multi-channel agents supporting sales teams through CRM, Slack, and email integrations

References & Further Reading

Deepen your understanding with these curated resources

Official 12-Factor Agents Resources

12-Factor Agents GitHub Repository (Dexter Horthy)

HumanLayer 12-Factor Agents Documentation

MLOps Community: Agents in Production 2025 (Dexter Horthy)

LlamaIndex 12-Factor Implementation

Implementation Guides & Tutorials

The 12-Factor Agent: Practical Framework (DEV Community)

Building Reliable LLM Applications Without Magic (Brandon AI)

12-Factor Agents: Blueprint for Reliable LLM Applications

DZone: The Twelve-Factor Agents for Production-Ready LLM Apps

Academic & Research Analysis

Adnan Masood: Framework for Reliable LLM Agents (Medium)

Mehul Gupta: How to Build Production Grade AI-Agents (Medium)

ODSC: Blueprint for Scalable AI Agents Insights

Hacker News Discussion: 12-factor Agents

Production & Enterprise Resources

The Original 12-Factor App Methodology (Heroku)

DevThink: Building Reliable AI Agents Framework

FlowHunt: 12-Factor AI Agent Building Guide

Enterprise AI Governance Best Practices

Contribute to this collection

Know a great resource? Submit a pull request to add it.

Contribute

🏗️

12-Factor Agent Methodology(12FA)

Production-ready methodology adapting 12-factor app principles for scalable, maintainable agent systems with comprehensive monitoring and evaluation.

Complexity: highEvaluation and Monitoring

🎯 30-Second Overview

Pattern: Production-ready methodology adapting 12-factor app principles for scalable, maintainable agent systems

Why: Moves agents beyond 70-80% prototype reliability to production-grade systems with human-in-the-loop workflows

Key Insight: Most successful agents aren't the most 'agentic' — they're well-engineered software systems leveraging LLMs for controlled transformations

⚡ Quick Implementation

1Design:Start with JSON extraction as foundation

2Structure:Own prompts, context windows, and control flow

3Architect:Build stateless, focused agents with explicit error handling

4Integrate:Add human-in-the-loop and multi-channel support

5Deploy:Monitor observability and iterate at the bleeding edge

Example: agent = Agent(prompt=owned, context=managed, flow=explicit, state=stateless, humans=first_class)

The 12 Factors in Detail

JSON Extraction as Foundation

Core LLM superpower: converting natural language to structured data. The ability to take a string and turn it into JSON.

Own Your Prompts

Production quality requires hand-crafted prompts, not abstractions. Own your prompts to tweak token order and system/user roles as models change.

Manage Context Windows Explicitly

Don't blindly append; actively manage what the LLM sees. Own the context window, squeezing traces and error summaries for self-healing.

Tools Are Just JSON and Code

Demystify "tool use" as simple routing. Treat tools as structured JSON outputs validated through switch statements.

Own Your Control Flow

Agents = prompt + switch + context + loop. Keep control-flow in code with explicit OODA loops and convergence heuristics.

Stateless Agent Design

Enable pause/resume and horizontal scaling. Persist execution state for idempotent restarts.

Separate Business from Execution State

Different lifecycles, different needs. Expose launch/pause/resume endpoints for safe replay runs.

Contact Humans as First-Class Operations

Not an edge case, but core functionality. Route high-stakes steps to humans as first-class tool calls.

Meet Users Where They Are

Email, Slack, Discord — multi-channel by design. Trigger agents from wherever users already work.

Small, Focused Agents Beat Monoliths

3-10 steps max for reliability. Build small, single-purpose agents instead of chatty monoliths.

Explicit Error Handling

Process errors intelligently, not blindly. Compact errors into the next prompt to close the feedback loop.

Find the Bleeding Edge

Engineer reliability where models almost succeed. Find what's at the boundary of model capability and make it reliable.

📋 Do's & Don'ts

✅Hand-craft prompts for production quality, avoid prompt abstractions

✅Actively manage context windows with traces and error summaries

✅Build stateless agents that enable pause/resume functionality

✅Route high-stakes operations to humans as first-class tool calls

✅Keep agents small and focused (3-10 steps max)

❌Blindly append to context windows without management

❌Build chatty monolithic agents instead of focused ones

❌Nest prompts - use explicit control flow in code instead

❌Ignore error handling - compact errors into next prompt

❌Treat human interaction as edge case rather than core feature

🚦 When to Use

Use When

• Building production-ready AI agents for enterprise
• Scaling beyond 70-80% prototype functionality
• Need reliable, maintainable agent systems
• Require human-in-the-loop workflows
• Building multi-channel agent experiences

Avoid When

• Simple demos or proof-of-concept projects
• Single-use or throwaway agent tasks
• Research experiments without production requirements
• Cases where 70-80% reliability is sufficient
• Purely automated workflows without human oversight needs

📊 Key Metrics

Production Reliability

>90% success rate in production environments

Agent Response Time

P95 latency for agent task completion

Human Escalation Rate

% of tasks requiring human intervention

Context Window Utilization

Efficiency of context management

Error Recovery Success

% of errors resolved through self-healing

Multi-Channel Adoption

Usage across different user interfaces

💡 Top Use Cases

Enterprise Customer Support: Human-escalated agent handling complex support tickets across multiple channels

Financial Operations: Stateless agents processing transactions with explicit error handling and audit trails

Content Management: Small, focused agents for content creation, review, and publication workflows

DevOps Automation: Agents managing deployment pipelines with human approval gates for production releases

Sales Enablement: Multi-channel agents supporting sales teams through CRM, Slack, and email integrations

References & Further Reading

Deepen your understanding with these curated resources

Official 12-Factor Agents Resources

12-Factor Agents GitHub Repository (Dexter Horthy)

HumanLayer 12-Factor Agents Documentation

MLOps Community: Agents in Production 2025 (Dexter Horthy)

LlamaIndex 12-Factor Implementation

Implementation Guides & Tutorials

The 12-Factor Agent: Practical Framework (DEV Community)

Building Reliable LLM Applications Without Magic (Brandon AI)

12-Factor Agents: Blueprint for Reliable LLM Applications

DZone: The Twelve-Factor Agents for Production-Ready LLM Apps

Academic & Research Analysis

Adnan Masood: Framework for Reliable LLM Agents (Medium)

Mehul Gupta: How to Build Production Grade AI-Agents (Medium)

ODSC: Blueprint for Scalable AI Agents Insights

Hacker News Discussion: 12-factor Agents

Production & Enterprise Resources

The Original 12-Factor App Methodology (Heroku)

DevThink: Building Reliable AI Agents Framework

FlowHunt: 12-Factor AI Agent Building Guide

Enterprise AI Governance Best Practices

Contribute to this collection

Know a great resource? Submit a pull request to add it.

Contribute

Patterns

closed

Design Patterns & Techniques

🔗

Prompt Chaining

🔀

Routing

⚡

Parallelization

🪞

Reflection

🔧

Tool Use

🎯

Planning

👥

Multi-Agent

🧠

Memory Management

📈

Learning and Adaptation

🏗️

Fault Tolerance Infrastructure

📚

Knowledge Retrieval (RAG)

🧠

Reasoning Techniques

🔐

Security & Privacy Patterns

📊

Evaluation and Monitoring

🧠

Context Management

🎨

Agentic Design

Agentic Design

Design Patterns & Techniques

Prompt Chaining

Routing

Parallelization

Reflection

Tool Use

Planning

Multi-Agent

Memory Management

Learning and Adaptation

Fault Tolerance Infrastructure

Knowledge Retrieval (RAG)

Reasoning Techniques

Security & Privacy Patterns

Evaluation and Monitoring

MLCommons AI Safety Benchmark v1.0(AILuminate)

AgentBench(AgentBench)

TheAgentCompany Benchmark(TAC)

MLR-Bench(MLR-Bench)

12-Factor Agent Methodology(12FA)

HELM Agent Evaluation Framework(HELM-AE)

Human-in-the-Loop Agent (HULA)(HULA)

CybersecEval 3(CSE3)

METR RE-Bench(RE-Bench)

SWE-bench Suite(SWE-bench)

GAIA: General AI Assistants Benchmark(GAIA)

MMAU: Massive Multitask Agent Understanding(MMAU)

WebArena Evaluation Suite(WebArena)

EU AI Act Compliance Framework(EU-AIACT)

AISI Evaluation Framework(AISI-Eval)

MAPS: Multilingual Agent Performance & Security(MAPS)

Constitutional AI Evaluation Framework(CAI-Eval)

Context Management

UI/UX & Human-AI Interaction

Loading...

12-Factor Agent Methodology(12FA)

🎯 30-Second Overview

⚡ Quick Implementation

The 12 Factors in Detail

JSON Extraction as Foundation

Own Your Prompts

Manage Context Windows Explicitly

Tools Are Just JSON and Code

Own Your Control Flow

Stateless Agent Design

Separate Business from Execution State

Contact Humans as First-Class Operations

Meet Users Where They Are

Small, Focused Agents Beat Monoliths

Explicit Error Handling

Find the Bleeding Edge

📋 Do's & Don'ts

🚦 When to Use

Use When

Avoid When

📊 Key Metrics

💡 Top Use Cases

References & Further Reading

Official 12-Factor Agents Resources

Implementation Guides & Tutorials

Academic & Research Analysis

Production & Enterprise Resources

Contribute to this collection

12-Factor Agent Methodology(12FA)

🎯 30-Second Overview

⚡ Quick Implementation

The 12 Factors in Detail

JSON Extraction as Foundation

Own Your Prompts

Manage Context Windows Explicitly

Tools Are Just JSON and Code

Own Your Control Flow

Stateless Agent Design

Separate Business from Execution State

Contact Humans as First-Class Operations

Meet Users Where They Are

Small, Focused Agents Beat Monoliths

Explicit Error Handling