Human-in-the-Loop Agent (HULA)

Framework for human-in-the-loop evaluation and refinement of LLM-based agents, allowing engineers to guide and assess agent performance at each development stage.

Complexity: High | Category: Evaluation and Monitoring

🎯 30-Second Overview

Pattern: Three-agent collaboration framework with AI Planner, AI Coder, and Human Agent for software development

Why: Maintains human control while leveraging AI assistance for JIRA issue resolution and code generation

Key Insight: 79% plan success and a 59% PR merge rate show that keeping engineers in the driver's seat enables reliable AI collaboration

⚡ Quick Implementation

1. Setup: Deploy AI Planner, AI Coding, and Human Agent components
2. Plan: AI Planner creates coding plan from JIRA issue
3. Review: Human agent reviews, refines, and approves plan
4. Code: AI Coding agent generates code based on approved plan
5. Validate: Human reviews code, provides feedback, approves PR
Example: hula_session = HULA(issue=jira_ticket, agents=[planner, coder, human], stages=[plan, code, review])
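
The five steps above amount to a gated generate-review loop. Below is a minimal Python sketch of that flow; the JiraIssue, AIPlanner, AICoder, human_approves, and run_hula names are illustrative stand-ins, not the HULA framework's actual API.

```python
# Minimal sketch of the HULA flow: plan -> human review -> code -> human review.
# All names here are hypothetical stand-ins, not Atlassian's implementation.

from dataclasses import dataclass
from typing import Optional


@dataclass
class JiraIssue:
    key: str
    summary: str
    description: str


class AIPlanner:
    """AI Planner agent: drafts a coding plan from a JIRA issue (stubbed)."""

    def plan(self, issue: JiraIssue) -> str:
        return f"Plan for {issue.key}: outline files and changes for '{issue.summary}'."


class AICoder:
    """AI Coding agent: generates code changes from an approved plan (stubbed)."""

    def code(self, plan: str) -> str:
        return f"# Generated changes implementing:\n# {plan}\n"


def human_approves(artifact: str, stage: str) -> bool:
    """Human agent gate: show the artifact and ask for explicit approval."""
    print(f"\n--- {stage} for review ---\n{artifact}")
    return input(f"Approve {stage}? [y/N] ").strip().lower() == "y"


def run_hula(issue: JiraIssue, planner: AIPlanner, coder: AICoder) -> Optional[str]:
    """One HULA session; the engineer stays in the driver's seat at every gate."""
    plan = planner.plan(issue)
    if not human_approves(plan, "plan"):
        return None  # plan rejected: refine the issue or the plan and retry
    changes = coder.code(plan)
    if not human_approves(changes, "code"):
        return None  # code rejected: feed review comments back to the coder
    return changes  # approved changes, ready to raise as a pull request


if __name__ == "__main__":
    issue = JiraIssue("PROJ-123", "Add retry logic to HTTP client", "Retry on 5xx responses.")
    result = run_hula(issue, AIPlanner(), AICoder())
    print("Ready for PR." if result else "Session ended without approved code.")
```

In a real deployment the rejection branches would loop back with the engineer's feedback rather than terminate, mirroring the refine-and-approve cycles in steps 3 and 5.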

📋 Do's & Don'ts

✅ Keep human engineer in driver's seat throughout the development process
✅ Use three-stage evaluation: offline, online, and practitioner perception
✅ Incorporate feedback from compilers, linters, and validation tools (see the sketch after this list)
✅ Review and refine plans before moving to coding stage
✅ Deploy in real JIRA environment for authentic evaluation
❌ Allow fully autonomous operation without human oversight
❌ Skip plan approval stage - human validation is critical
❌ Ignore compiler/linter feedback in code generation loop
❌ Expect 100% automation - human collaboration is the goal
❌ Deploy without proper three-stage evaluation framework

🚦 When to Use

Use When

  • Software development teams needing AI assistance
  • JIRA-based development workflows and issue tracking
  • Organizations wanting human-controlled AI coding
  • Teams requiring code quality assurance and oversight
  • Enterprise environments with established review processes

Avoid When

  • Fully autonomous coding requirements
  • Simple scripting or one-off coding tasks
  • Teams without structured issue tracking systems
  • Projects requiring immediate code deployment
  • Environments without human review capacity

📊 Key Metrics

Plan Generation Success: 79% of work items receive successful coding plans
Plan Approval Rate: 82% of generated plans approved by engineers
Code Generation Success: 87% of approved plans result in generated code
Pull Request Rate: 25% of generated code reaches pull request stage
Merge Success Rate: 59% of HULA PRs merged into repositories
SWE-bench Performance: 37.2% resolution rate on SWE-bench Verified

💡 Top Use Cases

Enterprise Software Development: Atlassian deployment with 45 engineers, ~900 merged PRs
JIRA Issue Resolution: Automated plan generation and code development for work item tracking
Collaborative AI Coding: Human-guided development maintaining engineer control and oversight
Quality Assurance Workflows: Integrated compiler/linter feedback with human review processes
Research and Development: Academic-industry collaboration for human-AI software engineering

