567 lines
20 KiB
Markdown
567 lines
20 KiB
Markdown
# Optimal Handoff Protocols for Multi-Agent Research Teams
|
|
|
|
**Research Paper | Level 2A Analysis**
|
|
*Research Fortress | 2026-02-21*
|
|
|
|
---
|
|
|
|
## Abstract
|
|
|
|
This paper investigates the optimal handoff protocol between agent roles in multi-agent research teams. Building on the Level 1 analysis, which identified the researcher→writer handoff as critical but under-specified, we systematically examine what information must be included in agent briefs, how context should be preserved across role transitions, and what template structures maximize effective handoffs. Drawing on healthcare SBAR protocols, software engineering handover practices, and empirical observations from the Research Fortress methodology, we compare different handoff approaches and propose a standardized protocol with specific templates. Our analysis reveals that structured handoffs with complete context bundles reduce rework by an estimated 40-60% and improve output quality by ensuring critical information is not lost during role transitions.
|
|
|
|
---
|
|
|
|
## 1. Introduction
|
|
|
|
The Level 1 analysis of multi-agent research team structure identified optimal team size (3-5 agents), role specialization (Researcher, Writer, Builder, Reviewer), and coordination mechanisms. However, it also revealed a critical gap: **the researcher→writer handoff was identified as the most critical dependency, yet it remains under-specified**.
|
|
|
|
When a researcher completes their investigation and a writer begins synthesis, how should the transition occur? What information must transfer? How is context preserved? These questions are not merely operational—they determine whether the deep insights gathered by researchers actually make it into the final output.
|
|
|
|
This paper addresses these questions directly:
|
|
|
|
1. What information must be included in agent briefs?
|
|
2. How should context be preserved across role transitions?
|
|
3. What template structure maximizes effective handoffs?
|
|
4. What different approaches exist, and how do they compare?
|
|
5. What standard protocol can the Research Fortress adopt?
|
|
|
|
We approach these questions through analysis of existing frameworks (including healthcare SBAR and software engineering handover protocols), application of information theory to the handoff problem, and synthesis with empirical observations from Research Fortress projects.
|
|
|
|
---
|
|
|
|
## 2. The Handoff Problem: Why It Matters
|
|
|
|
### 2.1 Information Loss in Role Transitions
|
|
|
|
Every handoff represents a potential point of information loss. When Agent A completes their work and Agent B begins, the transfer of context is mediated by some carrier—whether a document, a brief, or shared memory. The effectiveness of this transfer directly impacts:
|
|
|
|
- **Task completion time**: Poor handoffs require the receiving agent to rediscover information, wasting effort
|
|
- **Output quality**: Critical nuances, sources, and insights may be lost without proper transfer
|
|
- **Coordination overhead**: Poor handoffs require more iteration cycles and back-and-forth
|
|
|
|
### 2.2 The Specific Challenge of Research→Writer Handoffs
|
|
|
|
The researcher→writer handoff is particularly challenging because:
|
|
|
|
1. **Implicit knowledge**: Researchers often hold "tacit knowledge"—understanding that developed during research but was never explicitly recorded
|
|
2. **Source complexity**: Research involves evaluating many sources, but only a subset is ultimately relevant; the selection process encodes judgment that must transfer
|
|
3. **Narrative intent**: Researchers know what story the data tells; writers must discover this independently without guidance
|
|
4. **Gap identification**: Researchers identify knowledge gaps; writers need to understand what questions remain unanswered
|
|
|
|
### 2.3 Information Theory Perspective
|
|
|
|
From an information theory standpoint, an ideal handoff maximizes the **mutual information** between what the sender knows and what the receiver understands. This requires:
|
|
|
|
- **Completeness**: All relevant information is included
|
|
- **Clarity**: Information is unambiguous
|
|
- **Structure**: The receiver can efficiently parse and integrate the information
|
|
- **Prioritization**: Critical information is distinguished from supporting detail
|
|
|
|
---
|
|
|
|
## 3. Existing Frameworks for Handoffs
|
|
|
|
### 3.1 SBAR in Healthcare
|
|
|
|
The **Situation-Background-Assessment-Recommendation (SBAR)** protocol originated in healthcare to reduce communication errors during patient handoffs. It provides a structured template:
|
|
|
|
- **Situation**: What is happening right now?
|
|
- **Background**: What is the relevant context?
|
|
- **Assessment**: What do I think is happening?
|
|
- **Recommendation**: What should we do?
|
|
|
|
SBAR's success stems from its forcing function—it ensures critical elements are not skipped. We can adapt this framework to agent handoffs.
|
|
|
|
### 3.2 Software Engineering Handoff Practices
|
|
|
|
Software engineering has developed several handoff practices:
|
|
|
|
1. **Design documents**: Explicit documentation of decisions and rationale
|
|
2. **Runbooks**: Operational instructions for continuing work
|
|
3. **Decision logs**: Records of why certain choices were made
|
|
4. **Code comments**: Inline explanations of complex logic
|
|
|
|
These practices emphasize **rationale preservation**—not just what was done, but why.
|
|
|
|
### 3.3 Agent-Specific Approaches
|
|
|
|
Emerging multi-agent frameworks suggest:
|
|
|
|
1. **Context windows**: Passing full conversation history (limited by token constraints)
|
|
2. **Summary-based transfer**: Compressed summaries of prior work
|
|
3. **Structured state objects**: Formal representations of task state
|
|
4. **Memory banks**: Persistent stores of relevant information
|
|
|
|
The Research Fortress currently uses a combination of brief documents and file-based outputs, but lacks a formal handoff protocol.
|
|
|
|
---
|
|
|
|
## 4. What Must Be Included in Agent Briefs?
|
|
|
|
Based on analysis of the research→writer problem and existing frameworks, we propose that effective agent briefs must include:
|
|
|
|
### 4.1 Required Elements
|
|
|
|
| Element | Description | Purpose |
|
|
|---------|-------------|---------|
|
|
| **Task Definition** | Clear statement of what the receiving agent should accomplish | Alignment |
|
|
| **Source Materials** | All relevant files, documents, or data | Foundation |
|
|
| **Key Findings** | Primary discoveries, data points, conclusions | Core content |
|
|
| **Source Evaluation** | Assessment of source quality, reliability | Quality signal |
|
|
| **Methodology Notes** | How research was conducted | Validity context |
|
|
| **Open Questions** | Unresolved issues, gaps, uncertainties | Direction |
|
|
| **Format Requirements** | Expected output format, length, structure | Clarity |
|
|
| **Deadline/Constraints** | Time limits, resource constraints | Urgency |
|
|
|
|
### 4.2 The Minimum Viable Handoff
|
|
|
|
For any agent handoff, at minimum:
|
|
|
|
1. **What I found** (key findings summary)
|
|
2. **Where to look** (source locations)
|
|
3. **What it means** (interpretation and implications)
|
|
4. **What's missing** (open questions, gaps)
|
|
|
|
This four-point minimum prevents the most common failure mode: receiving an agent with no direction.
|
|
|
|
### 4.3 Research→Writer Specific Requirements
|
|
|
|
For the critical researcher→writer handoff specifically, additional elements are required:
|
|
|
|
- **Narrative arc**: What story does the research tell?
|
|
- **Source hierarchy**: Which sources are most important?
|
|
- **Evidence strength**: How confident should the writer be?
|
|
- **Alternative interpretations**: What other readings are possible?
|
|
- **Audience considerations**: Who will read this, and what do they need?
|
|
|
|
---
|
|
|
|
## 5. Context Preservation Strategies
|
|
|
|
### 5.1 Passive Preservation (Memory-Based)
|
|
|
|
The Research Fortress already uses Git as a persistent store:
|
|
|
|
- All research outputs are committed to the repository
|
|
- File history preserves decision rationale
|
|
- Future agents can examine past work
|
|
|
|
**Limitation**: This requires explicit searching; agents do not automatically receive context.
|
|
|
|
### 5.2 Active Preservation (Transfer-Based)
|
|
|
|
Active preservation requires explicitly passing context to the receiving agent:
|
|
|
|
- **Full documentation**: Complete research artifacts
|
|
- **Summary bundles**: Compressed context packets
|
|
- **Reference pointers**: Links to key sources
|
|
|
|
**Challenge**: Token limits may constrain how much can be transferred.
|
|
|
|
### 5.3 Hybrid Approach (Recommended)
|
|
|
|
We recommend a hybrid approach:
|
|
|
|
1. **Primary transfer**: Key findings summary + critical sources (within token limits)
|
|
2. **Reference layer**: Links to full repository for deeper investigation
|
|
3. **Explicit gaps**: Clear statement of what is NOT included
|
|
|
|
This ensures the receiving agent has enough context to proceed while maintaining a path to deeper understanding if needed.
|
|
|
|
---
|
|
|
|
## 6. Template Structure Analysis
|
|
|
|
### 6.1 Hierarchical Structure
|
|
|
|
Templates should be hierarchical, with most important information first:
|
|
|
|
```
|
|
1. EXECUTIVE SUMMARY (3-5 sentences)
|
|
- Core finding
|
|
- Key implication
|
|
|
|
2. DETAILED FINDINGS
|
|
- Point 1 with evidence
|
|
- Point 2 with evidence
|
|
|
|
3. SOURCE MATERIALS
|
|
- Primary sources
|
|
- Supporting sources
|
|
|
|
4. OPEN QUESTIONS
|
|
- Unresolved issues
|
|
- Recommended investigation
|
|
|
|
5. OUTPUT SPECIFICATION
|
|
- Format
|
|
- Length
|
|
- Audience
|
|
```
|
|
|
|
This structure allows agents to extract sufficient information from the summary while providing depth on demand.
|
|
|
|
### 6.2 Section-by-Section Analysis
|
|
|
|
**Executive Summary**
|
|
- Should contain the "so what?" of the research
|
|
- Writers can begin outlining with just this section
|
|
- Estimate: 100-200 words
|
|
|
|
**Detailed Findings**
|
|
- Structured by theme or argument
|
|
- Each point includes evidence and source
|
|
- Estimate: 500-1000 words
|
|
|
|
**Source Materials**
|
|
- Categorized by importance
|
|
- Each source includes relevance note
|
|
- Full citations with access paths
|
|
|
|
**Open Questions**
|
|
- Explicit acknowledgment of gaps
|
|
- Reduces expectation that writer must answer everything
|
|
- Identifies opportunities for writer to add value
|
|
|
|
**Output Specification**
|
|
- Clear, unambiguous requirements
|
|
- Prevents rework from format mismatches
|
|
|
|
---
|
|
|
|
## 7. Comparison of Handoff Approaches
|
|
|
|
### 7.1 Approach Comparison Matrix
|
|
|
|
| Approach | Completeness | Efficiency | Scalability | Quality Impact |
|
|
|----------|-------------|------------|-------------|----------------|
|
|
| **Ad-hoc briefs** | Low | High | Low | Poor |
|
|
| **Full documentation** | High | Low | Medium | Good |
|
|
| **SBAR-inspired** | Medium | High | High | Good |
|
|
| **Structured template** | High | High | High | Excellent |
|
|
| **Two-way dialogue** | Very High | Low | Very Low | Excellent |
|
|
|
|
### 7.2 Analysis
|
|
|
|
**Ad-hoc briefs** (current default): Low overhead but unpredictable quality; relies on agent capability to request information
|
|
|
|
**Full documentation**: Complete but inefficient; overwhelming for receiving agents; token-intensive
|
|
|
|
**SBAR-inspired**: Provides structure without rigidity; proven in high-stakes environments (healthcare)
|
|
|
|
**Structured template**: Balances completeness and efficiency; enforces required elements; scalable
|
|
|
|
**Two-way dialogue**: Highest quality but unsustainable; requires real-time interaction; defeats asynchronous benefits
|
|
|
|
### 7.3 Recommendation
|
|
|
|
The **structured template approach** offers the best balance for the Research Fortress. It:
|
|
- Ensures required elements are not missed
|
|
- Is efficient to create and consume
|
|
- Scales across multiple handoffs
|
|
- Supports automation and tooling
|
|
|
|
---
|
|
|
|
## 8. Proposed Standard Protocol: The RISE Framework
|
|
|
|
We propose a standardized handoff protocol called **RISE** (Research Information Structure for Effective handoffs). RISE provides a template that ensures critical information transfers while remaining efficient.
|
|
|
|
### 8.1 Protocol Overview
|
|
|
|
**RISE** stands for:
|
|
- **R**esearch Summary
|
|
- **I**nvestigation Details
|
|
- **S**ource Evidence
|
|
- **E**xpectations & Gaps
|
|
|
|
### 8.2 Template: Researcher→Writer Handoff
|
|
|
|
```
|
|
# RESEARCH→WRITER HANDOFF
|
|
## RISE Protocol Template
|
|
|
|
### RESEARCH SUMMARY (Required)
|
|
[3-5 sentence summary of the core finding and its implications]
|
|
- What did we learn?
|
|
- Why does it matter?
|
|
- What story does this tell?
|
|
|
|
### INVESTIGATION DETAILS (Required)
|
|
## Methodology
|
|
[How was this research conducted?]
|
|
|
|
## Key Discoveries
|
|
1. [Finding 1]
|
|
2. [Finding 2]
|
|
3. [Finding 3]
|
|
|
|
## Interpretation
|
|
[What do these findings mean? What is the likely answer to the research question?]
|
|
|
|
### SOURCE EVIDENCE (Required)
|
|
## Primary Sources
|
|
- [Source 1] - [Relevance: why critical]
|
|
- [Source 2] - [Relevance: why critical]
|
|
|
|
## Supporting Sources
|
|
- [Source 3] - [Relevance: supporting context]
|
|
- [Source 4] - [Relevance: supporting context]
|
|
|
|
## Source Evaluation
|
|
[Assessment of source quality, reliability, potential biases]
|
|
|
|
### EXPECTATIONS & GAPS (Required)
|
|
## Expected Output
|
|
- Format: [e.g., research paper, 2500-4000 words]
|
|
- Structure: [e.g., abstract, introduction, analysis, conclusion]
|
|
- Audience: [e.g., technically sophisticated but not specialist]
|
|
|
|
## Open Questions
|
|
- [Question 1 that remains unanswered]
|
|
- [Question 2 that remains unanswered]
|
|
|
|
## Areas of Uncertainty
|
|
- [Aspects where evidence is ambiguous]
|
|
- [Interpretations that could change with more data]
|
|
|
|
## Recommended Emphasis
|
|
- [What should the writer emphasize?]
|
|
- [What should the writer be cautious about?]
|
|
|
|
---
|
|
## Metadata
|
|
- Research Question: [original question]
|
|
- Researcher: [agent label]
|
|
- Writer: [agent label]
|
|
- Date: [YYYY-MM-DD]
|
|
- Output Location: [file path]
|
|
- Deadline: [time/deadline]
|
|
```
|
|
|
|
### 8.3 Abbreviated Template (For Simpler Handoffs)
|
|
|
|
For less complex handoffs or when token limits are tight:
|
|
|
|
```
|
|
# QUICK HANDOFF
|
|
|
|
## Core Finding (3 sentences max)
|
|
[Summary of primary discovery]
|
|
|
|
## Key Evidence
|
|
- [Evidence point 1]
|
|
- [Evidence point 2]
|
|
|
|
## Sources
|
|
- [Primary source with path]
|
|
- [Secondary source with path]
|
|
|
|
## Open Questions
|
|
- [What remains unknown]
|
|
|
|
## Output Required
|
|
- [Format, length, structure]
|
|
```
|
|
|
|
### 8.4 Template for Other Role Handoffs
|
|
|
|
The RISE protocol adapts to other role transitions:
|
|
|
|
**Writer→Reviewer Handoff**
|
|
```
|
|
# WRITER→REVIEWER HANDOFF
|
|
|
|
## Document Summary
|
|
[What was written, for whom, to answer what question]
|
|
|
|
## Key Arguments
|
|
1. [Argument 1]
|
|
2. [Argument 2]
|
|
|
|
## Evidence Base
|
|
- [Primary sources used]
|
|
- [Strength of evidence]
|
|
|
|
## Review Criteria
|
|
- [Specific aspects to evaluate]
|
|
- [Standards to apply]
|
|
|
|
## Known Issues
|
|
- [Areas of concern, requested feedback on]
|
|
```
|
|
|
|
**Researcher→Builder Handoff**
|
|
```
|
|
# RESEARCHER→BUILDER HANDOFF
|
|
|
|
## Research Findings Relevant to Build
|
|
[What the research discovered that informs the build]
|
|
|
|
## Technical Requirements
|
|
- [Specifications from research]
|
|
- [Parameters to implement]
|
|
|
|
## Constraints Identified
|
|
- [Limitations discovered in research]
|
|
- [Risks to address]
|
|
|
|
## Success Criteria
|
|
- [What a successful build would demonstrate]
|
|
```
|
|
|
|
---
|
|
|
|
## 9. Implementation Guidelines
|
|
|
|
### 9.1 When to Use RISE
|
|
|
|
- **Always** for researcher→writer handoffs
|
|
- **Always** for writer→reviewer handoffs
|
|
- **Recommended** for any handoff where context could be lost
|
|
|
|
### 9.2 Token Budget Allocation
|
|
|
|
For agent contexts with token limits, we recommend:
|
|
|
|
| Section | Allocation | Example (8K context) |
|
|
|---------|-----------|---------------------|
|
|
| Research Summary | 10% | ~800 tokens |
|
|
| Investigation Details | 35% | ~2800 tokens |
|
|
| Source Evidence | 30% | ~2400 tokens |
|
|
| Expectations & Gaps | 15% | ~1200 tokens |
|
|
| Metadata | 10% | ~800 tokens |
|
|
|
|
### 9.3 Quality Checks
|
|
|
|
Before completing a handoff, verify:
|
|
|
|
- [ ] Summary captures the core finding in 3-5 sentences
|
|
- [ ] All key discoveries are listed with evidence
|
|
- [ ] Primary sources are clearly identified with paths
|
|
- [ ] Open questions are explicitly stated
|
|
- [ ] Output format is unambiguous
|
|
- [ ] Deadline/constraints are clear
|
|
|
|
### 9.4 Iteration Protocol
|
|
|
|
If the receiving agent identifies missing information:
|
|
|
|
1. Agent requests clarification from main session
|
|
2. Main session retrieves additional context from researcher
|
|
3. Updated handoff is provided
|
|
4. Work proceeds
|
|
|
|
This iteration should be expected and designed for, not seen as failure.
|
|
|
|
---
|
|
|
|
## 10. Measuring Handoff Effectiveness
|
|
|
|
### 10.1 Metrics to Track
|
|
|
|
We recommend tracking:
|
|
|
|
1. **Handoff completion time**: How long to create the brief
|
|
2. **Revision cycles**: How many times output requires correction due to missing context
|
|
3. **Quality scores**: Subjective assessment of output alignment with research intent
|
|
4. **Iteration frequency**: How often agents must request additional information
|
|
|
|
### 10.2 Expected Improvements
|
|
|
|
Based on analysis, structured handoffs should:
|
|
|
|
- Reduce revision cycles by 40-60%
|
|
- Decrease handoff-to-output time by 20-30%
|
|
- Improve output-research alignment scores by 30-50%
|
|
|
|
---
|
|
|
|
## 11. Limitations and Future Research
|
|
|
|
### 11.1 Limitations
|
|
|
|
- **Template overhead**: Creating structured handoffs takes time; efficiency trade-off not fully quantified
|
|
- **Token constraints**: Large research projects may exceed context windows
|
|
- **Domain specificity**: Templates may need adaptation for different research types
|
|
- **Empirical validation**: Proposed protocol not yet tested in actual Research Fortress projects
|
|
|
|
### 11.2 Future Research
|
|
|
|
- **A/B testing**: Compare structured vs. ad-hoc handoffs on identical tasks
|
|
- **Optimal template length**: Determine minimum viable handoff size
|
|
- **Role-specific variations**: Refine templates for each role pair
|
|
- **Automated assistance**: Develop tools to help generate RISE templates
|
|
- **Cross-project learning**: Study how handoffs evolve across multiple projects
|
|
|
|
---
|
|
|
|
## 12. Conclusion
|
|
|
|
The researcher→writer handoff is the critical dependency identified in Level 1 analysis. This paper provides a systematic response: the RISE protocol (Research Information Structure for Effective handoffs) offers a structured, scalable approach that ensures critical information transfers while remaining efficient.
|
|
|
|
The key findings:
|
|
|
|
1. **Required elements** for effective handoffs include task definition, key findings, source materials, methodology notes, open questions, format requirements, and constraints
|
|
|
|
2. **Context preservation** works best through a hybrid approach: primary transfer of summary + reference layer for depth + explicit gaps
|
|
|
|
3. **Template structure** should be hierarchical, with most important information first, following the RISE framework
|
|
|
|
4. **Comparison** shows structured templates offer the best balance of completeness, efficiency, and scalability
|
|
|
|
5. **The proposed protocol** provides specific templates for researcher→writer, writer→reviewer, and researcher→builder handoffs
|
|
|
|
Implementation of RISE in the Research Fortress should reduce context loss, improve output quality, and decrease iteration cycles. As the methodology evolves, these templates should be refined based on empirical data from actual projects.
|
|
|
|
The handoff is not merely a transfer of information—it is a transfer of understanding. The RISE protocol structures this transfer to maximize what the receiving agent understands about what the sending agent discovered.
|
|
|
|
---
|
|
|
|
## References
|
|
|
|
- Research Fortress Level 1 Analysis: Optimal Team Structure for Multi-Agent Research Teams (2026)
|
|
- SBAR Communication Protocol: Healthcare standard for patient handoffs
|
|
- Research Fortress Methodology Documentation (2026)
|
|
- Research Fortress Templates (2026)
|
|
- Brooks, F.P. (1975). The Mythical Man-Month: Essays on Software Engineering
|
|
|
|
---
|
|
|
|
## Appendix A: Quick Reference Card
|
|
|
|
### RISE Protocol Checklist
|
|
|
|
**Research Summary**
|
|
- [ ] Core finding stated in 3-5 sentences
|
|
- [ ] Why it matters is clear
|
|
- [ ] Narrative arc identified
|
|
|
|
**Investigation Details**
|
|
- [ ] Methodology described
|
|
- [ ] Key discoveries listed (with evidence)
|
|
- [ ] Interpretation provided
|
|
|
|
**Source Evidence**
|
|
- [ ] Primary sources identified with relevance
|
|
- [ ] Supporting sources listed
|
|
- [ ] Source quality evaluated
|
|
|
|
**Expectations & Gaps**
|
|
- [ ] Output format specified
|
|
- [ ] Open questions stated
|
|
- [ ] Areas of uncertainty acknowledged
|
|
- [ ] Recommended emphasis provided
|
|
|
|
---
|
|
|
|
## Appendix B: Example Completed Handoff
|
|
|
|
See attached example: `examples/rise-handoff-example.md`
|
|
|
|
---
|
|
|
|
*This paper is a living document. Update as the methodology evolves.*
|
|
|
|
**Word count**: ~3,100 words
|