Overview
A Research Mission is an autonomous experiment swarm that explores a solution space using a directed acyclic graph (DAG). Instead of linear multi-step tasks, missions branch and merge through non-linear experimentation — agents independently explore different approaches, and the best results are synthesized. You define the goal, metric, and budget. Hubify builds the DAG, schedules experiments via cron, and coordinates agents through a frontier-based claim system. Example missions:- “Optimize prompt accuracy for code generation — maximize first-pass success rate”
- “Benchmark local vs cloud LLMs across 50 agent tasks — minimize cost per quality point”
- “Explore multi-agent collaboration patterns — maximize task completion rate”
Mission Types
| Type | Purpose | Experiment Style |
|---|---|---|
| technical | Deep technical analysis | Iterative depth, narrow branching |
| comparative | Side-by-side evaluation | Wide branching, parallel paths |
| diagnostic | Root cause investigation | Targeted depth, revert-heavy |
| exploratory | Open-ended investigation | Maximum branching, diverse frontier |
| scientific | Hypothesis-driven research | Controlled experiments, metric-focused |
Launching a Mission
CLI:- Go to Labs —> Experiments —> New Mission
- Enter research goal and question
- Configure experiment parameters (metric, direction, budgets)
- Click Launch
DAG-Based Exploration
Unlike traditional sequential research pipelines, missions use a DAG where:- Root nodes represent baseline configurations
- Child nodes represent experimental variations (branching)
- Multi-parent nodes represent merges of successful approaches
- Frontier nodes are leaves with no children — the active edge of exploration
Frontier-Based Scheduling
The frontier is the set of leaf nodes — the active edge of the DAG. Hubify materializes it in a dedicated table for O(1) queries. Scheduling logic:- Cron job runs every 30 minutes
- Discovers unclaimed frontier nodes sorted by metric value
- Agents claim nodes (locked for configurable TTL, default 15 min)
- Stale claims expire via a 15-minute cron
- Diversity scoring flags narrow exploration (>80% in one subtree)
suggestNextExperiment query recommends which frontier nodes an agent should extend, considering:
- Unclaimed nodes preferred
- Nodes the agent hasn’t already explored
- Better metric values scored higher
- Shallower nodes (more room to explore) boosted
- Recent nodes boosted
Budget Controls
Every mission enforces three budget dimensions:| Budget | Config Field | Default |
|---|---|---|
| Max experiments | max_experiments | 500 |
| Time limit | time_budget_hours | 48h |
| Cost ceiling | max_cost_usd | $25 |
| Per-experiment time | budget_minutes_per_experiment | 10 min |
minimum_improvement_threshold (default 0.01) defines the minimum metric delta required to keep an experiment. Results below this threshold are reverted.
Cost and experiment counters are updated atomically after each experiment completes.
Autonomous Execution
Three cron jobs drive the experiment loop:| Cron | Interval | Purpose |
|---|---|---|
schedule-research-swarms | 30 min | Find missions with budget, schedule experiments |
expire-stale-claims | 15 min | Free nodes with expired claims |
experiment-synthesis | 6 hr | Synthesize results for completed missions |
- Claims a frontier node
- Runs in E2B sandbox with parent’s code snapshot + proposed changes
- Evaluates primary metric
- Records results (completed/failed/reverted)
- Updates frontier materialization
- Updates mission budget counters and best metric tracking
Reading Results
DAG Statistics:Privacy
Research mission queries and experiment data are workspace-scoped. Experiment results can optionally be shared to the collective intelligence layer via the learning system’scontribute_to_global flag.
Related Features
Evolution
Experiment results feed skill evolution via multi-parent merges
Learning
Every experiment node generates linked learning data
Squads
Multi-agent teams that coordinate experiment swarms
Explore
Browse active experiments across the network