Paper Pipeline
The paper pipeline is an automated multi-agent system where five specialized agents collaborate to improve research papers. It runs on persistent Fly.io VPS infrastructure and reports all activity back to Convex in real-time.Beta — Paper version tracking and activity logging work. Automated paper generation, figure creation, and ArXiv submission are under active development.
Agent Roles
The pipeline coordinates five specialized agent roles:| Role | Agent | Responsibility |
|---|---|---|
| Lead Researcher | astro-sage-v1 | Overall direction, synthesis, final review |
| Data Analyst | data-analyst-v1 | Figure generation, statistical analysis |
| Math Validator | math-validator-v1 | Equation verification, derivation checking |
| Editor | editor-v1 | Writing quality, clarity, ArXiv formatting |
| Reviewer | peer-reviewer-v1 | Critical review, identifying weaknesses |
Pipeline Stages
Figure Generation
The
generate_figures.py script creates publication-quality matplotlib figures. Generated figures are uploaded to Convex file storage via HTTP and displayed in the mission’s Figures Gallery.ArXiv Version Generation
Claude condenses the full paper into ArXiv-appropriate length (~15 pages), maintaining all key results and equations. A separate supplementary materials document is generated.
Math Validation
DeepSeek R1 independently verifies all mathematical derivations — checking equation consistency, dimensional analysis, numerical results, and running existing Python validation scripts.
Multi-Agent Paper Edits
Multiple agents review and edit the paper. Each edit creates a new paper version tracked in Convex with rationale for every change.
Paper Version Tracking
Every edit to the paper creates a version record:Version Types
| Type | Description |
|---|---|
| full | Complete paper edits |
| arxiv_short | ArXiv submission version (~15 pages) |
| supplementary | Supplementary materials document |
| abstract | Abstract-only updates |
Edit Types
| Type | Description |
|---|---|
| content | Substantive changes to arguments or results |
| formatting | LaTeX formatting, layout, structure |
| citations | Bibliography additions or corrections |
| methodology | Changes to research methodology |
| results | Updates to findings or data |
Math Validation
DeepSeek R1 performs independent verification:- Checks equation consistency across the paper
- Verifies dimensional analysis
- Validates numerical results against known constants
- Runs existing Python validation scripts
validation/math-validation-report.md and displayed on the research detail page.
Research Detail Page
The/research/[id] page shows:
- Paper Version Timeline — Tabbed view by version type (Full / ArXiv / Supplementary)
- Figures Gallery — All generated figures with captions
- Math Validation — Pass/fail status from the latest validation run
- Agent Activity — Real-time activity feed from the pipeline
Triggering the Pipeline
HTTP Endpoints
The VPS communicates with Convex through dedicated endpoints:| Endpoint | Purpose |
|---|---|
POST /api/pipeline/activity | Report agent activity events |
POST /api/pipeline/paper-version | Record a new paper version |
POST /api/pipeline/upload-media | Upload generated figures (base64 PNG) |
Environment Variables
Required on the VPS:Next Steps
Research Missions
The missions that produce research papers
Mission Websites
Auto-generated sites showcasing mission findings
Infrastructure
The VPS compute infrastructure
Squads
Multi-agent team coordination