MassGen: Multi-Agent Scaling System for GenAI#

MassGen Logo MassGen Logo

PyPI GitHub Stars Python 3.11+ License

Follow on X Follow on LinkedIn Join our Discord


MassGen Demo - Multi-agent collaboration in action (4x speed) MassGen Demo - Multi-agent collaboration in action (4x speed)

What is MassGen?#

MassGen is a cutting-edge multi-agent system that leverages the power of collaborative AI to solve complex tasks. It assigns a task to multiple AI agents who work in parallel, observe each other’s progress, and refine their approaches to converge on the best solution to deliver a comprehensive and high-quality result.

How It Works:

  • Work in Parallel - Multiple agents tackle the problem simultaneously, each bringing unique capabilities

  • See Recent Answers - At each step, agents view the most recent answers from other agents

  • Decide Next Action - Each agent chooses to provide a new answer or vote for an existing answer

  • Share Workspaces - When agents provide answers, their workspace is captured so others can review their work

  • Natural Consensus - Coordination continues until all agents vote, then the agent with most votes presents the final answer

MassGen is a cutting-edge multi-agent framework that coordinates AI agents through redundancy and iterative refinement. Agents tackle the full problem, observe and build on each other’s work across cycles of refinement and restarts, then vote — and the best collectively validated answer wins. This lays the groundwork for principled multi-agent scaling and self-improvement.

See visual comparisons between MassGen and single-agent solutions, highlighting how MassGen unifies different agentic approaches for better outcomes.

Use MassGen from Claude Code, Codex, Copilot, Cursor, and other AI coding agents.

Note

For AI agents and crawlers: This site publishes a curated llms.txt index following the llmstxt.org spec, plus a concatenated llms-full.txt dump of the user guide and reference docs.

How Does MassGen Compare?#

MassGen sits in a different design space than typical multi-agent frameworks. The core differentiator across the board is parallel attempts with voting and consensus — agents tackle the same task in parallel, observe each other, and converge on a winner — backed by tools, code execution, filesystem integration, and active development.

  • MassGen vs LLM Council — dynamic voting / consensus vs a fixed 3-stage pipeline (responses → ranking → chairman synthesis).

  • MassGen vs CrewAI — parallel refinement on one task vs role-based decomposition into sub-tasks.

  • MassGen vs LangGraph — a pre-built parallel + voting protocol vs a low-level graph runtime you author yourself.

  • MassGen vs AutoGen / AG2 — parallel attempts with collective validation vs conversation-based multi-agent message passing.

Quick Start#

pip install uv        # if needed
uv venv && source .venv/bin/activate
uv pip install massgen
uv run massgen        # Setup wizard, then ask your first question

Rich terminal UI with real-time streaming, multi-turn conversations, and YAML configuration.

Installation · Running MassGen · Configuration

Video Tutorials#

Learn how to install, configure, and run your first multi-agent collaboration with MassGen.

Explore how to build custom agents and tools with MassGen.

Key Features#

🤝 Cross-Model Synergy

Use Claude, Gemini, GPT, Grok together - each agent can use a different model.

⚡ Parallel Coordination

Multiple agents work simultaneously with voting and consensus detection.

🛠️ Tools & MCP

Model Context Protocol for web search, code execution, file operations, and custom tools.

🐍 Python & LiteLLM

Full async Python API and LiteLLM integration for seamless application embedding.

📊 Live Visualization

Real-time terminal display showing agents’ working processes and coordination.

💬 Multi-Turn Sessions

Interactive conversations with context preservation across turns.

🔗 Framework Interoperability

Integrate external frameworks (AG2, LangGraph, AgentScope, OpenAI, SmolAgent) as tools.

📁 Project Integration

Work directly with your codebase using context paths with granular read/write permissions.

Recent Releases#

v0.1.87 (May 15, 2026) - Documentation: Framework Comparisons & llms.txt

Three new “MassGen vs …” comparison pages (CrewAI, LangGraph, AutoGen/AG2), a curated llms.txt index plus a full-corpus llms-full.txt dump for AI agents and crawlers (per llmstxt.org spec), and a one-line refine=False fix for the bootstrap_subagent discriminator.

v0.1.86 (May 13, 2026) - bootstrap_subagent Discriminator + Codex MCP Approval Fix

The critic-driven criteria path is now functional: orchestrator.coordination.criteria_mode: bootstrap_subagent runs an in-process LLM discriminator between rounds, merges proposed criteria into the accumulator, and augments the next round’s checklist automatically. Codex MCP tool calls under codex exec now get the non-interactive approval bypasses needed for external workflow tools.

v0.1.85 (May 11, 2026) - Discriminative Criteria Emergence (criteria_mode)

New orchestrator.coordination.criteria_mode option lets evaluation criteria emerge from observed gaps across rounds instead of being pre-authored. The bootstrap_inline variant is fully functional on all backends with checklist tool support — agents emit proposed_criteria alongside submit_checklist, the accumulator dedupes/caps, and the next round’s checklist is augmented automatically.

Full changelog →

Supported Models#

Claude (Anthropic) · Gemini (Google) · GPT (OpenAI) · Grok (xAI) · Azure OpenAI · Groq · Together · LM Studio · and more…

Documentation#