MassGen: Multi-Agent Scaling System for GenAI#
What is MassGen?#
MassGen is a cutting-edge multi-agent system that leverages the power of collaborative AI to solve complex tasks. It assigns a task to multiple AI agents who work in parallel, observe each other’s progress, and refine their approaches to converge on the best solution to deliver a comprehensive and high-quality result.
How It Works:
Work in Parallel - Multiple agents tackle the problem simultaneously, each bringing unique capabilities
See Recent Answers - At each step, agents view the most recent answers from other agents
Decide Next Action - Each agent chooses to provide a new answer or vote for an existing answer
Share Workspaces - When agents provide answers, their workspace is captured so others can review their work
Natural Consensus - Coordination continues until all agents vote, then the agent with most votes presents the final answer
Think of it as a “parallel study group” for AI - inspired by advanced systems like xAI’s Grok Heavy and Google DeepMind’s Gemini Deep Think. Agents learn from each other to produce better results than any single agent could achieve alone.
How Does MassGen Compare?#
MassGen vs LLM Council: While LLM Council follows a fixed 3-stage pipeline, MassGen agents autonomously decide to contribute new answers or vote for others, reaching consensus organically. Plus, MassGen agents can use tools, execute code, and read/write files in your codebase — backed by active development with regular releases. See full comparison →
Quick Start#
pip install uv # if needed
uv venv && source .venv/bin/activate
uv pip install massgen
uv run massgen # Setup wizard, then ask your first question
Rich terminal UI with real-time streaming, multi-turn conversations, and YAML configuration.
pip install uv # if needed
uv venv && source .venv/bin/activate
uv pip install massgen
uv run massgen --web # Open http://localhost:8000
Browser-based UI with real-time agent streaming, vote visualization, and workspace browsing.
from dotenv import load_dotenv
load_dotenv() # Load OPENROUTER_API_KEY from .env
import litellm
from massgen import register_with_litellm
register_with_litellm()
response = litellm.completion(
model="massgen/build",
messages=[{"role": "user", "content": "Your question"}],
optional_params={"models": ["openrouter/openai/gpt-5", "openrouter/anthropic/claude-sonnet-4.5"]}
)
print(response.choices[0].message.content)
Standard OpenAI-compatible interface for seamless integration with existing applications.
Key Features#
Use Claude, Gemini, GPT, Grok together - each agent can use a different model.
Multiple agents work simultaneously with voting and consensus detection.
Model Context Protocol for web search, code execution, file operations, and custom tools.
Full async Python API and LiteLLM integration for seamless application embedding.
Real-time terminal display showing agents’ working processes and coordination.
Interactive conversations with context preservation across turns.
Integrate external frameworks (AG2, LangGraph, AgentScope, OpenAI, SmolAgent) as tools.
Work directly with your codebase using context paths with granular read/write permissions.
Recent Releases#
v0.1.25 (December 15, 2025) - UI-TARS & Evolving Skills
UI-TARS backend for GUI automation with visual understanding. GPT-5.2 model support via OpenRouter. Evolving skill creator system for reusable workflows. Textual terminal display with adaptive dark/light themes.
v0.1.24 (December 12, 2025) - Enhanced Cost Tracking
Real-time token counting and cost calculation for OpenRouter, xAI/Grok, Gemini, and Claude Code backends. New /inspect c command displays detailed per-agent cost breakdown with input, output, reasoning, and cached tokens.
v0.1.23 (December 10, 2025) - Async Consistency & Web UI Automation Mode
Turn history inspection with /inspect commands, Web UI automation mode for programmatic workflows, Docker container persistence across turns, and improved Ctrl+C handling.
Supported Models#
Claude (Anthropic) · Gemini (Google) · GPT (OpenAI) · Grok (xAI) · Azure OpenAI · Groq · Together · LM Studio · and more…