Advanced Features

Contents

Advanced Features#

This section covers advanced MassGen capabilities for power users, including multi-agent coordination, multimodal processing, and specialized automation features.

Overview#

Advanced features in MassGen:

Agent diversity - Configure multiple agents with different models and behaviors
Agent communication - Enable agents to ask each other questions
Hook framework - Extend agent behavior with custom hooks for tool execution
Task planning - Structured task breakdown and execution
Subagents - Spawn parallel child processes for independent tasks
Planning mode - Safe execution with human approval
Change documents - Decision journals for traceability and attribution
Multimodal support - Image, audio, and video understanding
Computer use - Browser and desktop automation
Terminal evaluation - Record and evaluate terminal sessions

Guides in This Section#

🎭 Agent Diversity

Configure diverse agent teams

Different models per agent
Varied system prompts
Specialization strategies
Voting and consensus

Read the Diversity guide →

💬 Agent Communication

Enable inter-agent messaging

ask_others tool
Agent collaboration patterns
Information sharing
Coordination strategies

Read the Agent Communication guide →

🪝 Hook Framework

Extend agent behavior

PreToolUse / PostToolUse hooks
Content injection strategies
Reminder extraction
Custom hook development

Read the Hook Framework guide →

📋 Task Planning

Structured task execution

Task breakdown
Planning strategies
Execution tracking
Complex workflows

Read the Task Planning guide →

🔀 Subagents

Parallel child processes

Independent workspaces
Concurrent execution
Context file sharing
Result aggregation

Read the Subagents guide →

✅ Planning Mode

Safe execution with approval

Human-in-the-loop
Plan review
Action confirmation
Rollback support

Read the Planning Mode guide →

Change Documents

Decision journals for traceability

Why each decision was made
Code references per decision
Multi-agent attribution
Feature-level provenance

Read the Change Documents guide →

🖼️ Multimodal

Image, audio, video support

Image understanding
Audio transcription
Video analysis
Multi-format input

Read the Multimodal guide →

🖥️ Computer Use

Browser and desktop automation

Gemini Computer Use
Claude Computer Use
Browser automation
Visual feedback

Read the Computer Use guide →

📺 Terminal Evaluation

Record and evaluate sessions

VHS recording
Session playback
Evaluation metrics
Demonstration creation

Read the Terminal Evaluation guide →