Advanced Features

This section covers advanced MassGen capabilities for power users, including multi-agent coordination, multimodal processing, and specialized automation features.

Overview

Advanced features in MassGen:

  • Agent diversity - Configure multiple agents with different models and behaviors

  • Agent communication - Enable agents to ask each other questions

  • Hook framework - Extend agent behavior with custom hooks for tool execution

  • Task planning - Structured task breakdown and execution

  • Subagents - Spawn parallel child processes for independent tasks

  • Planning mode - Safe execution with human approval

  • Change documents - Decision journals for traceability and attribution

  • Multimodal support - Image, audio, and video understanding

  • Computer use - Browser and desktop automation

  • Terminal evaluation - Record and evaluate terminal sessions

Guides in This Section

🎭 Agent Diversity

Configure diverse agent teams

  • Different models per agent

  • Varied system prompts

  • Specialization strategies

  • Voting and consensus

Read the Agent Diversity guide →

💬 Agent Communication

Enable inter-agent messaging

  • ask_others tool

  • Agent collaboration patterns

  • Information sharing

  • Coordination strategies

Read the Agent Communication guide →

🪝 Hook Framework

Extend agent behavior

  • PreToolUse / PostToolUse hooks

  • Content injection strategies

  • Reminder extraction

  • Custom hook development

Read the Hook Framework guide →

📋 Task Planning

Structured task execution

  • Task breakdown

  • Planning strategies

  • Execution tracking

  • Complex workflows

Read the Task Planning guide →

🔀 Subagents

Parallel child processes

  • Independent workspaces

  • Concurrent execution

  • Context file sharing

  • Result aggregation

Read the Subagents guide →

✅ Planning Mode

Safe execution with approval

  • Human-in-the-loop

  • Plan review

  • Action confirmation

  • Rollback support

Read the Planning Mode guide →

Change Documents

Decision journals for traceability

  • Why each decision was made

  • Code references per decision

  • Multi-agent attribution

  • Feature-level provenance

Read the Change Documents guide →

🖼️ Multimodal

Image, audio, and video support

  • Image understanding

  • Audio transcription

  • Video analysis

  • Multi-format input

Read the Multimodal guide →

🖥️ Computer Use

Browser and desktop automation

  • Gemini Computer Use

  • Claude Computer Use

  • Browser automation

  • Visual feedback

Read the Computer Use guide →

📺 Terminal Evaluation

Record and evaluate sessions

  • VHS recording

  • Session playback

  • Evaluation metrics

  • Demonstration creation

Read the Terminal Evaluation guide →