MAST: A Failure Taxonomy for Multi-Agent Systems

AI
LLMs
agentic systems
evaluation
paper
links
First empirically grounded taxonomy of why MAS fail — 14 modes across specification, alignment, and verification — and small interventions that move the needle 9–16%.
Author

synesis

Published

August 2, 2025

Multi-agent systems (MAS) promise enhanced capabilities through collaboration, but often fail to vastly outperform single-agent setups. What are their failure modes and how do we mitigate them?

A recent paper by Mert Cemri, Melissa Pan, Shuyi Yang, Lakshya A Agrawal, Bhavya Chopra, Rishabh Tiwari, Kurt Keutzer, Aditya Parameswaran, Dan Klein, Kannan Ramchandran, Matei Zaharia, Joseph Gonzalez, and Ion Stoica from University of California, Berkeley introduces MAST (Multi-Agent System Failure Taxonomy), the first empirically grounded framework for diagnosing MAS failures [1]:

MAST moves MAS development closer to science and engineering rather than guesswork.


References

[1] Cemri, Mert, et al. “Why Do Multi-Agent LLM Systems Fail?” arXiv, 2025. https://arxiv.org/abs/2503.13657

Originally posted on LinkedIn.