US Army Researchers Develop A New Framework For Collaborative Multi-Agent Reinforcement Learning Systems

June 22, 2021

Centralized learning for multi-agent systems highly depends on information-sharing mechanisms. However, there have not been significant studies within the research community in this domain.

Army researchers collaborate to propose a framework that provides a baseline for the development of collaborative multi-agent systems. The team involved Dr. Piyush K. Sharma, Drs. Erin Zaroukian, Rolando Fernandez, Derrik Asherat, Michael Dorothy from DEVCOM, Army Research Laboratory, and Anjon Basak, a postdoctoral fellow from the Oak Ridge Associated Universities fellowship program.

The team’s survey in reinforcement learning (RL) algorithms and their information sharing paradigms serves as a basis to question centralized learning for multi-agent systems that would improve their ability to work together.

Studies show that training various agents together is quite challenging. This is because the dynamic nature of complex environments suffers from dimensionality. So, increasing the number of agents while training can complicate the coordination. Moreover, information-sharing parameters are confusing and difficult to understand.

This study surpasses previous research by providing a consolidated view of the latest SOTA in RL algorithms and establishing a novel approach to define information shared during centralized learning.

Their paper, “Survey of recent multi-agent reinforcement learning algorithms utilizing centralized training,” introduces a model that can efficiently characterize the essential information-sharing parameters. The researchers suggest that centralization in training can provide us with a suitable solution with developing autonomous systems. They explain that consistent, centralized training can result in multi-agent systems that work more reliably together, increasing trust levels from the soldier of the AI.

The team investigated recent centralized learning algorithms and focused on identifying and characterizing the underlying mathematical framework. They believe these mathematical frameworks can help explore alternate centralized learning techniques to gauge their effect on learning and emergent collaborative behaviors. They surveyed the algorithms published in the last five to six years and stated that they have not yet been explored extensively as these algorithms are pretty recent. This was the major reason for exploring them.

Instead of focusing on how things are shared, they defined and categorized the mechanisms for sharing, orienting on what is being shared. They assert that they have identified gaps in the recent RL techniques that can improve the process of training agents. This work will help in training autonomous multi-agent systems. They also aim to investigate particular aspects of multi-agent RL methods that train agents in a centralized fashion.

Sharma states that centralized techniques have many limitations. Therefore, the plan to conduct an empirical analysis of existing decentralized learning techniques. They will model and simulate multi-agent RL training to validate and extend theories of agent learning, behavior, and coordination.

The team believes that their survey will help researchers develop RL techniques for collaborative multi-agent systems, including units of robots that could work along with soldiers in the future.

Source: https://www.army.mil/article/247261/army_researchers_develop_innovative_framework_for_training_ai

Paper: https://www.spiedigitallibrary.org/conference-proceedings-of-spie/11746/2585808/Survey-of-recent-multi-agent-reinforcement-learning-algorithms-utilizing-centralized/10.1117/12.2585808.short?SSO=1&tab=ArticleLinkCited

Shilpi Anand

Website | + posts

Shilpi is a Contributor to Marktechpost.com. She is currently pursuing her third year of B.Tech in computer science and engineering from IIT Bhubaneswar. She has a keen interest in exploring latest technologies. She likes to write about different domains and learn about their real life applications.

US Army Researchers Develop A New Framework For Collaborative Multi-Agent Reinforcement Learning Systems

Shilpi Anand

LLMs Can Now Talk in Real-Time with Minimal Latency: Chinese Researchers Release LLaMA-Omni2, a...

Implementing an AgentQL Model Context Protocol (MCP) Server

Google Releases 76-Page Whitepaper on AI Agents: A Deep Technical Dive into Agentic RAG,...

NVIDIA Open Sources Parakeet TDT 0.6B: Achieving a New Standard for Automatic Speech Recognition...

OpenAI Releases a Strategic Guide for Enterprise AI Adoption: Practical Lessons from the Field

A Coding Guide to Compare Three Stability AI Diffusion Models (v1.5, v2-Base & SD3-Medium)...

How AI Agents Store, Forget, and Retrieve? A Fresh Look at Memory Operations for...

8 Comprehensive Open-Source and Hosted Solutions to Seamlessly Convert Any API into AI-Ready MCP...

RWKV-X Combines Sparse Attention and Recurrent Memory to Enable Efficient 1M-Token Decoding with Linear...

How the Model Context Protocol (MCP) Standardizes, Simplifies, and Future-Proofs AI Agent Tool Calling...