Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

Abstract

Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms, however, have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This article addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multiagent deep RL (MADRL) is presented, including nonstationarity, partial observability, continuous state and action spaces, multiagent training schemes, and multiagent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to the future development of more robust and highly useful multiagent learning methods for solving real-world problems.

Keywords

Reinforcement learningComputer scienceObservabilityArtificial intelligenceDeep learningTransfer of learningAction (physics)State (computer science)Machine learningAlgorithmMathematics

Affiliated Institutions

Deakin University AU

Related Publications

Deep Reinforcement Learning That Matters

Peter Henderson , Riashat Islam , Philip Bachman +3 more

In recent years, significant progress has been made in solving challenging problems across various domains using deep reinforcement learning (RL). Reproducing existing work and ...

2018 Proceedings of the AAAI Conference on... 1397 citations

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

Jakob Foerster , Nantas Nardelli , Gregory Farquhar +4 more

Many real-world problems, such as network packet routing and urban traffic control, are naturally modeled as multi-agent reinforcement learning (RL) problems. However, existing ...

2017 arXiv (Cornell University) 333 citations

An Introduction to Deep Reinforcement Learning

Vincent François-Lavet , Peter Henderson , Riashat Islam +2 more

Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. This field of research has been able to solve a wide range of complex decision-m...

2018 Foundations and Trends® in Machine Le... 1163 citations

Counterfactual Multi-Agent Policy Gradients

Jakob Foerster , Gregory Farquhar , Triantafyllos Afouras +2 more

Many real-world problems, such as network packet routing and the coordination of autonomous vehicles, are naturally modelled as cooperative multi-agent systems. There is a great...

2018 Proceedings of the AAAI Conference on... 1491 citations

Improving Elevator Performance Using Reinforcement Learning

Robert H. Crites , Andrew G. Barto

This paper describes the application of reinforcement learning (RL) to the difficult real world problem of elevator dispatching. The elevator domain poses a combination of chall...

1995 493 citations

Publication Info

Year: 2020
Type: review
Volume: 50
Issue: 9
Pages: 3826-3839
Citations: 1090
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1090

OpenAlex

Cite This

APA Style

                            
                                    Thanh Thi Nguyen, 
                                
                                    Ngoc Duy Nguyen, 
                                
                                    Saeid Nahavandi
                                
                            (2020). 
                            Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications. 
                            IEEE Transactions on Cybernetics
                            , 50
                            (9)
                            , 3826-3839.
                            https://doi.org/10.1109/tcyb.2020.2977374

Identifiers

DOI: 10.1109/tcyb.2020.2977374