Deng Xiaotie
Peking University, China
On the complexity of computing Markov perfect equilibrium in general-sum stochastic games
Similar to the role of Markov decision processes in reinforcement learning, Markov games (also called stochastic games) lay down the foundation for the study of multi-agent reinforcement learning and sequential agent interactions. We introduce approximate Markov perfect equilibrium as a solution to the computational problem of finite-state stochastic games repeated in the infinite horizon and prove its PPAD-completeness. This solution concept preserves the Markov perfect property and opens up the possibility for the success of multi-agent reinforcement learning algorithms on static two-player games to be extended to multi-agent dynamic games, expanding the reign of the PPAD-complete class.
Speaker BIO
Xiaotie Deng, Chair Professor of the Center on Frontiers of Computing Studies (CFCS) at Peking University, Director of Blockchain Committee at China Society for Industrial and Applied Mathematics (CSIAM), and Director of the Center for Multi-agent Research at Institute for Artificial Intelligence, Peking University. Before joining Peking University, he worked at Shanghai Jiaotong University, University of Liverpool, City University of Hong Kong, and York University. He received his Bachelor's degree from Tsinghua University in 1982, Master's degree from Chinese Academy of Sciences in 1984, and Ph.D. from Stanford University in 1989.
Xiaotie's main research interests are blockchain, internet economy, online algorithms and parallel computing. His recent research focuses on equilibrium calculation, multi-agent game, internet economy, and blockchain.
Xiaotie was honored with the FOCS Best Paper Award in 2006. He is a fellow of the Association of Computing Machinery (ACM) for his contributions to the interface of algorithms and game theory (2008), and the Institute of Electrical and Electronics Engineers (IEEE) for his contributions to computing in partial information and interactive environments (2019). He was elected as foreign member of Academia Europaea in 2020, CSIAM fellow in 2021. In 2022, he won ACM SIGecom Test of Time Award.
Xiaotie has undertaken dozens of research projects as the principal investigator and has served on the editorial board of top international journals. He has chaired many top international conferences, in particular, he initiated the Conference on Web and Internet Economics (WINE) organized by the three continents in turn: Asia, Europe and the United States, and the International Joint Conference on Theoretical Computer Science (IJTCS).