Dapeng Li ( 李大鹏 )
Ph.D Candidate

Institute of Automation, Chinese Academy of Sciences
School of Artificial Intelligence, University of Chinese Academy of Sciences

Location: 95 Zhongguancun East Road, BEIJING, CHINA
Interest | Education | Awards | Publications |

Email: lidapeng2020@ia.ac.cn
[ Wechat ] [公众号:Dapeng的记事本]

Fields of Interest

My research interests include reinforcement learning,multi-agent systems and data mining & analysis. Currently, I focus on the following research topics: Other Interest:

Education


Selected competitions and awards

  • The 1st (1/1122), 2021, DataFountain Green Furture Competition, Wind Power Abnormal Data Recognition Track
  • The 1st (1/620), 2021, DataFountain Green Furture Competition, Photovoltaic Abnormal Data Recognition Track
  • The 3rd (3/172), 2021, Global Open Data Application Innovation Competition, Wind Field Downscaling track
  • The 2nd (2/423), 2021, Global Open Data Application Innovation Competition, Road Detection track
  • The Grand Prize (1/158), 2021, Golden Wind Cup, Tsinghua
  • The 3rd (3/1511), 2021, DCIC Digital China Innovation Competition
  • The 3rd (3/739), 2021, iFLYTEK A.I. Advertising Picture Material Classification Algorithm Challenge
  • The 3rd, 2021, NeurIPS workshop MineRL intro
  • The 1st (1/4337), 2021, Tianchi Global AI Innovation Contest
  • The Second Prize (National), 2021, National Post-Graduate Mathematical Contest in Modeling
  • The 1st (1/2800), 2019, China Datathon
  • The First Prize, 2019, The "Challenge Cup" capital college students competition
  • The 1st (1/475), 2019, National University Student Transportation Science and Technology Competition
  • Silver medal, 2019, Microsoft Malware Prediction, Kaggle
  • The Second Prize (Global), 2019, International Competition of Autonomous Running Intelligent Robots
  • The 1st Prize, 2018, China Robot Competition, by Chinese Association of Automation (National)


  • Some awards in my graduate and undergraduate school:
  • 2021 Outstanding Mentors, CCF
  • 2021 Excellent Student, University of Chinese Academy of Sciences
  • 2020 Outstanding Graduates of Beijing
  • 2020 Beijing University of Technology Top Ten Graduates
  • 2020 Beijing University of Technology President Scholarship (Top ten student in entire school)
  • 2019 Technology Innovation and Practice Scholarship, Toyo Information Systems

Selected Publications

    [1] From Explicit Communication to Tacit Cooperation:A Novel Paradigm for Cooperative MARL
    International Conference on Autonomous Agents and Multi-Agent Systems(AAMAS), in Auckland, New Zealand, 2024. (Extended Abstract)
    Dapeng Li, Zhiwei Xu, Bin Zhang, and Guoliang Fan
    [Arxiv]

    [2] Adaptive Parameter Sharing for Multi-Agent Reinforcement Learning
    IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP), in Seoul, Korea, 2024.
    Dapeng Li, Na Lou, Bin Zhang, Zhiwei Xu, and Guoliang Fan
    [Arxiv]

    [3] SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning
    International Joint Conference on Neural Networks(IJCNN),in Queensland, Australia, 2023.
    Dapeng Li, Zhiwei Xu, Bin Zhang, and Guoliang Fan,

    [4] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Thirty-Seventh AAAI Conference on Artificial Intelligence(AAAI),in Washington, DC, USA, 2023. (Oral)
    Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, and Guoliang Fan,
    [Arxiv][Code]

    [5] HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism
    Thirty-Seventh AAAI Conference on Artificial Intelligence(AAAI),in Washington, DC, USA, 2023. (Oral)
    Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, and Guoliang Fan,
    [Arxiv][Code]

    [6] Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
    Thirty-sixth Conference on Neural Information Processing Systems(NeurIPS), in New Orleans, USA, 2022.
    Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, and Guoliang Fan,
    (Spotlight) [Arxiv]

    [7] MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning
    International Joint Conference on Neural Networks(IJCNN), in Shenzhen, China, 2021.
    Zhiwei Xu, Dapeng Li , Yunpeng Bai, and Guoliang Fan*,
    (Poster) [Arxiv]

    [8] SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning
    International Conference on Autonomous Agents and Multi-Agent Systems(AAMAS), in Auckland, New Zealand, 2022.
    Zhiwei Xu, Yunpeng Bai,Dapeng Li, Bin Zhang, and Guoliang Fan*, (Oral) [Arxiv][Code]


Pre-prints:

    [1] Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning
    Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, and Guoliang Fan.
    [Arxiv]