My name is Zhancun Mu, or 牟湛存 in Chinese. You can contact me at muzhancun@stu.pku.edu.cn. I am currently a senior student in Tong Class, Peking University. I am an incoming Ph.D. candidate at the Institute for Artificial Intelligence, Peking University starting from 2025, under the supervision of Professor Yitao Liang.
My research is focused on developing autonomous agents capable of operating in open-ended, dynamic environments such as Minecraft. To achieve this goal, I concentrate on two key elements: First, designing robust planning systems that enable agents to make complex decisions over long time horizons. This involves developing algorithms for reasoning about uncertain outcomes, decomposing high-level goals into executable subgoals, and continuously revising plans as new information becomes available. Second, creating intuitive controllers that allow agents to seamlessly interact with their environments. This requires integrating perception, decision-making, and low-level control in a unified framework that can handle the rich, multimodal inputs and outputs characteristic of virtual worlds. In addition to these core areas, I am keenly interested in multi-agent systems, exploring how autonomous agents can coordinate their behaviors to accomplish shared objectives. I am also drawn to cognitive reasoning and developing computational models that capture aspects of human-like intelligence. Furthermore, I am excited by the potential of AI to accelerate scientific discovery across domains. My ultimate vision is to develop intelligent agents that can autonomously explore, understand, and shape the open-ended environments they inhabit in pursuit of complex, self-motivated goals.
🔥 News
- Mar 2025 🐣🐣 We have released a state-of-the-art Minecraft Agent, “ROCKET-2”, supporting cross-view goal specification!
- Feb 2025: 🔥🔥 Our paper ‘‘ROCKET-1: Mastering Open-Wolrd Interaction with Visual-Temporal Context Prompting’’ is accepted by CVPR 2025.
- Dec 2024: 🎉🎉 Our open-world agent developing tool MineStudio is released on GitHub and PyPI.
- Sep 2024: 🧠🧠 Our latest Vision-Language-Action model OmniJARVIS is accepted by NeurIPS 2024.
- Jan 2024: 🍫🍫 PTGM is accepted by ICLR 2024 for oral presentation (top 1.2%).
📚 Preprints



📝 Publications



A Contextual Combinatorial Bandit Approach to Negotiation
Yexin Li, Zhancun Mu, Siyuan Qi
ICML 2024 | Paper

📖 Educations

2021.09 - now, Undergraduate
Tong class, Yuanpei College
Peking University, Beijing
💻 Internships
- 2023.07 - 2023.10, BAAI, China.