I am an incoming Ph.D. student in the Computer Science department at Rice University. Prior to this, I obtained bachelor’s degrees in both science and engineering from Tsinghua University.
My research focuses on efficient machine learning systems.
Email: jingwei.zuo [at] rice [dot] edu
Research
My main research question is:
👉 How can we compute more with fewer resources?
💡 How did I develop this research focus?
Modern deep neural networks, exemplified by large language models (LLMs), have an enormous number of parameters and consume significant amounts of energy. Scaling up models to achieve superior capabilities is important, but so is keeping the cost down. The energy OpenAI’s ChatGPT uses each year to respond to users’ requests could power 43,204 U.S. homes for an entire year. [1] This is a staggering number, and it strengthens my belief that we should make every effort to cut the cost of AI models, making the new technology accessible to everybody and the earth greener.
News
01/23/2025: 🎉 DuoAttention accepted by ICLR 2025!
Publications
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
Guangxuan Xiao, Jiaming Tang, Jingwei Zuo, Junxian Guo, Shang Yang, Haotian Tang, Yao Fu, Song Han
ICLR 2025
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors
Weize Chen*, Yusheng Su*, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou
ICLR 2024
Academic Services
Service as a reviewer for the following conferences:
Experiences

Moonshot AI
Beijing, China, 2024.12-2025.05
AI Infrastructure Internship

Carnegie Mellon University
Pittsburgh, PA, USA, 2024.07-10
Remote Research Internship
InfiniAI Lab, advised by Prof. Beidi Chen

Massachusetts Institute of Technology
Cambridge, MA, USA, 2023.10-2024.05
Research Internship
Han Lab, advised by Prof. Song Han

Tsinghua University
Beijing, China, 2023.03-09
Research Internship
Natural Language Processing Lab (THUNLP), advised by Prof. Zhiyuan Liu
Education

Rice University
2025.09-Present
Doctoral Study

Tsinghua University
2021.09-2025.06
B.Eng. in Electrical Engineering
B.S. in Fundamental Sciences (Math & Physics)

Northeastern University
2023.09-12
Exchange student at College of Engineering
Named to the Dean's List
To Learn More About Me
Ideals
I would love to witness a world where humans enjoy more convenience, harmony, and happiness. Admittedly, my current research interest is only one small factor contributing to this grand (and probably quixotic) ideal. Still, I would not want my research to work against this vision at any time or under any circumstances. I advocate for the open-source community.
Other Experiences
I went to Northeastern University for a one-semester exchange program in Fall 2023 and had a wonderful time there! I love traveling and have been to Hong Kong, Macao, Japan, Singapore, Australia, the US, and of course many places of interest in mainland China.
Fun Facts
When I get nervous, I tend to scratch my hair 😬. So the next time you see me doing that in a debate, you'll know you've got me.
Hobbies
- Sports: Tennis.
- Music: I sometimes play the piano. I enjoy both classical and pop music.
- Photography: At the crossing between nature and humanity. Portfolio: (Ig) chris.z_pics
- Reading: Books on sociology, psychology, and history, as well as novels.
Feel free to reach out to me by email! We could even have a coffee chat in person if we are in the same city! I am always glad to talk with others, because their ideas often inspire me and my words may inspire them too :)
ℹ️ Preferred contact methods: Email / Online Chat > WeChat messaging / Text messaging > Calling.