|
Research
My recent research focuses on reasoning, agents, and efficiency for LLMs and MLLMs.
|
[Preprint 2026]
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning
Dachuan Shi, Hanlin Zhu, Xiangchi Yuan, Wanjia Zhao, Kejing Xia, Wen Xiao, Wenke Lee
[Paper]
[Code]
[Website]
|
[ICLR 2026]
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs
Dachuan Shi, Abedelkadir Asi, Keying Li, Xiangchi Yuan, Leyan Pan, Wenke Lee, Wen Xiao
[Paper]
[Code]
[Website]
|
[ICML 2025]
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
Dachuan Shi, Yonggan Fu, Xiangchi Yuan, Zhongzhi Yu, Haoran You, Sixu Li, Xin Dong,
Jan Kautz, Pavlo Molchanov, Yingyan Celine Lin
[Paper]
[Code]
|
[ICML 2024]
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
Dachuan Shi, Chaofan Tao, Anyi Rao, Zhendong Yang, Chun Yuan, Jiaqi Wang
[Paper]
[Code]
|
[ICML 2023]
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers
Dachuan Shi, Chaofan Tao, Ying Jin, Zhendong Yang, Chun Yuan, Jiaqi Wang
[Paper]
[Code]
[Website]
|
Experience
2017–21
Tsinghua Outstanding Bachelor's Thesis
2021–24
Tsinghua Outstanding Master's Thesis
Shanghai AI Lab
Research Intern
Microsoft
Research Intern
Meta Superintelligence Labs
Research Scientist Intern
2022–24
Multimodal LLMs, Inference Optimization
2025
Reasoning LLMs, Math & Coding
2026
Agentic Post-Training
|