About Me
I’m a Senior Researcher at Microsoft Research. I received my Bachelor’s degree and Ph.D. from Tsinghua University in 2016 and 2021, respectively. My recent research focuses on computer use agents, tool calling, and agentic AI.
📢 I’m looking for a self-motivated research intern who is passionate about AI agents (GUI/embodied agents, tool calling, and context management). If you are interested in these topics, feel free to email me your resume and a brief self-intro!
News 🌟
Jan. 2026 Welcome to join our 5th Workshop on Computer Vision in the Wild (CVinW) at @CVPR 2026!
Jan. 2026 We release SynthAgent, a task and trajectory synthetic framework for web agents!
Jan. 2026 Dyna-Mind is accepted by ICLR 2026!
Dec. 2025 We release Argos, a principled reward agent to train LMRMs for agentic tasks.
Nov. 2025 We release GUI-360, a comprehensive dataset and benchmark for CUA!
Jun. 2025 Excited to release GUI-Actor, a coordinate-free visual grounding method for GUI agents!
Show more news
Apr. 2025 You’re welcome to join our 4th Workshop on Computer Vision in the Wild (CVinW) at @CVPR 2025!
Apr. 2025 We release MMInference, accelerating pre-filling for long-context VLMs!
Feb. 2025 We release Magma, a foundation model for multimodal AI agents!
Jan. 2025 SeCom and SCBench are accepted at ICLR 2025!
Sep. 2024 MInference is accepted at NeurIPS 2024 as a spotlight!
Sep. 2024 I’m serving as an Area Chair for COLING 25!
Jun. 2024 MInference and LLM Position Bias paper are accepted to ES-FoMo II @ ICML24 and LCFM @ ICML24, respectively.
May. 2024 LLMLingua Series has been integrated as a custom tool in Prompt Flow, AutoGen, LangChain and LlamaIndex.
May. 2024 LongLLMLingua and LLMLingua-2 are accepted to ACL-2024 in main track and findings!
Mar. 2024 We release LLMLingua-2, an efficient option for task-agnostic prompt compression with good performance and generalizability across different scenarios, boasting a 3x-6x speed improvement over LLMLingua!
Oct. 2023 We release LongLLMLingua, aiming to accelerate and enhance LLM inference in long-context scenarios via question-aware prompt compression and content reorganization!
Oct. 2023 We release LLMLingua, a coarse-to-fine prompt compression method based on perplexity from a small language model such as LLaMA-7B!
Recommended Repos 🧰
- GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
- Magma: A Foundation Model for Multimodal AI Agents
- MInference: Million-Tokens Prompt Inference for Long-context LLMs
- LLMLingua Series for Prompt Compression
- Versatile Entity Recognition & disambiguation Toolkit
Selected Publications 📚
Agentic AI
- Preprint 2025 Multimodal Reinforcement Learning with Agentic Verifier for AI Agents
Reuben Tan, Baolin Peng, Zhengyuan Yang, Hao Cheng, Oier Mees, Theodore Zhao, Andrea Tupini, Isar Meijier, Qianhui Wu, Yuncong Yang, Lars Liden, Yu Gu, Sheng Zhang, Xiaodong Liu, Lijuan Wang, Marc Pollefeys, Yong Jae Lee, Jianfeng Gao - Preprint 2025 Adapting Web Agents with Synthetic Supervision
Zhaoyang Wang, Yiming Liang, Xuchao Zhang, Qianhui Wu, Siwei Han, Anson Bastos, Rujia Wang, Chetan Bansal, Baolin Peng, Jianfeng Gao, Saravan Rajmohan, Huaxiu Yao - Preprint 2025 GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents
Jian Mu, Chaoyun Zhang, Chiming Ni, Lu Wang, Bo Qiao, Kartik Mathur, Qianhui Wu, Yuhang Xie, Xiaojun Ma, Mengyu Zhou, Si Qin, Liqun Li, Yu Kang, Minghua Ma, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang - ICLR-2026 Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Xiao Yu, Baolin Peng, Michel Galley, Hao Cheng, Qianhui Wu, Janardhan Kulkarni, Suman Nath, Zhou Yu, Jianfeng Gao - NeurIPS-2025 GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
Qianhui Wu*, Kanzhi Cheng*, Rui Yang*, Chaoyun Zhang, Jianwei Yang, Huiqiang Jiang, Jian Mu, Baolin Peng, Bo Qiao, Reuben Tan, Si Qin, Lars Liden, Qingwei Lin, Huan Zhang, Tong Zhang, Jianbing Zhang, Dongmei Zhang, Jianfeng Gao. - CVPR-2025 Magma: A Foundation Model for Multimodal AI Agents
Jianwei Yang*, Reuben Tan*, Qianhui Wu*, Ruijie Zheng, Baolin Peng, Yongyuan Liang, Yu Gu, Mu Cai, Seonghyeon Ye, Joel Jang, Yuquan Deng, Jianfeng Gao. - ICLR-2025 SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents
Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Xufang Luo, Hao Cheng, Dongsheng Li, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Jianfeng Gao.
Efficient LLM Inference
- ICML-2025 MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention
Yucheng Li, Huiqiang Jiang, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Amir H. Abdi, Dongsheng Li, Jianfeng Gao, Yuqing Yang, Lili Qiu. - ICLR-2025 SharedContextBench: How Lossy are Long-context Methods in KV Cache Reuse
Yucheng Li, Huiqiang Jiang, Qianhui Wu, Xufang Luo, Surin Ahn, Chengruidong Zhang, Amir H. Abdi, Dongsheng Li, Jianfeng Gao, Yuqing Yang, Lili Qiu. - NeurIPS-2024 MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention
Huiqiang Jiang*, Yucheng Li*, Chengruidong Zhang*, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu. - ACL-2024 Findings LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Menglin Xia, Xufang Luo, Jue Zhang, Qingwei Lin, Victor Rühle, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Dongmei Zhang. - ACL-2024 LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression
Huiqiang Jiang, Qianhui Wu, Xufang Luo, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu. - EMNLP-2023 LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Huiqiang Jiang, Qianhui Wu, Chin-Yew Lin, Yuqing Yang, Lili Qiu. - ACL-2023 Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text
Qianhui Wu, Huiqiang Jiang, Haonan Yin, Börje Karlsson, Chin-Yew Lin.
Information Extraction & Low-Resource NLP
- ACL-2023 CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition
Tingting Ma, Qianhui Wu, Huiqiang Jiang, Börje Karlsson, Tiejun Zhao, Chin-Yew Lin. - ACL-2022 Findings Decomposed Meta-Learning for Few-Shot Named Entity Recognition
Tingting Ma*, Huiqiang Jiang*, Qianhui Wu*, Tiejun Zhao, Chin-Yew Lin. - NAACL-2022 On the Effectiveness of Sentence Encoding for Intent Detection Meta-Learning
Tingting Ma, Qianhui Wu, Zhiwei Yu, Tiejun Zhao, Chin-Yew Lin. - IJCAI-2020 UniTrans: Unifying Model Transfer and Data Transfer for Cross-Lingual Named Entity Recognition with Unlabeled Data
Qianhui Wu, Zijia Lin, Börje Karlsson, Biqing Huang, Jian-Guang Lou. - ACL-2020 Single-/Multi-Source Cross-Lingual NER via Teacher-Student Learning on Unlabeled Data in Target Language
Qianhui Wu, Zijia Lin, Börje Karlsson, Jian-Guang Lou, Biqing Huang. - AAAI-2020 Enhanced Meta-Learning for Cross-Lingual Named Entity Recognition with Minimal Resources
Qianhui Wu, Zijia Lin, Guoxin Wang, Hui Chen, Börje Karlsson, Biqing Huang, Chin-Yew Lin. - NTCIR-2019 DeepMRT at the NTCIR-14 FinNum Task: A Hybrid Neural Model for Numeral Type Classification in Financial Tweets
Qianhui Wu*, Guoxin Wang*, Yuyin Zhu, Haoyan Liu, Börje Karlsson.
Honors & Awards 🏆
2024 Microsoft 2024 Global Hackathon Executive Challenge Winner
2024 Microsoft Machine Learning, AI & Data Science Conference (MLADS) Distinguished Contribution
2023 Microsoft 2023 Global Hackathon Award Winner
2020 Outstanding Intern of “Stars of Tomorrow” Program, Microsoft Research Asia
2020 Intel Scholarship
2018 Outstanding (12-9) Counselor Prize, Tsinghua University
2016 Outstanding Bachelor Thesis, Tsinghua University
2014 National Encouragement Scholarship
2014 Scholarship of Art Excellence, Tsinghua University
2013 Scholarship of Academic Excellence, Tsinghua University
Other Information 📝
▶ Invited Talks
Sep. 2025 Towards AI Agents That Can See And Act @ Shanghai Artificial Intelligence Laboratory
Jun. 2025 Act Where You See: Coordinate-Free Visual Grounding for GUI Agents @ Simular Seminar
▶ Academic Service
- Area Chair: COLING-25.
- Conference Reviewer: ICLR, NeurIPS, CVPR, ACL, EMNLP, NAACL, AAAI, NLPCC.
- Journal Reviewer: CSUR, TOIS, Pattern Recognition, TASLP, IPM, JIM, ESIN, SIVP.
▶ Other Activities
Sep. 2016 - Aug. 2018 Counselor at Center for Student Learning and Development, Tsinghua University
Aug. 2012 - Jun. 2021 Member of Chinese National Orchestra, Tsinghua University