I am Heming Xia, a Ph.D. student at the NLP Group of The Hong Kong Polytechnic University, supervised by Prof. Wenjie Li. I obtained my master’s degree from the MOE Key Lab of Computational Linguistics at Peking University, advised by Prof. Zhifang Sui. Before that, I received my bachelor’s degree from the School of Physics at Peking University in 2020. I have also spent time at the NLC Group @ Microsoft Research Asia as a Research Intern, where I was fortunate to work with Dr. Tao Ge. Please check my CV for further information.

Research

I am broadly interested in natural language processing and machine learning. My current research focuses on 1) efficient and effective NLP 2) tool learning, and 3) cross vision and language understanding.

News

[2024.10] Released SWIFT: on-the-fly self-speculative decoding for LLM inference acceleration🔥.
[2024.09] Got four papers accepted by EMNLP 2024, congrats to all co-authors🎉!
[2024.05] Got two papers accepted by ACL 2024.
[2024.01] Released Spec-Bench: a comprehensive benchmark for Speculative Decoding.
[2024.01] Released our new survey 📖 on Speculative Decoding.
[2024.01] Started my Ph.D. study at the NLP Group @ PolyU, supervised by Prof. Wenjie Li.
[2023.10] Got three papers accepted by EMNLP 2023.
[2023.05] Got one short paper accepted by ACL 2023.
[2022.05] Got one paper accepted by ACL 2022.
[2021.10] Started my research internship at NLC Group @ Microsoft Research Asia, advised by Dr. Tao Ge.
[2020.09] Started my Master study at the MOE Key Laboratory of Computational Linguistics, Peking University, advised by Prof. Zhifang Sui.

Publications

Most recent publications on Google Scholar.
* indicates equal contribution

SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
Heming Xia, Yongqi Li, Jun Zhang, Cunxiao Du, Wenjie Li
Arxiv Preprint. [link] [code]

AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong
EMNLP 2024. [link] [code]

A Survey on In-context Learning
Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui
EMNLP 2024. [link] [code] [机器之心]

Enhancing Tool Retrieval with Iterative Feedback from Large Language Models
Qiancheng Xu, Yongqi Li, Heming Xia, Wenjie Li
Findings of EMNLP 2024. [link]

Taking a Deep Breath: Enhancing Language Modeling of Large Language Models with Sentinel Tokens
Weiyao Luo, Suncong Zheng, Heming Xia, Weikang Wang, Yan Lei, Tianyu Liu, Shuang Chen, Zhifang Sui
Findings of EMNLP 2024. [link]

Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
Heming Xia, Zhe Yang, Qingxiu Dong, Peiyi Wang, Yongqi Li, Tao Ge, Tianyu Liu, Wenjie Li, Zhifang Sui
Findings of ACL 2024. [link] [code] [机器之心]

Can Large Multimodal Models Uncover Deep Semantics Behind Images?
Yixin Yang, Zheng Li, Qingxiu Dong, Heming Xia, Zhifang Sui
Findings of ACL 2024. [link] [code]

ImageNetVC: Zero- and Few-Shot Visual Commonsense Evaluation on 1000 ImageNet Categories
Heming Xia*, Qingxiu Dong*, Lei Li, Jingjing Xu, Tianyu Liu, Ziwei Qin, Zhifang Sui
Findings of EMNLP 2023. [link] [code]

Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization
Shoujie Tong*, Heming Xia*, Damai Dai, Runxin Xu, Tianyu Liu, Binghuai Lin, Yunbo Cao, and Zhifang Sui
Findings of EMNLP 2023. [link]

Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation
Heming Xia*, Tao Ge*, Peiyi Wang, Si-Qing Chen, Furu wei, and Zhifang Sui
Findings of EMNLP 2023. [link] [code]

Enhancing Continual Relation Extraction via Classifier Decomposition
Heming Xia, Peiyi Wang, Tianyu Liu, Binghuai Lin, Yunbo Cao, and Zhifang Sui
Findings of ACL 2023. [link] [code]

Premise-based Multimodal Reasoning: Conditional Inference on Joint Textual and Visual Clues
Qingxiu Dong*, Ziwei Qin*, Heming Xia, Tian Feng, Shoujie Tong, Haoran Meng, Lin Xu, Zhongyu Wei, Weidong Zhan, Baobao Chang, Sujian Li, Tianyu Liu and Zhifang Sui
ACL 2022. [link] [code]

Improved deep learning techniques in gravitational-wave data analysis
Heming Xia, Lijing Shao, Junjie Zhao and Zhoujian Cao
Phys. Rev. D 103 2021. [link] [code]

Service

Reviewer:
NeurIPS 2022, AACL 2022, AACL 2023, ARR (Feb, Apr, Jun, Aug, Oct-2024), ICLR 2025

Teaching Assistant:
COMP 2S01: Technology Beyond Borders: Service Learning Across Cultural and Ethnic, Spring 2024, PolyU
COMP 5140: Metaverse Fundamentals, Fall 2024, PolyU

Invited Talks

[2024.03] Unlocking the Efficiency of LLM Inference: A Comprehensive Survey of Speculative Decoding at NICE and CIP Group @CASIA. [video] [slides]

Awards

Merit Student, Peking University (2021)
Scholarship of National Astronomical Observatory, Chinese Academy of Sciences (2019)
Merit Student, Henan Province, China (2016)