Publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. ArXiv
    Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning
    Xinghao Chen*, Anhao Zhao*, Heming Xia, Xuan Lu, Hanlin Wang, Yanjun Chen, Wei Zhang, Jian Wang, Wenjie Li, and Xiaoyu Shen
    In ArXiv, 2025
  2. ArXiv
    KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization
    Mingbo Song, Heming Xia, Jun Zhang, Chak Tou Leong, Qiancheng Xu, Wenjie Li, and Sujian Li
    In ArXiv, 2025
  3. ArXiv
    From Query to Logic: Ontology-Driven Multi-Hop Reasoning in LLMs
    Haonan Bian, Yutao Qi, Rui Yang, Yuanxi Che, Jiaqian Wang, and Ranran Zhen Heming Xia
    In ArXiv, 2025
  4. EMNLP
    TokenSkip: Controllable Chain-of-Thought Compression in LLMs
    Heming Xia, Yongqi Li, Chak Tou Leong, Wenjie Wang, and Wenjie Li
    In EMNLP, 2025
  5. EMNLP
    SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning
    Yicheng Ji*, Jun Zhang*, Heming Xia, Jinpeng Chen, Lidan Shou, Gang Chen, and Huan Li
    In EMNLP, 2025
  6. EMNLP
    Beyond Single Frames: Can LMMs Comprehend Temporal and Contextual Narratives in Image Sequences?
    Xiaochen Wang*, Heming Xia*, Jialin Song, Longyu Guan, Yixin Yang, Qingxiu Dong, Weiyao Luo, Yifan Pu, Yiru Wang, Xiangdi Meng, Wenjie Li, and Zhifang Sui
    In Findings of EMNLP, 2025
  7. ACL
    Towards Harmonized Uncertainty Estimation for Large Language Models
    Rui Li, Jing Long, Muge Qi, Heming Xia, Lei Sha, Peiyi Wang, and Zhifang Sui
    In ACL (Oral), 2025
  8. ACL
    How Far are LLMs from Being Our Digital Twins? A Benchmark for Persona-Based Behavior Chain Simulation
    Rui Li, Heming Xia, Xinfeng Yuan, Qingxiu Dong, Lei Sha, Wenjie Li, and Zhifang Sui
    In Findings of ACL, 2025
  9. ACL
    PEToolLLM: Towards Personalized Tool Learning in Large Language Models
    Qiancheng Xu, Yongqi Li, Heming Xia, Fan Liu, Min Yang, and Wenjie Li
    In Findings of ACL, 2025
  10. ICLR
    SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
    Heming Xia, Yongqi Li, Jun Zhang, Cunxiao Du, and Wenjie Li
    In ICLR, 2025