We investigate the redundancy among Transformer layers and propose an effective layer-level pruning method.
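To make the layer-redundancy idea concrete, here is a minimal sketch, assuming a layer's importance is measured by how much it changes its input hidden states (one common criterion; the paper's actual metric may differ). The function names `layer_importance` and `select_layers_to_prune` are illustrative, not the paper's API.

```python
# A minimal sketch of layer-redundancy scoring: layers whose output is
# nearly identical to their input transform the representation very little
# and are candidates for pruning. The cosine-similarity criterion here is
# an assumption for illustration, not the paper's exact method.
import torch


def layer_importance(h_in: torch.Tensor, h_out: torch.Tensor) -> float:
    """Score a layer by 1 - cosine similarity between its input and output.

    A score near 0 means the layer barely changes its input,
    i.e. it is largely redundant.
    """
    cos = torch.nn.functional.cosine_similarity(h_in, h_out, dim=-1)
    return float(1.0 - cos.mean())


def select_layers_to_prune(hidden_states: list[torch.Tensor], k: int) -> list[int]:
    """Return the indices of the k least important layers.

    hidden_states[i] is the input to layer i and hidden_states[i + 1] is
    its output, so len(hidden_states) == num_layers + 1.
    """
    scores = [
        layer_importance(hidden_states[i], hidden_states[i + 1])
        for i in range(len(hidden_states) - 1)
    ]
    return sorted(range(len(scores)), key=lambda i: scores[i])[:k]


# Toy usage with random activations standing in for a real model's states:
# 12 layers, batch size 2, sequence length 16, hidden dimension 64.
states = [torch.randn(2, 16, 64) for _ in range(13)]
print(select_layers_to_prune(states, k=3))
```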
This work also contributes to the study of lower bounds on the base of RoPE (Rotary Position Embedding), providing a theoretical foundation for the long-context extrapolation of models.
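As a rough illustration of why the base bounds usable context, the sketch below uses the standard RoPE parameterization θ_i = base^(−2i/d) and asks for the smallest base whose slowest rotation completes at most one full period over the target context length. This simple wavelength criterion and the helper `min_base_for_context` are illustrative assumptions, not the paper's actual derivation.

```python
# A minimal sketch of the relationship between the RoPE base and context
# length, assuming the standard frequencies theta_i = base**(-2i/dim).
# The "one full period" criterion below is an illustrative heuristic,
# not the paper's exact bound.
import math


def rope_frequencies(dim: int, base: float) -> list[float]:
    """Standard RoPE rotation frequencies theta_i = base**(-2i/dim)."""
    return [base ** (-2 * i / dim) for i in range(dim // 2)]


def min_base_for_context(context_len: int, dim: int) -> float:
    """Smallest base whose slowest rotation spans >= context_len positions.

    The slowest frequency is theta = base**(-(dim - 2) / dim), with
    wavelength 2 * pi / theta. Requiring that wavelength to be at least
    context_len and solving for base gives the expression below.
    """
    return (context_len / (2 * math.pi)) ** (dim / (dim - 2))


# Toy usage: the required base grows quickly with the target context length
# (head dimension 128, as in many recent Transformer models).
for L in (4_096, 32_768, 131_072):
    print(L, round(min_base_for_context(L, dim=128)))
```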