Inference Acceleration for the 70B LLaMA-2 Large Language Model

Qingyu Zhang
Qingyu Zhang
Master Student of Computer Science and Technology

Research interests include LLM Long Context and Post-training.