Search

Home
News
Publications
Projects
Awards
Contact

Light Dark Automatic

Inference Acceleration for the 70B LLaMA-2 Large Language Model

Last updated on Aug 4, 2025

Large Language Models Inference Acceleration VLLM ASC24

Qingyu Zhang

Master Student of Computer Science and Technology

I work on AI sales and customer-service agents, with experience across LLM pretraining, post-training, evaluation, and model efficiency.

© 2026 Qingyu Zhang. This work is licensed under CC BY NC ND 4.0

Published with Hugo Blox Builder — the free, open source website builder that empowers creators.

Cite