Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models (bibtex)
by Siyan Zhao, Daniel Israel, Guy Van den Broeck and Aditya Grover
View — Paper PDF
Reference:
Siyan Zhao, Daniel Israel, Guy Van den Broeck and Aditya Grover. Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models, In Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.
Bibtex Entry:
@inproceedings{ZhaoAISTATS25,
author = {Zhao, Siyan and Israel, Daniel and Van den Broeck, Guy and Grover, Aditya},
title = {Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models},
booktitle = {Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS)},
month = {may},
year = {2025},
url = "https://arxiv.org/pdf/2404.09529.pdf",
keywords = {conference,selective}
}PDF Preview:
Powered by bibtexbrowser