This repository contains the implementation code and demonstration effects for the paper "TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text".
Note: The testing code and models will be open-sourced soon.
If you use this code or data in your work, please cite our paper:
@article{lu2024turborag,
title={TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text},
author={Lu, Songshuo and Wang, Hua and Rong, Yutian and Chen, Zhi and Tang, Yaohua},
journal={arXiv preprint arXiv:2410.07590},
year={2024}
}