Skip to content

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation #295

Open
@gaocegege

Description

@gaocegege

https://arxiv.org/pdf/2404.12457

Co-design RAG 和 LLM inference

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions