Adding a llama.cpp LLM Component #1052

edlee123 · 2024-12-20T03:45:43Z

Description

I added a llama.cpp LLM OPEA component. Llama.cpp is a popular LLM inference library/server "with minimal setup and state-of-the-art performance on a wide range of hardware - locally and in the cloud" written in pure C/C++.

The component code is written in llm.py, and is most similar to the existing code in llms/text-generation/ray_serve. I also referred to ollama, and tgi to try keep with conventions.

Please see the README.md provides instructions how to use it.

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

The dependencies are similar to other llm components.

Tests

This was tested on CPU (laptop) with Phi3.5 mini 4k instruct. The Llama Cpp can use GPU as needed but didn't test it.

Signed-off-by: Ed Lee <[email protected]>

First commit of llamacpp Opea component

397f7b8

Signed-off-by: Ed Lee <[email protected]>

edlee123 requested a review from lvliang-intel as a code owner December 20, 2024 03:45

edlee123 added 2 commits December 19, 2024 21:50

Removed unneeded requirements file

cb4f5e5

Signed-off-by: Ed Lee <[email protected]>

Merge branch 'main' into llamacpp

df3d943

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding a llama.cpp LLM Component #1052

Adding a llama.cpp LLM Component #1052

edlee123 commented Dec 20, 2024 •

edited

Loading

Adding a llama.cpp LLM Component #1052

Are you sure you want to change the base?

Adding a llama.cpp LLM Component #1052

Conversation

edlee123 commented Dec 20, 2024 • edited Loading

Description

Issues

Type of change

Dependencies

Tests

edlee123 commented Dec 20, 2024 •

edited

Loading