This repository was archived by the owner on Aug 7, 2025. It is now read-only.
Commit a07b7d9
Llama.cpp example for cpp backend (#2904)
* Version1 of llm inference with cpp backend
Signed-off-by: Shrinath Suresh <[email protected]>
Updating llm handler - loadmodel, preprocess, inference methods
Signed-off-by: Shrinath Suresh <[email protected]>
Fixed infinite lock by adding request ids to the preprocess method
Signed-off-by: Shrinath Suresh <[email protected]>
Adding test script for finding tokens per second llama-7b-chat and ggml version
Signed-off-by: Shrinath Suresh <[email protected]>
GGUF Compatibility
Signed-off-by: Shrinath Suresh <[email protected]>
Fixing unit tests
Signed-off-by: Shrinath Suresh <[email protected]>
Fix typo
Signed-off-by: Shrinath Suresh <[email protected]>
Using folly to read config path
Signed-off-by: Shrinath Suresh <[email protected]>
Removing debug couts
Signed-off-by: Shrinath Suresh <[email protected]>
Processing all the items in the batch
Signed-off-by: Shrinath Suresh <[email protected]>
Adopted llama.cpp api changes
* Adapt to removal of TS backend
* Re-add test for llama.cpp example
* Add llama.cpp as a submodule
* Point to correct llama.cpp installation
* Build llama.cpp in build.sh
* Skip llama.cpp example test if model weights are not available
* renamed torchscript_model folder into examples
* Adjust to new base_handler interface
* Remove debug statement
* Rename llamacpp class + remove dummy.pt file
* Move llamacpp config.json
* Moved and created prompt file
* Reset context for multiple batch entries
* Add doc for llamacpp example
* Fix spell check
* Replace output example in llamacpp example
* Move cpp example src into main examples folder
* Convert cerr/cout into logs
---------
Co-authored-by: Shrinath Suresh <[email protected]>
1 parent 3ecaf0b commit a07b7d9
File tree
40 files changed, +564 -67 lines changed
- cpp
- src/examples
- test
- backends
- examples
- resources
- examples
- babyllama
- babyllama_handler
- MAR-INF
- llamacpp/llamacpp_handler/MAR-INF
- mnist
- base_handler
- MAR-INF
- mnist_handler
- MAR-INF
- wrong_handler/MAR-INF
- wrong_model/MAR-INF
- torchscript_model/babyllama/babyllama_handler
- torch_scripted
- utils
- third-party
- examples/cpp
- babyllama
- src
- llama2.c
- llamacpp
- src
- mnist
- src
- ts_scripts/spellcheck_conf