As soon as we build the performance test harness and get baselines, we will fine-tune these configurations along with the settings that the Granite team recommended.

For code tasks:
completionOptions": {
"temperature": 0.2 or 0.3 (for higher precision, more deterministic)
"topP": 0.9 or 1
"topK": 40
"presencePenalty": 0.0
"frequencyPenalty": 0.1
"stop": null,
"maxTokens": (start small, test and expand)
}
E.g., start maxTokens at 2K or 3K, i.e. the maximum output length. That leaves plenty of room for inputs (120K+ context), but to minimize hallucination we need to regulate both input and output size and work to find a balance; it all depends on the capability of the model.
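For reference, here is a sketch of how these options would slot into a full Continue `config.json` model entry. The title, provider, and model name below are illustrative placeholders (assuming an Ollama-served Granite Code model), not settled choices:

```json
{
  "models": [
    {
      "title": "Granite Code 8B",
      "provider": "ollama",
      "model": "granite-code:8b",
      "completionOptions": {
        "temperature": 0.2,
        "topP": 0.9,
        "topK": 40,
        "presencePenalty": 0.0,
        "frequencyPenalty": 0.1,
        "stop": null,
        "maxTokens": 2048
      }
    }
  ]
}
```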
Continue supports fine-tuned model configuration, so we should be able to provide proper defaults for each model size. @jamescho72 can you help here?
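As a strawman to seed that discussion (every number here is a placeholder until the performance baselines land), per-size defaults for the Granite Code family might look like:

```json
{
  "granite-code:3b":  { "temperature": 0.2, "topP": 0.9, "topK": 40, "maxTokens": 2048 },
  "granite-code:8b":  { "temperature": 0.2, "topP": 0.9, "topK": 40, "maxTokens": 2048 },
  "granite-code:20b": { "temperature": 0.3, "topP": 0.9, "topK": 40, "maxTokens": 3072 },
  "granite-code:34b": { "temperature": 0.3, "topP": 1.0, "topK": 40, "maxTokens": 3072 }
}
```

The intuition is that larger models can tolerate slightly more sampling freedom and longer outputs, but that is exactly what the baselines should confirm or refute.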