Ques regarding deploying api server, help #690
-
After i deployed api connection, i was using completions mode. I constantly receive same result when I ask the same prompt for different times. Does any one know what issue may cause this problem? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 2 replies
-
Did you mean that you expect different responses with the same input? You can set the temperature to a higher value as described here |
Beta Was this translation helpful? Give feedback.
-
generation_config = GenerationConfig(
temperature=temperature,
top_p=top_p,
top_k=top_k,
num_beams=num_beams,
do_sample=True,
**kwargs,
) @ychel You can add |
Beta Was this translation helpful? Give feedback.
@ychel You can add
do_sample=True
atopenai_api_server.py
's predict function, I will fix this insufficient problem ASAP.