You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With strictOpenAICompliance: true, we are still seeing Google's Gemini Flash 2.0 model return a finish reason of MAX_TOKENS, ideally this would be translated to match Open AI's specifications.
What Should Have Happened?
A stop reason of length is returned when Gemini Flash 2.0 runs out of tokens.
Relevant Code Snippet
Use Gemini Flash 2.0 with a particularly long block of text and ask it to add comments to the entire block. Gemini will then return a finish reason of MAX_TOKENS.