
Max token length for finetune and completion endpoints on Llama-2? #208

@urimerhav


Great job with this repo. I was able to finetune Llama-2 and it certainly seems to have an effect.

Unfortunately, the finetune silently accepts all inputs, and the documentation states that inputs are simply truncated to the max length. But it isn't specified anywhere what Llama-2's max length actually is. Meta originally released it with a bug that capped the effective max length at 2048, while the native max length appears to be 4096. So which is it?

Also, I tested my finetuned model's completion endpoint with inputs as large as 12,000 tokens, and it still produces a completion. So I assume you truncate there as well, presumably keeping only the tail of the prompt?

tldr:

  1. What is Llama-2's max token length?
  2. Is there anything we can do to affect this, or to get better visibility into how the input was tokenized?
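For reference, here is a minimal sketch of what tail-truncation to Llama-2's 4096-token native context window would look like, assuming the endpoint keeps only the end of the prompt. `MAX_CONTEXT` and `truncate_tail` are illustrative names, not this repo's actual API.

```python
MAX_CONTEXT = 4096  # Llama-2's native context length (not the buggy 2048)

def truncate_tail(token_ids: list[int], max_len: int = MAX_CONTEXT) -> list[int]:
    """Keep only the last `max_len` tokens of a tokenized prompt."""
    if len(token_ids) <= max_len:
        return token_ids
    return token_ids[-max_len:]

# Example: a 12,000-token prompt is cut down to its final 4096 tokens,
# so the first 7904 tokens are silently dropped.
prompt = list(range(12_000))
truncated = truncate_tail(prompt)
print(len(truncated))   # 4096
print(truncated[0])     # 7904 (first surviving token id)
```

Something like this running (or not) inside the finetune and completion paths is exactly what better visibility into tokenization would clear up.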
