-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Model Request Tracking #1042
Comments
I am interested in adding a new model, but after seeing the tutorial, I find myself confused and unsure about how to begin. I have several questions regarding the tutorial:
As someone new to LLM-MLC, I apologize for the multitude of questions and hope you don't mind. |
Hi @tlopex, thanks for the questions!
That is largely correct, as long as some performance is taken into account.
Based on the answer to question 2, some of these parameters are not needed to get the results that are consistent. In this case,
Typically, we want to implement the module that has the transformer model and a linear head -- since our goal is autoregressive generation. In this case, it is
Sorry I made a typo; it should be
That is correct, not all parameters in the |
@CharlieFRuan Thank you for your previous excellent answers, which indeed taught me a lot and allowed me to start. However, I still encountered some issues today:
Moreover, it seems that the explanations for the functions like
Sorry to have so many questions again, I hope to get your answers. |
@tlopex Apologies for the late reply. Please keep the questions coming, it'd also be helpful for other people trying to learn the workflow.
Yep, please use the newest version. The repo is likely to be updated continuously, but the main concept and procedure should be largely the same. Otherwise, we would update the tutorial.
Perhaps try
5 is indeed just the fifth layer. There isn't a specific reason for this being picked, we just wanted to use a single layer to demonstrate the validation process. Any layer would work.
Directly passing in an integer should work, e.g. Let me know if there are other questions! |
This is a pinned issue directed to the Model Request Tracking Board.
The text was updated successfully, but these errors were encountered: