model loading problem #5

xiaocangnn · 2024-08-28T15:49:22Z

Hello, when I was loading the pre-trained code generation model given in your github repository, the problem shown in the picture below occurred: the model is missing a word segmenter or the loading path conflicts. I have tried many methods but have not been able to solve it. Have you ever encountered this kind of problem? If so, how was it resolved?

shunzh · 2024-09-01T22:50:10Z

Hello, Thanks for your question! I quickly tried it on Google Colab and there doesn't seem to be a problem loading the GPT2Tokenizer.
Based on the error message, is there a local directory called "gpt2" in your workspace?

xiaocangnn · 2024-09-02T01:32:58Z

Thank you very much for your reply. I all used the code you gave on github and then ran it on the Auto DL server, but it prompted me missing the file vocab.json and merges.txt files, trying to download both files from Hugging Face and import, solved the above problem.
But I have a new question: can I take your PG-TD model as a pre-trained model and train it on my own dataset? Attempt to train into a code generation model for exploit exploits.

shunzh · 2024-09-06T05:24:13Z

Thank you very much for your reply. I all used the code you gave on github and then ran it on the Auto DL server, but it prompted me missing the file vocab.json and merges.txt files, trying to download both files from Hugging Face and import, solved the above problem.

Great to know that the problem is solved!

But I have a new question: can I take your PG-TD model as a pre-trained model and train it on my own dataset? Attempt to train into a code generation model for exploit exploits.

We don't own the pre-trained models. We use the models from this paper: https://arxiv.org/abs/2105.09938. You may check but I think it's okay to fine-tune their models.
However there have been newer code models since then, like CodeGen and Code Llama.

xiaocangnn closed this as completed Aug 28, 2024

xiaocangnn reopened this Aug 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model loading problem #5

model loading problem #5

xiaocangnn commented Aug 28, 2024 •

edited

Loading

shunzh commented Sep 1, 2024

xiaocangnn commented Sep 2, 2024

shunzh commented Sep 6, 2024

model loading problem #5

model loading problem #5

Comments

xiaocangnn commented Aug 28, 2024 • edited Loading

shunzh commented Sep 1, 2024

xiaocangnn commented Sep 2, 2024

shunzh commented Sep 6, 2024

xiaocangnn commented Aug 28, 2024 •

edited

Loading