Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added a low resource Indic GPT Model. #254

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

abhaskumarsinha
Copy link

Hello community,

  • Added a low-resource GPT Model that is pre-trained in Indic languages such as English, Hindi, Bengali, Odia, Telugu, Malayalam and Gujarati.
  • 80M parameters are suitable for fine-tuning for classification, recognition, data extraction, NER, Chatbots, etc like NLP tasks.
  • Can be run locally on mobile devices too.
  • Trained on Bharat4AIv1 corpus and some additional translation datasets: English-Odia, English-Hindi and English-Telugu
  • Training specs: 3 TPUv3 Hours; Keras Framework; TensorFlow backend
  • Model specs: 128 context length, 5 stacks decoder; 2D masked attention mechanism; 32 heads; 32 dims/head; 1024 embedding size; 7000 tokens combined vocabulary on sentencepiece
  • Model and weights under Apache 2.0.

Best Regards,
Abhas

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant