SkyText is a Chinese GPT-3 pre-trained large language model released by Singularity-AI. It can perform tasks such as chatting, Q&A, and Chinese-English translation.
https://huggingface.co/SkyWork/SkyText
https://huggingface.co/SkyWork/SkyTextTiny
For a hands-on trial, please visit Singularity-AI-trail.
Technical advantage 1: data cleaning with more than 30 processes
With the development of NLP technology, large pre-trained models have gradually become one of the core technologies of artificial intelligence. Pre-training a large model requires an enormous amount of text, and web text is naturally the most important source of corpus. The quality of the training corpus directly affects the quality of the resulting model. To train a model with outstanding capabilities, Singularity-AI applied more than 30 cleaning processes to its training data; this attention to detail underpins the model's strong performance.
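The card does not describe the individual cleaning steps. Purely as an illustration of what a multi-stage cleaning pipeline can look like, here is a minimal sketch in which each stage is a simple filter or transform applied to every document; the specific stages and thresholds below are assumptions for demonstration, not Singularity-AI's actual pipeline.

# -*- coding: utf-8 -*-
# Illustrative only: a hypothetical multi-stage text-cleaning pipeline.
# None of these stages are confirmed parts of Singularity-AI's process.
import re

def strip_html(text):
    return re.sub(r"<[^>]+>", " ", text)             # drop leftover HTML tags

def normalize_whitespace(text):
    return re.sub(r"\s+", " ", text).strip()         # collapse runs of whitespace

def drop_short_documents(text, min_chars=10):
    return text if len(text) >= min_chars else None  # reject near-empty pages

CLEANING_STAGES = [strip_html, normalize_whitespace, drop_short_documents]
# A real pipeline would chain 30+ such stages (deduplication, language
# identification, quality filtering, ...), each applied to every document.

def clean(document):
    for stage in CLEANING_STAGES:
        document = stage(document)
        if document is None:   # a stage may reject the document entirely
            return None
    return document

print(clean("<p>今天是个好日子，   我们一起去郊游。</p>"))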
Technical advantage 2: an optimized and innovative Chinese encoding method
The field of large pre-trained models has long been dominated by the English-speaking community, so the importance of Chinese pre-trained large models is self-evident. Unlike English, Chinese input (pinyin and Chinese characters) obviously calls for different handling in a Chinese pre-trained model. Based on the characteristics of the Chinese language, Singularity-AI has designed an optimized, innovative Chinese encoding method that better matches Chinese language habits, and rebuilt a Chinese vocabulary that is more conducive to model understanding.
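One way to see the effect of the custom encoding is to inspect the tokenizer itself. The sketch below assumes that the SkyWork/SkyTextTiny tokenizer loaded with trust_remote_code=True (as in the usage example further down) exposes the standard Hugging Face tokenize/ids interface; the exact tokens it prints depend on Singularity-AI's vocabulary.

# -*- coding: utf-8 -*-
# Inspect how the custom Chinese tokenizer segments a sentence.
# Assumes the remote-code tokenizer follows the standard Hugging Face interface.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("SkyWork/SkyTextTiny", trust_remote_code=True)

sample = "今天天气真好，我们一起去郊游吧。"   # "The weather is great today, let's go on an outing."
tokens = tokenizer.tokenize(sample)            # how the sentence is split into tokens
ids = tokenizer(sample)["input_ids"]           # the corresponding ids in the Chinese vocabulary

print("vocabulary size:", len(tokenizer))
print("tokens:", tokens)
print("ids:", ids)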
- [2022.12.15] AIGC Press Conference of Singularity-AI
—————————————————————————————————
Recommended
transformers>=4.18.0
# -*- coding: utf-8 -*-
from transformers import GPT2LMHeadModel
from transformers import AutoTokenizer
from transformers import TextGenerationPipeline

# Load one of the two checkpoints:
# the 13-billion-parameter model
model = GPT2LMHeadModel.from_pretrained("SkyWork/SkyText")
tokenizer = AutoTokenizer.from_pretrained("SkyWork/SkyText", trust_remote_code=True)

# or the 2.6-billion-parameter model
model = GPT2LMHeadModel.from_pretrained("SkyWork/SkyTextTiny")
tokenizer = AutoTokenizer.from_pretrained("SkyWork/SkyTextTiny", trust_remote_code=True)

# Build a text-generation pipeline on the first GPU (device=0)
text_generator = TextGenerationPipeline(model, tokenizer, device=0)
input_str = "Today is a "
max_new_tokens = 20
print(text_generator(input_str, max_new_tokens=max_new_tokens, do_sample=True))
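For finer control over decoding (sampling temperature, nucleus sampling, or falling back to CPU when no GPU is available), the model can also be called through generate() directly. This is a minimal sketch that reuses the model, tokenizer, and variables loaded above and assumes the custom tokenizer supports the standard __call__/decode interface; the sampling values are illustrative, not recommended settings.

# -*- coding: utf-8 -*-
# Sketch: calling generate() directly instead of using the pipeline.
# Assumes `model`, `tokenizer`, `input_str`, `max_new_tokens` are defined as above.
import torch

device = "cuda:0" if torch.cuda.is_available() else "cpu"
model = model.to(device)

inputs = tokenizer(input_str, return_tensors="pt").to(device)
outputs = model.generate(
    **inputs,
    max_new_tokens=max_new_tokens,
    do_sample=True,
    top_p=0.9,        # nucleus sampling threshold (illustrative value)
    temperature=0.8,  # softens the token distribution (illustrative value)
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))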
MIT License