[docs] Redesign #31757

stevhliu · 2024-07-02T18:27:19Z

The main goal of this PR is to redesign the Transformers docs to:

Be more developer-friendly.
Improve navigation by replacing the existing structure with a more organic one that scales naturally instead of forcing content into the 4 current predefined sections.
Create a more unified docs experience by integrating content rather than adding it on.

This PR proposes a potential structure for achieving 2 and 3. Once the structure is in place, each doc will be rewritten to achieve 1.

If you're interested in more details about the redesign's motivation, please read this blog post. If you want more details about 1, 2, and 3, please read this post and this one too.

All feedback, alternative structures, and comments are welcomed! Thanks 🙂

HuggingFaceDocBuilderDev · 2024-07-02T18:55:36Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

docs/source/en/_toctree.yml

gante

Like! 👍

docs/source/en/_toctree.yml

gante · 2024-07-09T13:46:45Z

docs/source/en/_toctree.yml

+      title: Pipelines for webserver inference
+    - local: add_new_pipeline
+      title: How to add a pipeline to 🤗 Transformers?
+  - title: LLMs


A type of model that's becoming increasingly common are VLMs: they are the same as LLMs, but also accept image inputs.

Would it make sense to call this section "LLMs and VLMs"?

For sure! Let's rename the section when we have some VLM-specific docs?

docs/source/en/_toctree.yml

ydshieh · 2024-07-09T15:45:26Z

Indeed easier to read ❤️ . But there are a few places need to be moved if I understand correctly?

stevhliu · 2024-07-26T15:54:18Z

I've kicked off the redesign with the "Get Started" section. Feel free to review this section while I start on the next one (Base classes)!

The main changes are:

index.md

A cleaner index page that better describes what Transformers is in terms of its features and design. I believe this is more impactful than listing all the tasks you can solve across modalities. The focus shouldn't be on the tasks that you can solve; it should be on the models themselves. By describing the type of models available, I think users will understand that they can use them for their tasks. Having a more holistic description of the library here is more important than focusing on the different tasks/modalities.
More of a question here, but would it be better to maybe have badges on each model API doc that indicate whether it supports PyTorch/TensorFlow/Flax? Instead of having/maintaining such a long list that clutters up the main landing page, I think it'd be a lot cleaner to have this information on each model page. This way, users can see everything at once on the model page.

quicktour.md

Removed the "vertical" PyTorch/TensorFlow blocks in favor of the "horizontal" ones which I think it cleaner and less overwhelming.
Removed the big table of tasks available to Pipeline with just three code examples. I think this makes it simpler and more approachable. I also removed the Pipeline video because it felt very NLP-heavy, but we can add it back if we want to keep it.
Updated the AutoClass section to also be simpler and faster for users to start. A lot of these details (eg, tensors are outputted before the final activation function, custom model builds) can be explained in more depth in later docs. Also took this opportunity to introduce the generate API.
A better Next steps section directing users to topics of interest.

installation.md

Removed the options for downloading files in favor of just one method to keep it simple, and link to the Download files from the Hub doc for more details.

stevhliu · 2024-08-07T18:39:56Z

Hi, I'm back with an update! I've wrapped up the technical guides in the Models section. I'll circle back to the more conceptual docs later and also create some visual diagrams in Figma. Next up, I'll start working on the Preprocessors section. 🙂

The main focus is on how to load, customize, share, and contribute a model, basically a one-stop section for all your general model docs. The Load and Contribute docs have more significant changes:

models.md

Repurposed to show how to load a model. I start with a quick example of AutoModelFor.from_pretrained() so you can immediately get started, and then progressively peel back the layers. From how models and configurations interact, to the AutoClass API, and then model-specific classes. To make it easier to find how to load any model, I also added big models (device_map="auto") and custom models (trust_remote_code="True")to this page.

add_new_model.md

Updated structure to make the steps more discoverable. Before, many of the steps were hidden in "5.-14. Port BrandNewBert to Transformers" but now what these actual steps are more clear.

stevhliu · 2024-08-13T20:58:37Z

Finished the first draft of the Tokenizers doc, and I'm pretty excited that it reduces "content creep" from 6 different docs to just 1! 😎

stevhliu · 2024-08-20T22:54:12Z

The first draft of the practical guides in the base classes section is finished now! Please feel free to check it out and leave any comments or feedback (not sure why the feature extractor and processor docs aren't showing in the preview at the moment) 😄

I'll start working on the inference section after I review the first draft.

ylacombe reviewed Jul 9, 2024

View reviewed changes

docs/source/en/_toctree.yml Show resolved Hide resolved

gante approved these changes Jul 9, 2024

View reviewed changes

ydshieh reviewed Jul 9, 2024

View reviewed changes

docs/source/en/_toctree.yml Outdated Show resolved Hide resolved

ydshieh reviewed Jul 9, 2024

View reviewed changes

docs/source/en/_toctree.yml Outdated Show resolved Hide resolved

ydshieh reviewed Jul 9, 2024

View reviewed changes

docs/source/en/_toctree.yml Outdated Show resolved Hide resolved

stevhliu force-pushed the doc-redesign branch from 23b1066 to 1bc9299 Compare July 22, 2024 16:34

stevhliu force-pushed the doc-redesign branch from 31b60e4 to 4b1130c Compare August 6, 2024 19:17

stevhliu mentioned this pull request Aug 13, 2024

Language modeling examples do not show how to do multi-gpu training / fine-tuning #31323

Closed

4 tasks

stevhliu force-pushed the doc-redesign branch 3 times, most recently from 01434fd to ca38c6a Compare August 26, 2024 21:11

stevhliu force-pushed the doc-redesign branch 3 times, most recently from 0813af1 to bfc386d Compare September 9, 2024 23:23

stevhliu force-pushed the doc-redesign branch from 3076497 to a524d56 Compare September 16, 2024 21:03

stevhliu force-pushed the doc-redesign branch 3 times, most recently from fc48f55 to 75bace2 Compare September 25, 2024 23:32

stevhliu force-pushed the doc-redesign branch 6 times, most recently from d66930e to 9312112 Compare October 22, 2024 22:16

stevhliu added 29 commits January 14, 2025 17:19

optims

49e4b7d

optimizers

c8ab473

accelerate

f9e5429

parallelism

552e9af

fsdp

4ba26a7

update

8340c45

distributed cpu

561c1c2

hardware training

1322fec

gpu training

203e15e

gpu training 2

10433ae

peft

ed01153

distrib debug

32984c2

deepspeed 1

1ace040

deepspeed 2

2860c8e

chat toctree

f8dae62

quant pt 1

d8115dd

quant pt 2

91a2023

fix toctree

8bf53a5

fix

50a0d11

fix

3fa0d35

quant pt 3

c06721d

quant pt 4

cb04632

serialization

19b99c6

torchscript

7a53593

scripts

e1956d7

tpu

73b3929

review

a9c20f3

model addition timeline

bcc276e

modular

efaa75b

stevhliu force-pushed the doc-redesign branch from 03733c4 to efaa75b Compare January 15, 2025 01:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[docs] Redesign #31757

[docs] Redesign #31757

stevhliu commented Jul 2, 2024

HuggingFaceDocBuilderDev commented Jul 2, 2024

gante left a comment

gante Jul 9, 2024

stevhliu Jul 9, 2024

ydshieh commented Jul 9, 2024

stevhliu commented Jul 26, 2024

stevhliu commented Aug 7, 2024

stevhliu commented Aug 13, 2024

stevhliu commented Aug 20, 2024

[docs] Redesign #31757

Are you sure you want to change the base?

[docs] Redesign #31757

Conversation

stevhliu commented Jul 2, 2024

HuggingFaceDocBuilderDev commented Jul 2, 2024

gante left a comment

Choose a reason for hiding this comment

gante Jul 9, 2024

Choose a reason for hiding this comment

stevhliu Jul 9, 2024

Choose a reason for hiding this comment

ydshieh commented Jul 9, 2024

stevhliu commented Jul 26, 2024

stevhliu commented Aug 7, 2024

stevhliu commented Aug 13, 2024

stevhliu commented Aug 20, 2024