File tree Expand file tree Collapse file tree 1 file changed +2
-6
lines changed Expand file tree Collapse file tree 1 file changed +2
-6
lines changed Original file line number Diff line number Diff line change @@ -4,19 +4,15 @@ This script modifies [OpenAI's Whisper](https://github.com/openai/whisper) to pr
4
4
5
5
![ jfk] ( https://user-images.githubusercontent.com/28970749/225825244-f3df9607-91ab-4011-a333-7e3ae94da08f.PNG )
6
6
7
-
8
7
https://user-images.githubusercontent.com/28970749/225825286-cdb14d70-566f-454b-a2b3-b61b4b3e09c9.mp4
9
8
10
-
11
9
### What's new in 2.0.0 ?
12
10
- updated to use Whisper's more reliable word-level timestamps method.
13
11
- the more reliable word timestamps allows regrouping segments word by word.
14
12
- can now suppress silence with [ Silero VAD] ( https://github.com/snakers4/silero-vad ) (requires PyTorch 1.2.0+)
15
13
- non-VAD silencing suppress is also more robust
16
14
17
-
18
- https://user-images.githubusercontent.com/28970749/225825316-41060f7f-0676-4d7c-972a-cd318b8a79c2.mp4
19
-
15
+ https://user-images.githubusercontent.com/28970749/225826345-ef7115db-51e4-4b23-aedd-069389b8ae43.mp4
20
16
21
17
### Features
22
18
- more control over the timestamps than default Whisper
@@ -67,7 +63,7 @@ result.save_as_json('audio.json')
67
63
68
64
### Regrouping Words
69
65
Stable-ts has a preset for regrouping word into different segments. This preset is enabled by ` regroup=True ` .
70
- But are other built-in regrouping methods that allow you to customize the regrouping logic.
66
+ But there are other built-in regrouping methods that allow you to customize the regrouping logic.
71
67
This preset is just a predefined a combination of those methods.
72
68
73
69
You can’t perform that action at this time.
0 commit comments