Commit 45ffff7

Merge pull request #3 from pettarin/master
aeneas v1.0.4: added boundary adjustment algorithm, run_vad, subtitle…
2 parents 75a0ec8 + 186ba18 commit 45ffff7

70 files changed (+2720 -483 lines changed)

README.md

Lines changed: 23 additions & 8 deletions
@@ -2,8 +2,8 @@
 
 **aeneas** is a Python library and a set of tools to automagically synchronize audio and text.
 
-* Version: 1.0.3
-* Date: 2015-06-12
+* Version: 1.0.4
+* Date: 2015-08-09
 * Developed by: [ReadBeyond](http://www.readbeyond.it/)
 * Lead Developer: [Alberto Pettarin](http://www.albertopettarin.it/)
 * License: the GNU Affero General Public License Version 3 (AGPL v3)
@@ -39,7 +39,7 @@ and [this audio file](aeneas/tests/res/container/job/assets/p001.mp3),
 
 Moreover, the map can be output in several formats: SMIL for EPUB 3,
 SRT/TTML/VTT for closed captioning, JS for Web usage,
-or raw CSV/TSV/TXT/XML for further processing.
+or raw CSV/SSV/TSV/TXT/XML for further processing.
 
 
 ## System Requirements, Supported Platforms and Installation
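
The hunk above adds SSV to the list of raw output formats. As a rough illustration only, the sketch below serializes a tiny sync map with three different delimiters. It assumes SSV means space-separated values; the column order, the `FRAGMENTS` data, and the `dump_sync_map` helper are hypothetical and are not taken from the aeneas code base.

```python
import csv
import io

# Hypothetical sync map: (fragment id, begin time in s, end time in s).
FRAGMENTS = [("f001", 0.0, 2.68), ("f002", 2.68, 5.48)]

# Assumption: the three "raw" formats differ only in their delimiter.
DELIMITERS = {"csv": ",", "ssv": " ", "tsv": "\t"}


def dump_sync_map(fragments, fmt):
    """Return the fragments as delimiter-separated text, one per line."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=DELIMITERS[fmt], lineterminator="\n")
    for identifier, begin, end in fragments:
        # Times are written with millisecond precision.
        writer.writerow([identifier, f"{begin:.3f}", f"{end:.3f}"])
    return buffer.getvalue()


ssv_text = dump_sync_map(FRAGMENTS, "ssv")
```

Driving all three formats from one delimiter table keeps the writer logic in a single place; the real library may lay out its columns differently.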
@@ -76,7 +76,8 @@ callable by the `subprocess` Python module.
 A way to ensure the latter consists
 in adding the three executables to your `$PATH`.
 Alternatively, you can use VirtualBox
-to run **aeneas** inside a virtualized Debian image.
+to run **aeneas** inside a virtualized Debian image,
+for example using [aeneas-vagrant](https://github.com/readbeyond/aeneas-vagrant).
 
 ### Installation
 
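
The context lines of this hunk say that `ffmpeg`, `ffprobe` and `espeak` must be reachable through `$PATH`. A minimal way to verify that from Python is sketched below. It is not the project's `check_dependencies.py`; the helper names are hypothetical, and `shutil.which` requires Python 3.3 or later.

```python
import shutil

# The three executables named in the README.
REQUIRED_EXECUTABLES = ("ffmpeg", "ffprobe", "espeak")


def find_executables(names=REQUIRED_EXECUTABLES):
    """Map each executable name to its full path, or None if not on PATH."""
    return {name: shutil.which(name) for name in names}


def missing_executables(names=REQUIRED_EXECUTABLES):
    """Return the names that cannot be resolved through PATH."""
    return [name for name, path in find_executables(names).items() if path is None]


missing = missing_executables()
if missing:
    print("Not found on PATH: " + ", ".join(missing))
else:
    print("All required executables are on PATH.")
```

An executable that resolves here should also be callable by `subprocess`, since both rely on the same `PATH` lookup.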

@@ -91,6 +92,15 @@ If the last command prints a success message,
 you have all the required dependencies installed
 and you can confidently run **aeneas** in production.
 
+If you get an error, try running the
+[provided `install_dependencies.sh` script](install_dependencies.sh)
+
+```bash
+$ sudo bash install_dependencies.sh
+```
+
+and then try running `check_dependencies.py` again.
+
 Alternatively, consider using the [Vagrant box](http://www.vagrantup.com)
 created by [aeneas-vagrant](https://github.com/readbeyond/aeneas-vagrant).
 

@@ -156,10 +166,12 @@ $ make html
 
 Tutorial: [A Practical Introduction To The aeneas Package](http://www.albertopettarin.it/blog/2015/05/21/a-practical-introduction-to-the-aeneas-package.html)
 
+Mailing list: [https://groups.google.com/d/forum/aeneas-forced-alignment](https://groups.google.com/d/forum/aeneas-forced-alignment)
+
 
 ## Supported Features
 
-* Input text files in plain, parsed or unparsed format
+* Input text files in plain, parsed, subtitles, or unparsed format
 * Text extraction from XML (e.g., XHTML) files using `id` and `class` attributes
 * Arbitrary text fragment granularity (single word, subphrase, phrase, paragraph, etc.)
 * Input audio file formats: all those supported by `ffmpeg`
@@ -168,6 +180,7 @@ Tutorial: [A Practical Introduction To The aeneas Package](http://www.albertopet
 * Supported (= tested) languages: BG, CA, CY, DA, DE, EL, EN, ES, ET, FI, FR, GA, GRC, HR, HU, IS, IT, LA, LT, LV, NL, NO, RO, RU, PL, PT, SK, SR, SV, TR, UK
 * Robust against misspelled/mispronounced words, local rearrangements of words, background noise/sporadic spikes
 * Code suitable for a Web app deployment (e.g., on-demand AWS instances)
+* Adjustable splitting times, including a max character/second constraint for CC applications
 
 
 ## Limitations and Missing Features
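
The feature added in this hunk mentions a max character/second constraint for closed captions. The sketch below shows only the arithmetic behind such a constraint: a fragment's rate is its character count divided by its duration. It is not the boundary adjustment algorithm introduced by the commit, and the function names are hypothetical.

```python
def chars_per_second(text, begin, end):
    """Reading rate of a caption: characters divided by duration in seconds."""
    duration = end - begin
    if duration <= 0:
        raise ValueError("end must be greater than begin")
    return len(text) / duration


def violates_max_rate(text, begin, end, max_rate):
    """True if the caption would have to be read faster than max_rate."""
    return chars_per_second(text, begin, end) > max_rate


def min_end_for_rate(text, begin, max_rate):
    """Earliest end time at which the caption satisfies max_rate."""
    return begin + len(text) / max_rate


# "Hello world" has 11 characters; shown for 0.5 s it reads at 22 chars/s.
rate = chars_per_second("Hello world", 0.0, 0.5)
```

One way to satisfy the constraint is to push a too-short fragment's end boundary out to `min_end_for_rate`, at the expense of the following fragment; whether aeneas does exactly this is not stated in the diff.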
@@ -189,9 +202,8 @@ Tutorial: [A Practical Introduction To The aeneas Package](http://www.albertopet
 * Reporting the alignment score
 * Improving (removing?) dependency from `espeak`, `ffmpeg`, `ffprobe` executables
 * Multilevel sync map granularity (e.g., multilevel SMIL output)
-* Enforcing a max char/second constraint for CC applications
 * Supporting input text encodings other than UTF-8
-* Adding more languages
+* Adding (testing) more languages
 * Better documentation
 * Testing other approaches, like HMM
 * Publishing the package on PyPI
@@ -249,6 +261,10 @@ No copy rights were harmed in the making of this project.
 
 ## Supporting and Contributing
 
+### Sponsors
+
+* **July 2015**: [Michele Gianella](https://plus.google.com/+michelegianella/about) generously supported the development of the boundary adjustment code
+
 ### Supporting
 
 Would you like supporting the development of **aeneas**?
@@ -334,4 +350,3 @@ helped shaping the structure of this package
 for its asynchronous usage.
 
 
-

README.txt

Lines changed: 27 additions & 7 deletions
@@ -4,8 +4,8 @@ aeneas
 **aeneas** is a Python library and a set of tools to automagically
 synchronize audio and text.
 
-- Version: 1.0.3
-- Date: 2015-06-12
+- Version: 1.0.4
+- Date: 2015-08-09
 - Developed by: `ReadBeyond <http://www.readbeyond.it/>`__
 - Lead Developer: `Alberto Pettarin <http://www.albertopettarin.it/>`__
 - License: the GNU Affero General Public License Version 3 (AGPL v3)
@@ -43,7 +43,7 @@ audio file <aeneas/tests/res/container/job/assets/p001.mp3>`__,
 
 Moreover, the map can be output in several formats: SMIL for EPUB 3,
 SRT/TTML/VTT for closed captioning, JS for Web usage, or raw
-CSV/TSV/TXT/XML for further processing.
+CSV/SSV/TSV/TXT/XML for further processing.
 
 System Requirements, Supported Platforms and Installation
 ---------------------------------------------------------
@@ -81,7 +81,8 @@ sure ``ffmpeg``, ``ffprobe`` and ``espeak`` are properly installed and
 callable by the ``subprocess`` Python module. A way to ensure the latter
 consists in adding the three executables to your ``$PATH``.
 Alternatively, you can use VirtualBox to run **aeneas** inside a
-virtualized Debian image.
+virtualized Debian image, for example using
+`aeneas-vagrant <https://github.com/readbeyond/aeneas-vagrant>`__.
 
 Installation
 ~~~~~~~~~~~~
@@ -97,6 +98,15 @@ If the last command prints a success message, you have all the required
 dependencies installed and you can confidently run **aeneas** in
 production.
 
+If you get an error, try running the `provided
+``install_dependencies.sh`` script <install_dependencies.sh>`__
+
+.. code:: bash
+
+    $ sudo bash install_dependencies.sh
+
+and then try running ``check_dependencies.py`` again.
+
 Alternatively, consider using the `Vagrant
 box <http://www.vagrantup.com>`__ created by
 `aeneas-vagrant <https://github.com/readbeyond/aeneas-vagrant>`__.
@@ -165,10 +175,12 @@ Generated from the source (requires ``sphinx``):
 Tutorial: `A Practical Introduction To The aeneas
 Package <http://www.albertopettarin.it/blog/2015/05/21/a-practical-introduction-to-the-aeneas-package.html>`__
 
+Mailing list: https://groups.google.com/d/forum/aeneas-forced-alignment
+
 Supported Features
 ------------------
 
-- Input text files in plain, parsed or unparsed format
+- Input text files in plain, parsed, subtitles, or unparsed format
 - Text extraction from XML (e.g., XHTML) files using ``id`` and
 ``class`` attributes
 - Arbitrary text fragment granularity (single word, subphrase, phrase,
@@ -183,6 +195,8 @@ Supported Features
 of words, background noise/sporadic spikes
 - Code suitable for a Web app deployment (e.g., on-demand AWS
 instances)
+- Adjustable splitting times, including a max character/second
+constraint for CC applications
 
 Limitations and Missing Features
 --------------------------------
@@ -206,9 +220,8 @@ TODO List
 - Improving (removing?) dependency from ``espeak``, ``ffmpeg``,
 ``ffprobe`` executables
 - Multilevel sync map granularity (e.g., multilevel SMIL output)
-- Enforcing a max char/second constraint for CC applications
 - Supporting input text encodings other than UTF-8
-- Adding more languages
+- Adding (testing) more languages
 - Better documentation
 - Testing other approaches, like HMM
 - Publishing the package on PyPI
@@ -265,6 +278,13 @@ No copy rights were harmed in the making of this project.
 Supporting and Contributing
 ---------------------------
 
+Sponsors
+~~~~~~~~
+
+- **July 2015**: `Michele
+Gianella <https://plus.google.com/+michelegianella/about>`__
+generously supported the development of the boundary adjustment code
+
 Supporting
 ~~~~~~~~~~
 

VERSION

Lines changed: 1 addition & 1 deletion
@@ -1 +1 @@
-1.0.3
+1.0.4

aeneas/__init__.py

Lines changed: 3 additions & 1 deletion
@@ -6,6 +6,7 @@
 to automagically synchronize audio and text.
 """
 
+from aeneas.adjustboundaryalgorithm import AdjustBoundaryAlgorithm
 from aeneas.analyzecontainer import AnalyzeContainer
 from aeneas.audiofile import AudioFile
 from aeneas.container import Container, ContainerFormat
@@ -27,6 +28,7 @@
 from aeneas.synthesizer import Synthesizer
 from aeneas.task import Task, TaskConfiguration
 from aeneas.textfile import TextFile, TextFileFormat, TextFragment
+from aeneas.vad import VAD
 from aeneas.validator import Validator
 
 __author__ = "Alberto Pettarin"
@@ -35,7 +37,7 @@
 Copyright 2013-2015, ReadBeyond Srl (www.readbeyond.it)
 """
 __license__ = "GNU AGPL v3"
-__version__ = "1.0.3"
+__version__ = "1.0.4"
 __email__ = "[email protected]"
 __status__ = "Production"
 
