Open
Description
When collecting the metadata, the following loop consumes most of the processing time and would benefit from a parallelisation.
pygac-fdr/pygac_fdr/metadata.py
Lines 157 to 181 in 4e52f7d
In combination with #31 this could be implemented using
import multiprocessing
[...]
with multiprocessing.Pool(n_worker) as pool:
for metadata in pool.imap_unordered(extract_metadata, filenames):
session.add(metadata)
[...]