-
Notifications
You must be signed in to change notification settings - Fork 0
Description
Hi there,
Thanks for the wonderful and clean front-end. Great work.
I did similar work in https://github.com/mxav1111/pdf_text_audio but through command line (parameterized through environment variables just as yours). While thinking about creating front-end , I stumbled upon this.
Front-end isn't really my forte. But if you are ok, we can mix match these both repository and provide all those options on front-end to generate audio (from scanned pdf), it would be great. I can talk to couple of other friends who are involved and we can put full working front-end with these all features (the ones you already have and the ones we will need to add front-end for options that are on command line) on NGO websites working extensively on literature and scriptures.
Microsoft based apis for pdf - text - mp3/wav, along with scripting for adding background music is also completed. Will upload shortly on pdf_text_audio repository. Everything in pdf_txt_audio is command line based and uses environment variables for everything.
PDF to TXT requires some involvement due to the fact that after text file creation, we need to perform manual correction including problems posed (especially by vision apis) .
Haven't spent much time on documentation but hopefully we can mix match this?
This seem to have great potential. In addition to options that are covered by environment variables, we also need to add few more such as emotion selection (if available for selected language), upload of pdf, download of text, upload of corrected text file, generating wav or mp3 , choice for quality , choice of background music , whether to split as well as merge of audio files and finally download of all this.
Website is just recently created with LOTs to do .. you can see some of the audio created from existing pdfs at https://awgpsac.org/gp
We use signal and if possible, we can regroup there... still need to upload updated scripts. Will do shortly.
What do you think? Thanks again for wonderful work.