More

icer2020 · on May 17, 2021

The very first run needs to wait for the machine translation model to be downloaded so it won't be quick. For the subsequent runs, the model will be reused as a cache.

icer2020 · on May 17, 2021

Hi, stanchsk,

After installation, simply passing in the source and target language codes should do the job. E.g., $ subaligner -m dual -v video.mp4 -s subtitle.srt -t eng,spa

Or just translate without synchronisation: $ subaligner_convert -i subtitle_en.srt -o subtitle_es.srt -t eng,spa

stanchsk · on May 17, 2021

Thanks. How can I find which language codes I should use?

icer2020 · on May 17, 2021

You can get them displayed by running: $ subaligner -lgs (on the first column). Notably, it is not the comprehensive list.

icer2020 · on May 16, 2021

Subaligner provides a one-stop solution on automatic subtitle synchronisation and translation between 1,000+ language pairs.

icer2020 · on Feb 22, 2021

It does not produce audio itself but markups used for generating speech audio.

icer2020 · on Feb 22, 2021

Wiki2SSML eases the burden of voice editors preparing scripts in SSML, widely understood by modern speech synthesizers including Amazon Polly, Google TTS, IBM Watson TTS and Microsoft Azure TTS, etc. It is powered by WikiVoice which provides an unobtrusive solution of blending voice-tuning markups with free texts and creates seamless experiences of editing scripts and voices in one go.

narrationbox · on Feb 22, 2021

It looks great, have you considered adding a visual editor?

We have one for our systems: https://narrationbox.com

jnwatson · on Feb 22, 2021

Nice plug. Your digital marketing is on point. I didn't know this was a thing, and now I do, and I'm adjacent to someone that would be in the market. Well done.

icer2020 · on Feb 22, 2021

Very handy indeed. I wonder if a plugin can be made for Wiki2SSML on Narration Box.

narrationbox · on Feb 23, 2021

I will look into it, Wiki2SSML looks very handy.

icer2020 · on Feb 23, 2021

Thanks and your select&annotate&render approach is definitely a cool solution. Some editors such as Wiki-authors may prefer plain text editing so I feel there could be a middle ground where users can toggle on or off the raw markups.

icer2020 · on Feb 1, 2021

Just realised another user reported that it did not work well for Russian movie and Polish subtitles. Nonetheless, it doesn't stop you from training your own subaligner with those media assets you possess.

icer2020 · on Feb 1, 2021

Yes, subaligner should work for audio files as it does for video files.

icer2020 · on Feb 1, 2021

Oh good to know! Never tried that combination before. Maybe this was due to the model pre-trained with the speech in English. Nonetheless, have you tried switching off the stretch with "-so"?

icer2020 · on Feb 1, 2021

The model was trained with features of human voice bound to a frequency range so it may work for "cross-language" sync. Why not give it a go and check the quality? It won't change the content of original segments but only shift them along the timeline if there are gaps.

icer2020 · on Feb 1, 2021

A useful link. Need to do some comparison against different genres. Subaligner is just yet another tool and not one of a kind.