Hacker Newsnew | past | comments | ask | show | jobs | submit | icer2020's commentslogin

The very first run needs to wait for the machine translation model to be downloaded so it won't be quick. For the subsequent runs, the model will be reused as a cache.


Hi, stanchsk,

After installation, simply passing in the source and target language codes should do the job. E.g., $ subaligner -m dual -v video.mp4 -s subtitle.srt -t eng,spa

Or just translate without synchronisation: $ subaligner_convert -i subtitle_en.srt -o subtitle_es.srt -t eng,spa


Thanks. How can I find which language codes I should use?


You can get them displayed by running: $ subaligner -lgs (on the first column). Notably, it is not the comprehensive list.


Subaligner provides a one-stop solution on automatic subtitle synchronisation and translation between 1,000+ language pairs.


It does not produce audio itself but markups used for generating speech audio.


Wiki2SSML eases the burden of voice editors preparing scripts in SSML, widely understood by modern speech synthesizers including Amazon Polly, Google TTS, IBM Watson TTS and Microsoft Azure TTS, etc. It is powered by WikiVoice which provides an unobtrusive solution of blending voice-tuning markups with free texts and creates seamless experiences of editing scripts and voices in one go.


It looks great, have you considered adding a visual editor?

We have one for our systems: https://narrationbox.com


Nice plug. Your digital marketing is on point. I didn't know this was a thing, and now I do, and I'm adjacent to someone that would be in the market. Well done.


Very handy indeed. I wonder if a plugin can be made for Wiki2SSML on Narration Box.


I will look into it, Wiki2SSML looks very handy.


Thanks and your select&annotate&render approach is definitely a cool solution. Some editors such as Wiki-authors may prefer plain text editing so I feel there could be a middle ground where users can toggle on or off the raw markups.


Just realised another user reported that it did not work well for Russian movie and Polish subtitles. Nonetheless, it doesn't stop you from training your own subaligner with those media assets you possess.


Yes, subaligner should work for audio files as it does for video files.


Oh good to know! Never tried that combination before. Maybe this was due to the model pre-trained with the speech in English. Nonetheless, have you tried switching off the stretch with "-so"?


The model was trained with features of human voice bound to a frequency range so it may work for "cross-language" sync. Why not give it a go and check the quality? It won't change the content of original segments but only shift them along the timeline if there are gaps.


A useful link. Need to do some comparison against different genres. Subaligner is just yet another tool and not one of a kind.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: