I don't know where to put this so I hope this is the right place.
I just typed in audio description into youtube and came across this.
I honestly don't like it, I feel like AI isn't there yet when it comes to describing things, also you can hear that the voice is AI if you listen carefully.
It's ok I guess for describing a nature doc or something but I feel it doesn't have that human touch so wouldn't work for a horror movie or comedy for example.
The idea is interesting, but I hope that people like game studios and so on don't pick this up because it's easy.
Anyway; here's there youtube link: https://www.youtube.com/@Visonic-AI-m2c and here's their website: https://visonicai.com/
What do you guys think?
Comments
They should use the Microsoft Neural voices
Those Microsoft voices are so good. But again, someone needs to supervise how the TTS pronounces names etc. It shouldn't sound weird.
Also, I doubt if this can be utilized for actual movies and shows. Adding AD in such content requires carefull attention to place AD exactly at the right place, in between dialogs. An AI can certainly analyse the dialogs and the gaps between dialogs, but doubt if this one is doing it yet.
The page says it can annalise the movie for silences.
I like AI for what it can do for us but then things like this come along.
I wonder how much practise they have at actually audio describing things? Do they have actual audio describers backing this product? I doubt it.
Their two demos show the exact same movie clip and one for the slightly longer one and i'm just not impressed at all.
Maybe i'm just old but I just can't see a world where something like this would be prefered over a human narrator.
Games
Honestly if it's a way to get more accessibility in games I'd be for it, even if we have to wean them off it later. A fair number of games are adding menu narration with no other accessibility features for us and if this gives them another simple drop in tool at least that's more than we would get otherwise.