New AI app for describing images and video: PiccyBot

By Martijn - Spar…, 1 March, 2024

Forum
iOS and iPadOS

Hello guys,

I have created the free app PiccyBot that speaks out the description of the photo/image you give it. And you can then ask detailed questions about it.

I have adjusted the app to make it as low vision friendly as I could, but I would love to receive feedback on how to improve it further!

The App Store link can be found here:
https://apps.apple.com/us/app/piccybot/id6476859317

I am really hoping it will be of use to some. I have earlier created the app 'Talking Goggles' which was well received by the low vision community, but PiccyBot is a lot more powerful and hopefully useful!

Thanks and best regards,

Martijn van der Spek

Options

Comments

By LaBoheme on Saturday, May 10, 2025 - 03:06

the retry button, i believe, is right below the play button? it is accessible by swipe but not by touch. can we update this so it is accessible by explore/touch? thanks.

By Martijn - Spar… on Saturday, May 10, 2025 - 12:06

As Missy and Winter reported, currently the new audio descriptions with personality 'on' won't work for very long descriptions. The TTS model can't cope with these longer descriptions. Looking into a way around this. For now, the only thing you can do is to reduce the length of the description (set length to 40 or so). Then the audio description should work. If you just want to use voiceover instead, set voice to 'none' and length to 100.

Carter, I will look at the translation of the intro pages, thanks for pointing out Chinese is not working properly yet. Appreciate the offer for help, please contact me privately?

LaBoheme, thanks for noting the retry accessibility, should be no issue to improve that, it's on the list now.

Thanks again guys!

By Missy Hoppe on Saturday, May 10, 2025 - 16:06

I really like both of these suggestions. Maybe have the completion sound be toggleable in settings, but it would be really nice, although, as the other commenter said, assuming the voice starts talking, we know when processing is complete. I keep on the sound as well, so I know when it's processing stuff, but a processing complete sound would be great. Magic tap to start and stop speech would also be an excellent quality of life improvement if making it an option is possible.

By Winter Roses on Saturday, May 10, 2025 - 19:06

It would also be nice to have a voice mode where you could talk to the AI and get the description that way, and even ask follow-up questions. I know most AI programs that use voice also give you a text transcript of what you said and what the AI responded, to help keep track of the conversation. Since this is focused on descriptions and it’s more of a one-off chat instead of an ongoing conversation, I’m not sure if that’s fully necessary, but I do think it would still be really useful.

For reading purposes, it would be great if the transcript stayed on the screen, so if I’m speaking into the microphone, I’d love to have a text version of both what I said and what PiccyBot replied. If this can be implemented, I say definitely go for it! Of course, it should be optional, so people who prefer to type or use both could customize everything in the settings to fit their needs.