Hi all!
Having switched from Android to iOS, I thought that the latest versions of TalkBack significantly surpass Voiceover in the possibilities of image recognition. However, I recently found that if the image, say, from the channel in Telegram, save the photo in the standard application, then Voiceover provides much more details about its contents. For example, recently I needed to get acquainted with the schedule presented in the form of a screenshot from Google Sheets -- In the messenger where I received this photo, Voiceover simply reported something like “black text on a white background”, but after saving the same image in the photo application, the “recognize text” button appeared, which not only successfully extracted the table from the image, but even offered buttons to quickly add the relevant events to the calendar. It looks really amazing!
But the problem is that exporting each image in this way for a long time and inconvenient. Why, if there are already technologies for optical recognition of such quality in the system, not provide Voiceover users with a command for their quick use in any application?
Of course, in the Voiceover settings, I turned on and downloaded the necessary packages to recognize the text and images, but these functions work somehow strangely, without telling anything more than "black text on a white background".