You could use the Google cloud api to recognise the text in the image:
You could read the text from the image, translate it and then speak it out using the following API’s:
- Google Cloud Vision
- Google Cloud Translate
- Goofle Cloud Text To Speech
Google documentation explaining how to do it:
Translating text from a photo | Cloud Translation | Google Cloud