Image to Audio Conversion to Aid Visually Impaired People by CNN

Sivaganesan, D and Venkateshwaran, M (2023) Image to Audio Conversion to Aid Visually Impaired People by CNN. 2023 4th International Conference on Electronics and Sustainable Communication Systems (ICESC). pp. 1707-1713.

Text
Image to Audio Conversion to Aid Visually Impaired People by CNN.pdf - Published Version
Download (151kB)

Official URL: https://doi.org/10.1109/ICESC57686.2023.10193308

Abstract

This study proposes an innovative method for helping people who are blind or visually impaired by converting images into audio. The proposed system uses deep learning algorithms to extract significant features from photos and produces audio descriptions in real time. It is designed to be user-friendly, with a simple interface that helps blind individuals capture and process images easily using a mobile device. A user study was conducted to assess the effectiveness of the proposed method, and the results were encouraging in terms of precision and usability. The approach offers a promising way to give people who are blind or visually impaired an alternative means of perceiving and interacting with their environment, thereby improving their quality of life. The proposed image-to-audio converter aims to overcome the drawbacks of current assistive devices that rely on braille or textual descriptions. With audio descriptions of images, blind people can more easily interpret the visual information needed in daily life, such as recognising items, interpreting signs, or navigating unfamiliar situations. The system makes use of recent deep learning developments that have significantly improved image recognition and natural language processing. As a result, the proposed technique has the potential to offer audio descriptions that are more precise and comprehensive than those of current methods. The technology could be integrated into a variety of products, from smartphones to smart glasses, and could significantly improve the lives of people who are blind or visually impaired.

Item Type:	Article
Uncontrolled Keywords:	'current; Audio description; Convolutional neural network; Deep learning; Innovative method; Real-time; Smart phones; Visually handicapped; Visually impaired; Visually impaired people
Subjects:	A Artificial Intelligence and Data Science > Deep Learning; A Artificial Intelligence and Data Science > Text and Speech Analysis; A Artificial Intelligence and Data Science > Artificial intelligence; E Electronics and Communication Engineering > Image Processing
Divisions:	Computer Science and Engineering
Date Deposited:	25 Jul 2024 03:31
Last Modified:	11 Jan 2025 10:45
URI:	https://ir.psgitech.ac.in/id/eprint/864