ETRI Knowledge Sharing Platform : Analysis and Validation of Cross-Modal Generative Adversarial Network for Sensory Substitution

BROWSE

Titles

논문 검색
Type		SCI
Year	~	Keyword

Detail

List

Journal Article Analysis and Validation of Cross-Modal Generative Adversarial Network for Sensory Substitution

Cited 6 time in scopus

Download 306 time Share share

Authors: Mooseop Kim, YunKyung Park, KyeongDeok Moon, Chi Yoon Jeong

Issue Date: 2021-06

Citation: International Journal of Environmental Research and Public Health, v.18, no.12, pp.1-22

ISSN: 1661-7827

Publisher: MDPI

Language: English

Type: Journal Article

DOI: https://dx.doi.org/10.3390/ijerph18126216

Abstract: Visual-auditory sensory substitution has demonstrated great potential to help visually impaired and blind groups to recognize objects and to perform basic navigational tasks. However, the high latency between visual information acquisition and auditory transduction may contribute to the lack of the successful adoption of such aid technologies in the blind community; thus far, substitution methods have remained only laboratory-scale research or pilot demonstrations. This high latency for data conversion leads to challenges in perceiving fast-moving objects or rapid environmental changes. To reduce this latency, prior analysis of auditory sensitivity is necessary. However, existing auditory sensitivity analyses are subjective because they were conducted using human behavioral analysis. Therefore, in this study, we propose a cross-modal generative adversarial network-based evaluation method to find an optimal auditory sensitivity to reduce transmission latency in visual-auditory sensory substitution, which is related to the perception of visual information. We further conducted a human-based assessment to evaluate the effectiveness of the proposed model-based analysis in human behavioral experiments. We conducted experiments with three participant groups, including sighted users (SU), congenitally blind (CB) and late-blind (LB) individuals. Experimental results from the proposed model showed that the temporal length of the auditory signal for sensory substitution could be reduced by 50%. This result indicates the possibility of improving the performance of the conventional vOICe method by up to two times. We confirmed that our experimental results are consistent with human assessment through behavioral experiments. Analyzing auditory sensitivity with deep learning models has the potential to improve the efficiency of sensory substitution.

KSP Keywords: Auditory signal, Behavioral analysis, Behavioral experiments, Environmental changes, Evaluation method, High latency, Information acquisition, Laboratory-scale, Model-based analysis, Network-based, Proposed model

This work is distributed under the term of Creative Commons License (CCL)
(CC BY)

ETRI-Knowledge Sharing Plaform

BROWSE

Titles

Detail

ETRI