Paper Abstract and Keywords |
Presentation |
2024-09-05 16:30
Proposal of an Emotion Recognition System for Improving Video Viewing Experience of Visually Impaired Individuals |
Zhiyuan Ning, Hiroyuki Nakamura (S.I.T) |
Abstract |
(in English) |
The rapid growth of short video platforms like TikTok has highlighted the need for improved accessibility for visually impaired individuals. Traditional audio descriptions require specialized skills and resources, limiting their availability. This study proposes an emotion recognition system that converts visual and vocal emotional cues into accessible auditory outputs. Utilizing convolutional neural networks (CNNs) and recurrent neural networks (RNNs), the system detects and translates emotions from both audio and visual data. By integrating speech emotion recognition using the JVNV dataset and facial expression recognition using the FER2013 dataset, the system enables visually impaired users to perceive emotional changes in videos through sound. Future efforts will focus on enhancing model accuracy, developing a user-friendly interface, and evaluating the system's effectiveness, ultimately aiming to significantly improve the accessibility of video content for visually impaired individuals. |
Keyword |
(in English) |
Deep Learning / Emotion Recognition / Audio Processing / Image Processing |
Reference Info. |
ITE Tech. Rep., vol. 48, no. 29, ME2024-86, pp. 37-40, Sept. 2024. |
Paper # |
ME2024-86 |
Date of Issue |
2024-08-28 (ME) |
ISSN |
Online edition: ISSN 2424-1970 |
|
Conference Information |
Committee |
ME IEICE-EMM IEICE-IE IEICE-LOIS IEE-CMN IPSJ-AVM |
Conference Date |
2024-09-04 - 2024-09-05 |
Place (in English) |
Hiroshima Institute of Technology |
|
Paper Information |
Registration To |
ME |
Conference Code |
2024-09-ME-EMM-IE-LOIS-CMN-AVM |
Language |
Japanese |
Title (in English) |
Proposal of an Emotion Recognition System for Improving Video Viewing Experience of Visually Impaired Individuals |
Keyword(1) |
Deep Learning |
Keyword(2) |
Emotion Recognition |
Keyword(3) |
Audio Processing |
Keyword(4) |
Image Processing |
1st Author's Name |
Zhiyuan Ning |
1st Author's Affiliation |
Shibaura Institute of Technology (S.I.T) |
2nd Author's Name |
Hiroyuki Nakamura |
2nd Author's Affiliation |
Shibaura Institute of Technology (S.I.T) |
Speaker |
Author-1 |
Date Time |
2024-09-05 16:30:00 |
Presentation Time |
20 minutes |
Registration for |
ME |
Volume (vol) |
vol.48 |
Number (no) |
no.29 |
Page |
pp.37-40 |
#Pages |
4 |