Paper Abstract and Keywords |
Presentation |
2020-06-04 14:00
An experimental comparison of CNN- and CRNN-CTC for automatic phrase speech recognition systems using a children's speech database Yunzhe Wang, Yu Tian (Hokkaido Univ.), Yoshikazu Miyanaga (CIST), Hiroshi Tsutsui (Hokkaido Univ.) |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Children's speech recognition remains a challenging problem. For children's speech, the accuracy of conventional phrase speech recognition approaches is significantly low, mainly owing to the high variability of pronunciation patterns due to children's physical activity. Motivated by this, we present a phrase speech recognition system using neural networks. We use a convolutional neural network (CNN) and a variant that combines it with a recurrent neural network (RNN), referred to as a CRNN. Both approaches use a connectionist temporal classification (CTC) loss function, which allows the networks to be trained without any prior alignment. Through experiments on a children's speech database, we compare the CNN-CTC and CRNN-CTC approaches. |
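As a rough illustration of why a CTC loss needs no prior alignment, the sketch below computes the CTC negative log-likelihood of a label sequence by summing over every frame-level path that collapses to it. This is a generic textbook-style sketch in plain Python, not the authors' implementation; the function names, the choice of index 0 for the blank symbol, and the toy probabilities are all assumptions made for illustration.

```python
import math
from itertools import product

BLANK = 0  # assumed index of the CTC blank symbol (illustrative choice)

def ctc_collapse(path):
    """Map a frame-level path to labels: merge repeats, then drop blanks."""
    out, prev = [], None
    for s in path:
        if s != prev and s != BLANK:
            out.append(s)
        prev = s
    return out

def ctc_neg_log_likelihood(probs, target):
    """CTC loss via the forward algorithm.

    probs[t][k] is the network's probability of symbol k at frame t.
    target is the label sequence without blanks.
    """
    # Interleave blanks: [a, b] -> [blank, a, blank, b, blank]
    ext = [BLANK]
    for s in target:
        ext += [s, BLANK]
    T, S = len(probs), len(ext)

    alpha = [0.0] * S
    alpha[0] = probs[0][ext[0]]
    if S > 1:
        alpha[1] = probs[0][ext[1]]

    for t in range(1, T):
        new = [0.0] * S
        for s in range(S):
            a = alpha[s]
            if s >= 1:
                a += alpha[s - 1]
            # Skipping a blank is allowed only between two different labels.
            if s >= 2 and ext[s] != BLANK and ext[s] != ext[s - 2]:
                a += alpha[s - 2]
            new[s] = a * probs[t][ext[s]]
        alpha = new

    total = alpha[-1] + (alpha[-2] if S > 1 else 0.0)
    return -math.log(total) if total > 0 else math.inf

def brute_force_likelihood(probs, target):
    """Reference check: sum the probability of every path that collapses to target."""
    num_symbols = len(probs[0])
    total = 0.0
    for path in product(range(num_symbols), repeat=len(probs)):
        if ctc_collapse(path) == list(target):
            p = 1.0
            for t, s in enumerate(path):
                p *= probs[t][s]
            total += p
    return total

# Toy input: 3 frames, symbols {0: blank, 1, 2}; values are made up.
toy_probs = [
    [0.2, 0.7, 0.1],
    [0.5, 0.3, 0.2],
    [0.1, 0.2, 0.7],
]
toy_loss = ctc_neg_log_likelihood(toy_probs, [1, 2])
```

The loss depends only on the per-frame output distribution and the label sequence, so no frame-to-label alignment has to be supplied during training. In practice one would use a framework's built-in CTC loss rather than this sketch.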
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Children's speech recognition / convolutional recurrent neural network (CRNN) / connectionist temporal classification (CTC) |
Reference Info. |
ITE Tech. Rep. |
Conference Information |
Committee |
3DMT IEICE-SIS IPSJ-AVM |
Conference Date |
2020-06-03 - 2020-06-04 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
G Square (Hakodate Community Plaza) |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Intelligent Multimedia Systems, Applied Embedded Systems, Three-Dimensional Image Technology (3DIT), etc. |
Paper Information |
Registration To |
IEICE-SIS |
Conference Code |
2020-06-SIS-AVM-3DIT |
Language |
English |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
An experimental comparison of CNN- and CRNN-CTC for automatic phrase speech recognition systems using a children's speech database |
Keyword(1) |
Children's speech recognition |
Keyword(2) |
convolutional recurrent neural network (CRNN) |
Keyword(3) |
connectionist temporal classification (CTC) |
1st Author's Name |
Yunzhe Wang |
1st Author's Affiliation |
Hokkaido University (Hokkaido Univ.) |
2nd Author's Name |
Yu Tian |
2nd Author's Affiliation |
Hokkaido University (Hokkaido Univ.) |
3rd Author's Name |
Yoshikazu Miyanaga |
3rd Author's Affiliation |
Chitose Institute of Science and Technology (CIST) |
4th Author's Name |
Hiroshi Tsutsui |
4th Author's Affiliation |
Hokkaido University (Hokkaido Univ.) |
Speaker |
Author-1 |
Date Time |
2020-06-04 14:00:00 |
Presentation Time |
20 minutes |
Registration for |
IEICE-SIS |
Volume (vol) |
vol.44 |