ITE Technical Group Submission System
Conference Paper's Information
Online Proceedings
[Sign in]
 Go Top Page Go Previous   [Japanese] / [English] 

Paper Abstract and Keywords
Presentation 2023-02-22 13:30
A study of the image captioning method with person names
Naotsuna Fujimori, Takahiro Mochizuki (NHK)
Abstract (in Japanese) (See Japanese page) 
(in English) We have been working on image captioning techniques with the main goal of automatically generating audio description. In most conventional image captioning techniques, people are represented by common nouns such as "man" or "woman" instead of their names. However, in audio description, the names of people are indispensable for program comprehension, and a new image captioning technique that can describe the names of people is required. Therefore, we investigated a method to automatically generate captions that include people's names using a person name estimation technique that utilizes open and closed captions assigned to programs and does not require human annotation.
Keyword (in Japanese) (See Japanese page) 
(in English) Image Captioning / Face Detection / Face Recognition / Clustering / Phrase Grounding / / /  
Reference Info. ITE Tech. Rep., vol. 47, no. 6, ME2023-49, pp. 225-230, Feb. 2023.
Paper # ME2023-49 
Date of Issue 2023-02-14 (MMS, ME, AIT) 
ISSN Print edition: ISSN 1342-6893    Online edition: ISSN 2424-1970
Download PDF

Conference Information
Committee MMS ME AIT IEICE-IE IEICE-ITS  
Conference Date 2023-02-21 - 2023-02-22 
Place (in Japanese) (See Japanese page) 
Place (in English) Hokkaido Univ. 
Topics (in Japanese) (See Japanese page) 
Topics (in English) Image Processing, etc. 
Paper Information
Registration To ME 
Conference Code 2023-02-MMS-ME-AIT-IE-ITS 
Language Japanese 
Title (in Japanese) (See Japanese page) 
Sub Title (in Japanese) (See Japanese page) 
Title (in English) A study of the image captioning method with person names 
Sub Title (in English)  
Keyword(1) Image Captioning  
Keyword(2) Face Detection  
Keyword(3) Face Recognition  
Keyword(4) Clustering  
Keyword(5) Phrase Grounding  
Keyword(6)  
Keyword(7)  
Keyword(8)  
1st Author's Name Naotsuna Fujimori  
1st Author's Affiliation Japan Broadcasting Corporation (NHK)
2nd Author's Name Takahiro Mochizuki  
2nd Author's Affiliation Japan Broadcasting Corporation (NHK)
3rd Author's Name  
3rd Author's Affiliation ()
4th Author's Name  
4th Author's Affiliation ()
5th Author's Name  
5th Author's Affiliation ()
6th Author's Name  
6th Author's Affiliation ()
7th Author's Name  
7th Author's Affiliation ()
8th Author's Name  
8th Author's Affiliation ()
9th Author's Name  
9th Author's Affiliation ()
10th Author's Name  
10th Author's Affiliation ()
11th Author's Name  
11th Author's Affiliation ()
12th Author's Name  
12th Author's Affiliation ()
13th Author's Name  
13th Author's Affiliation ()
14th Author's Name  
14th Author's Affiliation ()
15th Author's Name  
15th Author's Affiliation ()
16th Author's Name  
16th Author's Affiliation ()
17th Author's Name  
17th Author's Affiliation ()
18th Author's Name  
18th Author's Affiliation ()
19th Author's Name  
19th Author's Affiliation ()
20th Author's Name  
20th Author's Affiliation ()
Speaker Author-1 
Date Time 2023-02-22 13:30:00 
Presentation Time 15 minutes 
Registration for ME 
Paper # MMS2023-29, ME2023-49, AIT2023-29 
Volume (vol) vol.47 
Number (no) no.6 
Page pp.225-230 
#Pages
Date of Issue 2023-02-14 (MMS, ME, AIT) 


[Return to Top Page]

[Return to ITE Web Page]


The Institute of Image Information and Television Engineers (ITE), Japan