| Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
| IEICE-IE, IEICE-ITS, MMS, ME, AIT, SIP |
2026-02-20 10:30 |
Hokkaido |
|
[Special Talk]
Anomaly Detection Using Semantic Segmentation Model and Large Vision-Language Model for Efficient Daily Inspection on Highways Ren Tasai, Xiang Li, Ryota Goka, Naoki Saito, Keisuke Maeda (Hokkaido Univ.), Fumiyuki Kamada (Nexco-Engineering Hokkaido), Ryushi Kubo (NEXCO-East Engineering), Yuji Kawasaki (East Nippon Expressway Kanto Regional Head Office), Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.) |
This paper proposes an anomaly detection method for road facilities based on a semantic segmentation model and a vision ... |
MMS2026-24 ME2026-24 AIT2026-24 SIP2026-24 pp.101-105 |
| IEICE-IE, IEICE-ITS, MMS, ME, AIT, SIP |
2026-02-20 15:15 |
Hokkaido |
|
VLM-based Zero-shot Dense Video Captioning of Instructional Videos Using Hand-Centric Object Context Riku Yamaguchi, Yota Yamamoto, Ryosuke Furuta, Yukinobu Taniguchi (TUS) |
Against the background of labor shortages in the manufacturing and service industries, there is growing demand for autom... |
MMS2026-36 ME2026-36 AIT2026-36 SIP2026-36 pp.157-160 |
| ME, AIT, MMS, IEICE-IE, IEICE-ITS, SIP |
2025-02-18 15:55 |
Hokkaido |
Hokkaido Univ. |
A Note on Personalized Anomaly Detection Based on Vision Language Model Using Image Prompt Haruka Matsuda, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.) |
This paper presents a personalized anomaly detection method based on the Vision Language Model (VLM) that utilizes image... |
MMS2025-18 ME2025-18 AIT2025-18 SIP2025-18 pp.91-95 |
| ME, AIT, MMS, IEICE-IE, IEICE-ITS, SIP |
2025-02-18 16:25 |
Hokkaido |
Hokkaido Univ. |
Trial for Recognizing Hazards in Traffic Scene Using a Vision-Language Model Kazuma Nishimura, Kazuto Nakagawa, Temma Okamoto, Osamu Sugiyama, Masahiro Tada (Kindai Univ.) |
For the societal implementation of autonomous driving systems, it is essential that these systems understand collision r... |
MMS2025-20 ME2025-20 AIT2025-20 SIP2025-20 pp.102-106 |
| IEICE-SIS, BCT |
2024-10-03 14:30 |
Hokkaido |
Hokusei Gakuen Univ. (Primary: On-site, Secondary: Online) |
Object Location Interpretation for Service Robots using Vision-Language Model and Object Detection Model Kosei Yamao, Daiju Kanaoka, Kosei Isomoto, Hakaru Tamukoh (Kyutech) |
Service robots are required to understand and execute various commands from humans. However, robots have challenges reco... |
|
| ME, IST, IEICE-BioX, IEICE-SIP, IEICE-MI, IEICE-IE |
2024-06-06 14:10 |
Niigata |
Niigata University (Ekinan-Campus "TOKIMATE") |
A trial for recognizing traffic scene using a Vision-Language Model Kazuto Nakagawa, Kazuma Nishimura, Osamu Sugiyama, Masahiro Tada (Kindai Univ.) |
For the societal implementation of autonomous driving systems, it is essential that these systems understand the behavio... |
IST2024-25 ME2024-50 pp.15-18 |
| AIT, IIEEJ, AS, CG-ARTS |
2024-03-05 15:16 |
Tokyo |
Tokyo University of Technology |
A Fundamental Study on 3D CG Image Quality Assessment in Vision & Language Based on Stable Diffusion Norifumi Kawabata (Kanazawa Gakuin Univ.) |
GPT-4, which is a multimodal large-scale language model, was released on March 14, 2023. GPT-4 is equipped with Transfor... |
AIT2024-115 pp.288-291 |