ITE Technical Report

Online edition: ISSN 2424-1970

Volume 49, Number 4

Media Engineering

Artistic Image Technology

Multi-media Storage

Sport Information Processing

Workshop Date : 2025-02-18 - 2025-02-19 / Issue Date : 2025-02-11

[PREV] [NEXT]

[TOP] | [2019] | [2020] | [2021] | [2022] | [2023] | [2024] | [2025] | [Japanese] / [English]

[PROGRAM] [BULK PDF DOWNLOAD]


Table of contents

MMS2025-1 ME2025-1 AIT2025-1 SIP2025-1
Momentum-Aware Difficulty Computation for Music Games and Its Application to Stage Data Editing
Ryosuke Nosaka, Yoshinori Dobashi (Hokkaido Univ.)
pp. 1 - 6

MMS2025-2 ME2025-2 AIT2025-2 SIP2025-2
Extraction of Important Scenes by Multimodal LLM Using Video and Speech Transcription Data -- A Study on the Accurate Understanding of Timestamp Information --
Tomoki Haruyama, Cheng Zhou (NTT DOCOMO)
pp. 7 - 12

MMS2025-3 ME2025-3 AIT2025-3 SIP2025-3
A Note on Performance Improvement of Visual Emotion Classification via Multimodal LLM Introducing Text Prompt Optimization
Ryo Takahashi, Naoki Saito, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 13 - 17

MMS2025-4 ME2025-4 AIT2025-4 SIP2025-4
A Note on Image-to-music Generation via Musical Caption Based on In-context Learning
Shilin Liu, Kyohei Kamikawa, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 18 - 22

MMS2025-5 ME2025-5 AIT2025-5 SIP2025-5
Automatic Identification Using GAN for Human-Unreadable License Plates
Atsushi Ichiyanagi, Ryuya Uda (TUT)
pp. 23 - 28

MMS2025-6 ME2025-6 AIT2025-6 SIP2025-6
Efficient Physics Informed Dynamic Neural Fluid Fields Reconstruction From Sparse Video
Yangcheng Xiang, Yoshinori Dobashi (Hokudai)
pp. 29 - 33

MMS2025-7 ME2025-7 AIT2025-7 SIP2025-7
A Note on Interpretability of Visual Language Model by Few-shot Learning based on the Linear Representation Hypothesis
Hiroki Okamura, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 34 - 39

MMS2025-8 ME2025-8 AIT2025-8 SIP2025-8
Study on matching method between satellite and ground images using CLIP model
Momoko Maezawa, Rei Endo, Naotsuna Fujimori, Takahiro Mochizuki (NHK)
pp. 40 - 45

MMS2025-9 ME2025-9 AIT2025-9 SIP2025-9
Enhancing Attracting-and-Dispersing Source-Free Domain Adaptation with Vision-and-Language Model
Xinqi Shu (TMU), Shuhei Tarashima (NTT Com), Norio Tagawa (TMU)
pp. 46 - 51

MMS2025-10 ME2025-10 AIT2025-10 SIP2025-10
Effectiveness Verification of Introducing Model Merging in Federated Learning -- Investigation from Multi-domain Image Classification Tasks --
Kenta Kubota, Reb Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 52 - 56

MMS2025-11 ME2025-11 AIT2025-11 SIP2025-11
Developing an Intersection Risk Estimation Using Semantic Segmentation
Kazuto Nakagawa, Kazuma Nishimura, Jin Kadotani, Osamu Sugiyama, Masahiro Tada (Kindai Univ.)
pp. 57 - 61

MMS2025-12 ME2025-12 AIT2025-12 SIP2025-12
Analysis of mountain climbing motion using loads on forefoot, midfoot and rearfoot
Satoshi Shimada, Daisuke Kawamura (Nihon Univ.)
pp. 62 - 65

MMS2025-13 ME2025-13 AIT2025-13 SIP2025-13
バドミントンラケット検出のための特化型データセット構築
Muhammad Abdul Haq (TMU), Tarashima Shuhei (NTT Com), Tagawa Norio (TMU)
pp. 66 - 70

MMS2025-14 ME2025-14 AIT2025-14 SIP2025-14
Evaluating Existing Dense Pose Estimators on Diverse Body Shape Images
BoTao Zhang (TMU), Shuhei Tarashima (NTT Communications), Norio Tagawa (TMU)
pp. 71 - 76

MMS2025-15 ME2025-15 AIT2025-15 SIP2025-15
Trial for Investigating Relationship between Older Drivers' Physical Function and Vehicle Control Behavior Using RTK-GNSS
Ayumu Tsujimura (Kindai Univ..), Shohei Kagino (Morinomiya Univ. of Medical Sciences), Shingo Moriizumi (Tezukayama Univ..), Yoshio Fujita (CPUOHS), Kazumi Renge (Tezukayama Univ..), Osamu Sugiyama, Masahiro Tada (Kindai Univ..)
pp. 77 - 80

MMS2025-16 ME2025-16 AIT2025-16 SIP2025-16
Improving Object Detection Performance in Low-Light Scenes using Add-On Far-Infrared System
Hikaru Fukushima, Isamu Takai (TOYOTA CENTRAL R&D LABS., INC.)
pp. 81 - 85

MMS2025-17 ME2025-17 AIT2025-17 SIP2025-17
A Note on Sensitivity Evaluation of Novel View Synthesis Metrics in 3D Scenes with Limited Conditions
Haoyang Wang, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 86 - 90

MMS2025-18 ME2025-18 AIT2025-18 SIP2025-18
A Note on Personalized Anomaly Detection Based on Vision Language Model Using Image Prompt
Haruka Matsuda, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 91 - 95

MMS2025-19 ME2025-19 AIT2025-19 SIP2025-19
Experimental Discussion on the Attention Mechanism of HEVC and 3D CG Images Using Vision Transformer
Norifumi Kawabata (Computational Imaging Lab)
pp. 96 - 101

MMS2025-20 ME2025-20 AIT2025-20 SIP2025-20
Trial for Recognizing Hazards in Traffic Scene Using a Vision-Language Model
Kazuma Nishimura, Kazuto Nakagawa, Temma Okamoto, Osamu Sugiyama, Masahiro Tada (Kindai Univ.)
pp. 102 - 106

MMS2025-21 ME2025-21 AIT2025-21 SIP2025-21
[Special Talk] Efforts at Hokkaido University Data-Driven Interdisciplinary Research Emergence Department -- Establishing Foundations for Advanced Interdisciplinary Research and Digital Core Human Resource Development to Address Regional Issues --
Miki Haseyama, Yusuke Mizutani, Kosui Horiuchi, Hiroyuki Sasaki (Hokkaido Univ.)
pp. 107 - 110

MMS2025-22 ME2025-22 AIT2025-22 SIP2025-22
[Special Talk] Developing Next-generation Infrastructure Maintenance Technologies with NEXCO EAST Group
Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 111 - 113

MMS2025-23 ME2025-23 AIT2025-23 SIP2025-23
[Special Talk] Development and Practice of Problem-Solving Research for Promoting Social Implementation in Collaboration with Nitori Holdings Co., Ltd
Atsuo Maruyama, Ren Togo, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 114 - 116

MMS2025-24 ME2025-24 AIT2025-24 SIP2025-24
[Special Talk] Development and operation of Hokkaido University's Digital Reskilling Program
Hisashi Ukawa, Takahiro Ogawa, Keisuke Maeda, Yusuke Mizutani, Katsutoshi Kondo, Miki Haseyama (Hokkaido Univ.)
pp. 117 - 120

MMS2025-25 ME2025-25 AIT2025-25 SIP2025-25
[Special Talk] Government-academia collaboration efforts with the Hokkaido Development Bureau
Katsutoshi Kondo (Hokkaido Univ.), Mitsuaki Yonemoto (Hokkaido Regional Development Bureau, Ministry of Land, Infrastr)
pp. 121 - 124

MMS2025-26 ME2025-26 AIT2025-26 SIP2025-26
[Special Talk] Withered Tree Detection Technology by Semantic Segmentation and Depth Estimation for Efficient Daily Inspection on Highways
Naoki Saito, Kazuki Yamamoto, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 125 - 128

MMS2025-27 ME2025-27 AIT2025-27 SIP2025-27
[Special Talk] Technology of Finding Generation Using Vision Language Model for Efficient Bridge Inspection on Highway
Tatsuki Seino, Naoki Saito, Keisuke Maeda, Takahiro Ogawa, MIki Haseyama (Hokkaido Univ)
pp. 129 - 132

MMS2025-28 ME2025-28 AIT2025-28 SIP2025-28
[Special Talk] A Note on Customer Interest Estimation Method Based on Multiple Transformer Models Using Real Store Video Data
Teruhisa Yamashiro (NDB), Ren Togo (Hokkaido Univ.), Yuki Honma, Yu Yoshida (NDB), Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 133 - 138

MMS2025-29 ME2025-29 AIT2025-29 SIP2025-29
[Special Talk] A Note on Supporting Interior Coordination Using Image Generation and Complementary Recommendation Techniques
Keigo Sakurai, Hiroki Okamura, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 139 - 143

MMS2025-30 ME2025-30 AIT2025-30 SIP2025-30
[Special Talk] Damage Classification Using Road Attachment Images Based on Vision Transformer and Vision Language Model
Koshi Watanabe, Keisuke Maeda, Reb Togo, Takahiro Ogawa, Miki Haseyama (HU)
pp. 144 - 148

MMS2025-31 ME2025-31 AIT2025-31 SIP2025-31
[Special Talk] Advanced Finding Generation AI Based on In-context Learning for Inspection Report Creation
Masaya Sato, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 149 - 154

MMS2025-32 ME2025-32 AIT2025-32 SIP2025-32
[Special Talk] Event Location Prediction from Urgent Calls based on Fine-tuning of Speech Recognition Models for Geographic Name Recognition
Masaki Yoshida, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 155 - 160

MMS2025-33 ME2025-33 AIT2025-33 SIP2025-33
Experiments on image recognition by optoelectronic deep neural network with scattering medium insertion
Kaito Inoue, Takumi Hashiguchi, Taichi Takatsu, Rio Tomioka (Kyutech), Atsushi Shibukawa (Hokkaido Univ.), Masanori Takabayashi (Kyutech)
pp. 161 - 166

MMS2025-34 ME2025-34 AIT2025-34 SIP2025-34
Wavefront shaping-assisted multisite two-photon microscopy for recording distributed neuronal activity across the mouse cortex
Hyojeong Shon (Hokkaido Univ.), Atsushi Shibukawa, Hideharu Mikami (RIES, Hokkaido Univ.)
pp. 167 - 168

MMS2025-35 ME2025-35 AIT2025-35 SIP2025-35
Investigation of the dependence of the accuracy of signal beam detection on the center wavelength during the partially coherent readout by the Transport of Intensity Equation method
Tomohiro Nishimura, Masatoshi Bunsen (Fukuoka Univ.)
pp. 169 - 172

MMS2025-36 ME2025-36 AIT2025-36 SIP2025-36
Reducing Computational Cost of 3D Human Pose and Shape Estimation Using Group-Mix Attention
Yushan Wang (TMU), Shuhei Tarashima (NTT Com), Norio Tagawa (TMU)
pp. 173 - 178

MMS2025-37 ME2025-37 AIT2025-37 SIP2025-37
Study on Investigating Effectiveness of Lane Positioning Assistance from Road-Side Equipment in Low-Visibility Conditions Using Eye-Tracking HMD
Kidai Morimoto, Ryutaro Ohta (Kindai Univ.), Hidekatsu Hamaoka (Akita Univ.), Toru Hagiwara (RMEC), Sho Takahashi (Hokkaido Univ.), Toshihiro Hiraoka (JARI), Kazumi Renge, Shingo Moriizumi (Tezukayama Univ.), Masahiro Tada (Kindai Univ.)
pp. 179 - 182

MMS2025-38 ME2025-38 AIT2025-38 SIP2025-38
Retrieval-based Nutrition Estimation from Food Images
Satayu Parinayok, Yoko Yamakata, Kiyoharu Aizawa (UTokyo)
pp. 183 - 188

MMS2025-39 ME2025-39 AIT2025-39 SIP2025-39
A Note on the Effectiveness of Brain Activity Information Against Adversarial Attacks -- Utilization of Image Reconstruction Method from Brain Signals Using Generative Models --
Tasuku Nakajima, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 189 - 193

MMS2025-40 ME2025-40 AIT2025-40 SIP2025-40
A Note on Consideration of Spatial Integrity in Continuous 3D Scene Generation Method
Yuki Era, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama (Hokkaido Univ.)
pp. 194 - 198

Note: Each article is a technical report without peer review, and its polished version will be published elsewhere.


The Institute of Image Information and Television Engineers (ITE), Japan