Ph.D. Student at Hokkaido University
Yuki Abe, Daisuke Sakamoto, and Tetsuo Ono
Overview of Auditory Comment Display (ACD). (a) ACD delivers text comments on videos via text-to-speech synthesis, (b) enabling eyes-free listeners to enjoy a social-viewing experience while listening to music videos. We used music concert videos as example content and explored ACD within this context. This work aims to broaden the accessibility of the social-viewing experience, particularly for eyes-free listeners, by designing and understanding the user experience of listening to comment-to-speech synthesis alongside music videos.
Online music videos on video-sharing platforms offer video comments that viewers can read to enjoy their social-viewing experience. However, because these comments are presented visually as text, they are not accessible to eyes-free listeners, such as those who listen to music videos while jogging, commuting, or showering. To address this gap, we explore Auditory Comment Display (ACD), which offers text comments via text-to-speech (TTS) synthesis, enabling eyes-free listeners to enjoy a social-viewing experience while listening to music videos. We used music concert videos as example content and prototyped varying comment-to-speech styles in this context. We conducted a formative study (N = 8), prototyping (N = 10), and a user study (N = 12). The results indicated that ACD enhanced eyes-free listeners’ social-viewing experience, although it may not be appropriate for specific situations and users. We discuss the design implications and future directions for the eyes-free social-viewing experience via comment-to-speech synthesis.
Yuki Abe, Daisuke Sakamoto, and Tetsuo Ono. “I feel lonely when they stop chatting”: Exploring Auditory Comment Display for Eyes-Free Social-Viewing Experience in Online Music Videos. Proc. ACM Hum.-Comput. Interact. 9, 2, Article CSCW106 (April 2025), 30 pages (to appear at CSCW 2025). [DOI]