2025-01-09 マサチューセッツ工科大学 (MIT)
<関連情報>
- https://news.mit.edu/2025/teaching-ai-communicate-sounds-humans-do-0109
- https://dl.acm.org/doi/10.1145/3680528.3687679
声でスケッチ:声帯模写による音の 「非フォノリアリスティック 」な表現 Sketching With Your Voice: “Non-Phonorealistic” Rendering of Sounds via Vocal Imitation
Matthew Caren, Kartik Chandra, Joshua Tenenbaum, Jonathan Ragan-Kelley, Karima Ma
SA ’24: SIGGRAPH Asia 2024 Conference Papers Published: 03 December 2024
DOI:https://doi.org/10.1145/3680528.3687679
Abstract
We present a method for automatically producing human-like vocal imitations of sounds: the equivalent of “sketching,” but for auditory rather than visual representation. Starting with a simulated model of the human vocal tract, we first try generating vocal imitations by tuning the model’s control parameters to make the synthesized vocalization match the target sound in terms of perceptually-salient auditory features. Then, to better match human intuitions, we apply a cognitive theory of communication to take into account how human speakers reason strategically about their listeners. Finally, we show through several experiments and user studies that when we add this type of communicative reasoning to our method, it aligns with human intuitions better than matching auditory features alone does. This observation has broad implications for the study of depiction in computer graphics.