イヤホン内蔵カメラによる視覚情報×AI対話技術(Tiny cameras in earbuds let users talk with AI about what they see)

2026-04-14 ワシントン大学(UW)

米国のUniversity of Washingtonの研究チームは、カメラを内蔵したワイヤレスイヤホン「Vuebuds」を開発した。小型カメラで周囲の映像を取得し、AIが物体や文字、人物の動きをリアルタイムで解析し、音声としてユーザーに伝える仕組みである。これにより、視覚障害者のナビゲーション支援や日常生活の補助が可能となるほか、一般ユーザーにとってもハンズフリーでの情報取得ツールとして活用が期待される。プライバシー保護の観点から、撮影データの処理は端末内で行う設計が採用されている。研究は、ウェアラブル技術とAIの融合により、人間の感覚拡張を実現する新たな方向性を示している。

イヤホン内蔵カメラによる視覚情報×AI対話技術(Tiny cameras in earbuds let users talk with AI about what they see)
UW researchers developed a system called VueBuds that uses tiny cameras in off-the-shelf wireless earbuds to allow users to talk with an AI model about the scene in front of them. Here, the altered headphones are shown with the camera inserted. Photo: Kim et al./CHI ‘26

<関連情報>

VueBuds:ワイヤレスイヤホンによるビジュアルインテリジェンス VueBuds: Visual Intelligence with Wireless Earbuds

Maruchi Kim, Rasya Fawwaz, Zhi Yang Lim, Brinda Moudgalya, Hexi Wang, Yuanhao Zeng, Shyamnath Gollakota
CHI ’26: Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems  Published: 13 April 2026
DOI:https://doi.org/10.1145/3772318.3791322

Abstract

Despite their ubiquity, wireless earbuds remain audio-centric due to size and power constraints. We present VueBuds, the first camera-integrated wireless earbuds for egocentric vision, capable of operating within stringent power and form-factor limits. Each VueBud embeds a camera into a Sony WF-1000XM3 to stream visual data over Bluetooth to a host device for on-device vision language model (VLM) processing. We show analytically and empirically that while each camera’s field of view is partially occluded by the face, the combined binocular perspective provides comprehensive forward coverage. By integrating VueBuds with VLMs, we build an end-to-end system for real-time scene understanding, translation, visual reasoning, and text reading; all from low-resolution monochrome cameras drawing under 5mW through on-demand activation. Through online and in-person user studies with 90 participants, we compare VueBuds against smart glasses across 17 visual question-answering tasks, and show that our system achieves response quality on par with Ray-Ban Meta. Our work establishes low-power camera-equipped earbuds as a compelling platform for visual intelligence, bringing rapidly advancing VLM capabilities to one of the most ubiquitous wearable form factors.

0403電子応用
ad
ad
Follow
ad
タイトルとURLをコピーしました