AIツールが視覚障害者にも3Dモデリングを可能に(New AI tool opens 3D modeling to blind and low-vision programmers)

2025-10-07 ミシガン大学

ミシガン大学の研究チームは、視覚障害のあるプログラマが3Dモデリングを行えるよう支援するAIツール「A11yShape」を開発した。このツールはコードベースの3D設計環境「OpenSCAD」と連携し、LLM(大規模言語モデル)がモデル構造を自動解析して音声やテキストで説明。部品構造と対応コード、図形を相互にハイライトする機能により、視覚に頼らず設計意図を把握できる。盲目・弱視プログラマ4名の実験では、12種のモデルを作成・修正でき、AI補助が操作理解と設計精度を大幅に向上させた。研究はarXivに公開された。

AIツールが視覚障害者にも3Dモデリングを可能に(New AI tool opens 3D modeling to blind and low-vision programmers)
Figure 1: With A11yShape, (A) a blind or low-vision (BLV) user can create, interpret, and verify 3-D models through (B) a user interface composed of three parts: Code Editor Panel, AI Assistant Panel, and Model Panel. . Image credit: Study authors, CC 4.0

<関連情報>

A11yShape: 視覚障碍者および低視力のプログラマー向けの AI 支援 3D モデリング A11yShape: AI-Assisted 3-D Modeling for Blind and Low-Vision Programmers

Zhuohao Jerry Zhang, Haichang Li, Chun Meng Yu, Faraz Faruqi, Junan Xie, Gene S-H Kim, Mingming Fan, Angus G. Forbes, Jacob O. Wobbrock, Anhong Guo, Liang He
arXive  last revised 7 Aug 2025 (this version, v2)
DOI:https://doi.org/10.48550/arXiv.2508.03852

Abstract

Building 3-D models is challenging for blind and low-vision (BLV) users due to the inherent complexity of 3-D models and the lack of support for non-visual interaction in existing tools. To address this issue, we introduce A11yShape, a novel system designed to help BLV users who possess basic programming skills understand, modify, and iterate on 3-D models. A11yShape leverages LLMs and integrates with OpenSCAD, a popular open-source editor that generates 3-D models from code. Key functionalities of A11yShape include accessible descriptions of 3-D models, version control to track changes in models and code, and a hierarchical representation of model components. Most importantly, A11yShape employs a cross-representation highlighting mechanism to synchronize semantic selections across all model representations — code, semantic hierarchy, AI description, and 3-D rendering. We conducted a multi-session user study with four BLV programmers, where, after an initial tutorial session, participants independently completed 12 distinct models across two testing sessions, achieving results that aligned with their own satisfaction. The result demonstrates that participants were able to comprehend provided 3-D models, as well as independently create and modify 3-D models — tasks that were previously impossible without assistance from sighted individuals.

1603情報システム・データ工学
ad
ad
Follow
ad
タイトルとURLをコピーしました