研究者らが報告:携帯電話の振動から遠隔で会話が検出される(Conversations remotely detected from cellphone vibrations, researchers report)

2025-08-08 ペンシルベニア州立大学(PennState)

ペンシルベニア州立大学の研究チームは、スマートフォンの通話時に発生する微細な振動を、ミリ波レーダーとAI音声認識モデルで読み取り、最大約3メートル(10フィート)離れた位置から会話内容を推定できることを実証しました。これは通信のプライバシーに対する新たな脅威として注目されています。精度は語彙1万単語以内で約60%ですが、キーワードレベルでも文脈的に意味の推定が可能なため、セキュリティ上で無視できないリスクです。研究成果は WiSec 2025(ACM会議)で発表されました。

研究者らが報告:携帯電話の振動から遠隔で会話が検出される(Conversations remotely detected from cellphone vibrations, researchers report)
A team of computer science researchers used a millimeter-wave radar sensor to collect conversations from the vibrations of smartphones and adapted an open-access, large-scale artificial intelligence-integrated speech recognition model, to decode the vibrations into recognizable speech transcriptions. Credit: iStock/martin-dm. All Rights Reserved.

<関連情報>

ワイヤレスタップ:ミリ波レーダーセンシングを用いた電話通話の自動転写 Wireless-Tap: Automatic Transcription of Phone Calls Using Millimeter-Wave Radar Sensing

Suryoday Basak, Mahanth Gowda
WiSec 2025: 18th ACM Conference on Security and Privacy in Wireless and Mobile Networks  Published: 30 June 2025
DOI:https://doi.org/10.1145/3734477.3734708

Abstract

This paper presents WirelessTap, a system that demonstrates the potential for automated speech recognition (ASR) on phone call audio eavesdropped remotely using commercially available frequency modulated continuous wave millimeter-wave (mmWave) radars operating in the 77-81 GHz range. WirelessTap detects minute vibrations from smartphone earpieces, converts them into audio, and processes this audio for speech transcription. This work presents the first full-sentence ASR using mmWave radars on earpiece vibrations using a 10,000-word vocabulary, achieving a 300 cm attack range across multiple smartphone models. It surpasses prior radar-based eavesdropping studies limited to loudspeakers, small vocabularies, or constrained evaluations. To address challenges like the absence of large mmWave radar-based audio datasets, low signal-to-noise ratio, and limited voice frequency ranges extractable from radar data, WirelessTap incorporates synthetic data generation, domain adaptation, and inference using OpenAI’s Whisper ASR model. Our experiments systematically show how word accuracy rate gradually decreases with distance, from as high as 59.25% at 50 cm to 2% at 300 cm; additionally, we deploy this attack to a real-world setting with a user study targeting a victim holding a smartphone to their ear. This paper highlights the evolving risks of artificial intelligence and sensor systems being misused as technology advances.

1603情報システム・データ工学
ad
ad
Follow
ad
タイトルとURLをコピーしました