ズームバイアス: ビデオ会議中に「音が小さい」ことの社会的コスト(Zoom bias: The social costs of having a ‘tinny’ sound during video conferences)

2025-03-26

2025-03-24 イェール大学

イェール大学の研究によると、ビデオ会議中の音質が話し手の評価に大きく影響することが明らかになりました。具体的には、低品質のマイクによる金属的でこもった音声は、聞き手に話し手の知性や信頼性、さらには魅力を低く評価させる傾向があります。この研究では、参加者に同じ内容の録音を高品質と低品質の音声で聞かせ、その後の評価を比較しました。その結果、音質が悪い場合、話し手の印象が一貫して低下することが確認されました。研究者は、ビデオ会議が一般化した現代において、音質が無意識の偏見や差別の原因となる可能性があると指摘しています。

<関連情報>

表層的な聴覚(流暢でない)バイアスは、より高いレベルの社会的判断を偏らせる Superficial auditory (dis)fluency biases higher-level social judgment

Robert Walter-Terrill, Joan Danielle K. Ongchoco, and Brian J. Scholl
Proceedings of the National Academy of Sciences Published:March 24, 2025
DOI:https://doi.org/10.1073/pnas.2415254122

Significance

In recent years, tools such as videoconferencing have shifted many conversations online, with stark auditory ramifications—such that some voices sound clear and resonant while others sound hollow or tinny, based on microphone quality and characteristics. A series of experiments shows that such differences, while clearly not reflective of the speakers themselves, nevertheless have broad and powerful consequences for social evaluation, leading listeners to make lower judgments of speakers’ intelligence, hireability, credibility, and even romantic desirability. Such effects may be potential sources of unintentional bias and discrimination, given the likelihood that microphone quality is correlated with socioeconomic status. So, before joining your next videoconference, you may want to consider how much a cheap microphone may really be costing you.

Abstract

When talking to other people, we naturally form impressions based not only on what they say but also on how they say it—e.g., how confident they sound. In modern life, however, the sounds of voices are often determined not only by intrinsic qualities (such as vocal anatomy) but also by extrinsic properties (such as videoconferencing microphone quality). Here, we show that such superficial auditory properties can have surprisingly deep consequences for higher-level social judgments. Listeners heard short narrated passages (e.g., from job application essays) and then made various judgments about the speakers. Critically, the recordings were modified to simulate different microphone qualities, while carefully equating listeners’ comprehension of the words. Though the manipulations carried no implications about the speakers themselves, common disfluent auditory signals (as in “tinny” speech) led to decreased judgments of intelligence, hireability, credibility, and romantic desirability. These effects were robust across speaker gender and accent, and they occurred for both human and clearly artificial (computer-synthesized) speech. Thus, just as judgments from written text are influenced by factors such as font fluency, judgments from speech are not only based on its content but also biased by the superficial vehicle through which it is delivered. Such effects may become more relevant as daily communication via videoconferencing becomes increasingly widespread.

月	火	水	木	金	土	日
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30
31