Uncovering Bias in ASR Systems

Fuckner, Marcio; Horsman, Sophie; Wiggers, Pascal; Janssen, Iskaj

Samenvatting

It is crucial that ASR systems can handle the wide range of variations in speech of speakers from different demographic groups, with different speaking styles, and of speakers with (dis)abilities. A potential quality-of-service harm arises when ASR systems do not perform equally well for everyone. ASR systems may exhibit bias against certain types of speech, such as non-native accents, different age groups and gender. In this study, we evaluate two widely-used neural network-based architectures: Wav2vec2 and Whisper on potential biases for Dutch speakers. We used the Dutch speech corpus JASMIN as a test set containing read and conversational speech in a human-machine interaction setting. The results reveal a significant bias against non-natives, children and elderly and some regional dialects. The ASR systems generally perform slightly better for women than for men.

Thema

Algemeen

Bestand/Link	Toegang Materialen met beperkte toegang zijn alleen beschikbaar voor bepaalde hogescholen.	Licentie Voor meer informatie over de verschillende gebruiksrechten, klik op het bijbehorende icoon/link.
Bestand 1	Open access
Bekijk URL

Organisatie	Hogeschool van Amsterdam

Gepubliceerd in	2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Bucharest, Romania, ROM
Jaar	2023
Type	Conferentiebijdrage
Taal	Engels

Uncovering Bias in ASR Systems

Uncovering Bias in ASR Systems

Samenvatting

Misschien ook interessant voor jou?

Expecting the unexpected

Confronting bias in the online representation of pregnancy

MIMU PDR with bias estimation using an optimization-based approach.