Skip navigation
Please use this identifier to cite or link to this item: https://repository.esi-sba.dz/jspui/handle/123456789/851
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZEMOURI, OUassim-
dc.date.accessioned2026-06-23T11:51:08Z-
dc.date.available2026-06-23T11:51:08Z-
dc.date.issued2025-
dc.identifier.urihttps://repository.esi-sba.dz/jspui/handle/123456789/851-
dc.description.abstractThis thesis explores Automatic Speech Recognition (ASR) for Breton, a low-resource language with significant dialectal variation. We evaluate several ASR models including OpenAI’s Whisper models, focusing on Whisper-Large, across two datasets: Mozilla Common Voice 21 and La Banque Sonore des Dialectes Bretons (BSDB). Experiments were conducted with and without text cleaning, using Word Error Rate (WER) and Character Error Rate (CER) as evaluation metrics. Due to resource limitations, full fine-tuning of Whisper proved challenging, leading to the use of Parameter-Efficient Fine-Tuning (PEFT) methods such as LoRA. A finetuned Whisper-Small model was produced, demonstrating the effectiveness of PEFT for under-resourced languages. This work underlines the potential of modern ASR models and efficient adaptation techniques to improve speech recognition for Breton language and offers insights applicable to other low-resource languages.en_US
dc.language.isoenen_US
dc.subjectDeep Learningen_US
dc.subjectAutomatic Speech Recognitionen_US
dc.subjectBreton Languageen_US
dc.subjectModel Evaluationen_US
dc.subjectLow-resource Languagesen_US
dc.subjectData Cleaningen_US
dc.titleDevelopment of a Speech-to-Text (STT) System for the Breton Languageen_US
dc.typeThesisen_US
Appears in Collections:Master

Files in This Item:
File Description SizeFormat 
Zemouri_Ouassim_Thesis_Master-1-1.pdf84,35 kBAdobe PDFView/Open
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.