Shvach Shprach: An analysis of automatic speech recognition on low-resource languages
dc.contributor.advisor | Waxman, Joshua | |
dc.contributor.author | Schachter, Racheli | |
dc.date.accessioned | 2024-05-30T14:00:09Z | |
dc.date.available | 2024-05-30T14:00:09Z | |
dc.date.issued | 2024-05-12 | |
dc.description | Undergraduate honors thesis / YU only | |
dc.description.abstract | Natural language processing (NLP) is a field of artificial intelligence that is focused on training computers to understand human language. By analyzing copious amounts of data, computers are trained to recognize patterns that can be represented numerically as vectors. Through this, computers are able to complete many language related tasks. One such task is automatic speech recognition (ASR), a process in which audio can be converted into text. Like all NLP tasks, the performance of an ASR model is heavily dependent on the quality of data the model is trained on. Whisper is an ASR model created by OpenAI. Though Whisper works very well when transcribing English audio, its accuracy plummets when transcribing languages that are less represented in the training data, also known as low-resource languages. For example, in its current state, Whisper is highly inaccurate when used to transcribe Torah lectures given in Yeshivish English, a sociolect of English spoken by American Orthodox Jews. • This paper sets out to provide the background knowledge necessary to understand how ASR models work, with a focus on OpenAI’s Whisper model. Additionally, it includes a thorough analysis of Whisper’s performance on low-resource languages through experimentation with transcriptions of Rabbi Aryeh Lebowitz’s “Ten Minute Halacha” lecture series. Finally, this paper explores different technologies and techniques that can be used to improve Whisper’s performance. | |
dc.description.sponsorship | Funded in part by the S. Daniel Abraham Honors Program | |
dc.identifier.citation | Schachter, R. (2024, May 12). Shvach Shprach: An analysis of automatic speech recognition on low-resource languages [Unpublished undergraduate honors thesis, Yeshiva University]. | |
dc.identifier.uri | https://hdl.handle.net/20.500.12202/10234 | |
dc.language.iso | en_US | |
dc.publisher | Yeshiva University, Stern College for Women | |
dc.relation.ispartofseries | S. Daniel Abraham Honors Student Theses; May 12, 2024 | |
dc.subject | automatic speech recognition (ASR) | |
dc.subject | OpenAI | |
dc.subject | Yeshivish English | |
dc.subject | Natural language processing (NLP) | |
dc.title | Shvach Shprach: An analysis of automatic speech recognition on low-resource languages | |
dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Schachter, Racheli Schachter - May 2024 Thesis.pdf
- Size:
- 2.85 MB
- Format:
- Adobe Portable Document Format