Month/Year of Graduation
5-2024
Degree Name
Bachelor of Science (B.S.)
Department
Computer Science
First Advisor
Harvey Siy, Ph.D.
Abstract
Speech is a complex field, and remarkably difficult to master. There exist many groups who could use more tools to help improve their pronunciation of English words. The goal of this project was to create a text-to-speech animation program as an addition to our group capstone, Pronunciation Pal. Pronunciation Pal is a web-based application meant to assist those who are deaf or hard of hearing in improving their pronunciation, likely as an addition to therapy with a speech pathologist. This application uses tools like diagrams and references to help users better understand how to pronounce a word. The goal of this extension was to dynamically generate a 3D facial animation for any word or set of phonemes. This is an addition to the features on Pronunciation Pal that could be helpful for users, especially to the deaf. The project was successful, generating a smooth and reasonably accurate animation for most words, with a virtual face appearing to enunciate the sounds in the given word. It’s able to be hosted on our SvelteKit web application and run with minimal resource utilization entirely on the client-side.
Recommended Citation
Tomcak, Colin, "Text-to-Speech Animation: Generating Visuals Based on Phonetic Spelling" (2024). Theses/Capstones/Creative Projects. 302.
https://digitalcommons.unomaha.edu/university_honors_program/302
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.
Files over 3MB may be slow to open. For best results, right-click and select "save as..."