Evaluating Text-to-Speech and Audio Codec Performance for Voice Communication in Resource-Constrained Networks

dc.contributor.authorMekiker, Batuhan
dc.contributor.authorWittie, Mike P.
dc.date.accessioned2025-11-12T18:44:12Z
dc.date.issued2024-12
dc.description.abstractVoice communications are valued for their ease of use and the rich information they provide, offering an immediate, clear, and efficient way to convey messages. However, ensuring the clarity and reliability of voice communications in low-bandwidth networks poses a technical challenge. This research explores the efficacy of Text-to-Speech (TTS) models and vocoder combinations versus traditional audio codecs in low-bandwidth networks, highlighting considerations for voice clarity and network resource management. Traditional audio codecs in bandwidth-limited environments often compromise audio quality and reliability. On the contrary, TTS models, supported by the advancements in deep and machine learning, present a potential alternative. Through a methodical comparison using various evaluation metrics, the study aims to offer valuable insights into their comparative impacts on audio quality and network behavior.
dc.identifier.citationMekiker, B., & Wittie, M. P. (2024, October). Evaluating Text-to-Speech and Audio Codec Performance for Voice Communication in Resource-Constrained Networks. In 2024 20th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob) (pp. 312-317). IEEE.
dc.identifier.doi10.1109/WIMOB61911.2024.10770530
dc.identifier.issn2160-4894
dc.identifier.urihttps://scholarworks.montana.edu/handle/1/19544
dc.language.isoen_US
dc.publisherIEEE
dc.rightsCopyright IEEE 2025
dc.rights.urihttps://www.ieee.org/publications/rights
dc.subjectTTS
dc.subjecttext-to-speech
dc.subjectaudio codecs
dc.subjectCLIP
dc.subjectvoice communication
dc.subjectresource-constrained networks
dc.titleEvaluating Text-to-Speech and Audio Codec Performance for Voice Communication in Resource-Constrained Networks
dc.typeArticle
mus.citation.extentfirstpage1
mus.citation.extentlastpage6
mus.citation.journaltitle2024 20th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob)
mus.relation.collegeCollege of Engineering
mus.relation.departmentComputer Science
mus.relation.universityMontana State University - Bozeman

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
mekiker-text-to-speech-audio-codec-performance-2025.pdf
Size:
11.23 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
825 B
Format:
Item-specific license agreed upon to submission
Description: