Evaluating Text-to-Speech and Audio Codec Performance for Voice Communication in Resource-Constrained Networks

Mekiker, Batuhan; Wittie, Mike P.

doi:10.1109/WIMOB61911.2024.10770530

dc.contributor.author	Mekiker, Batuhan
dc.contributor.author	Wittie, Mike P.
dc.date.accessioned	2025-11-12T18:44:12Z
dc.date.issued	2024-12
dc.description.abstract	Voice communications are valued for their ease of use and the rich information they provide, offering an immediate, clear, and efficient way to convey messages. However, ensuring the clarity and reliability of voice communications in low-bandwidth networks poses a technical challenge. This research explores the efficacy of Text-to-Speech (TTS) model and vocoder combinations versus traditional audio codecs in low-bandwidth networks, highlighting considerations for voice clarity and network resource management. In bandwidth-limited environments, traditional audio codecs often compromise audio quality and reliability. In contrast, TTS models, supported by advances in machine learning and deep learning, present a potential alternative. Through a methodical comparison using various evaluation metrics, the study aims to offer insights into the comparative impacts of the two approaches on audio quality and network behavior.
dc.identifier.citation	Mekiker, B., & Wittie, M. P. (2024, October). Evaluating Text-to-Speech and Audio Codec Performance for Voice Communication in Resource-Constrained Networks. In 2024 20th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob) (pp. 312-317). IEEE.
dc.identifier.doi	10.1109/WIMOB61911.2024.10770530
dc.identifier.issn	2160-4894
dc.identifier.uri	https://scholarworks.montana.edu/handle/1/19544
dc.language.iso	en_US
dc.publisher	IEEE
dc.rights	Copyright IEEE 2025
dc.rights.uri	https://www.ieee.org/publications/rights
dc.subject	TTS
dc.subject	text-to-speech
dc.subject	audio codecs
dc.subject	CLIP
dc.subject	voice communication
dc.subject	resource-constrained networks
dc.title	Evaluating Text-to-Speech and Audio Codec Performance for Voice Communication in Resource-Constrained Networks
dc.type	Article
mus.citation.extentfirstpage	1
mus.citation.extentlastpage	6
mus.citation.journaltitle	2024 20th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob)
mus.relation.college	College of Engineering
mus.relation.department	Computer Science
mus.relation.university	Montana State University - Bozeman

Files

Original bundle

Name: mekiker-text-to-speech-audio-codec-performance-2025.pdf
Size: 11.23 MB
Format: Adobe Portable Document Format

Collections

Scholarly Work - Computer Science