Evaluating Text-to-Speech and Audio Codec Performance for Voice Communication in Resource-Constrained Networks
| dc.contributor.author | Mekiker, Batuhan | |
| dc.contributor.author | Wittie, Mike P. | |
| dc.date.accessioned | 2025-11-12T18:44:12Z | |
| dc.date.issued | 2024-12 | |
| dc.description.abstract | Voice communications are valued for their ease of use and the rich information they provide, offering an immediate, clear, and efficient way to convey messages. However, ensuring the clarity and reliability of voice communications in low-bandwidth networks poses a technical challenge. This research explores the efficacy of Text-to-Speech (TTS) models and vocoder combinations versus traditional audio codecs in low-bandwidth networks, highlighting considerations for voice clarity and network resource management. Traditional audio codecs in bandwidth-limited environments often compromise audio quality and reliability. On the contrary, TTS models, supported by the advancements in deep and machine learning, present a potential alternative. Through a methodical comparison using various evaluation metrics, the study aims to offer valuable insights into their comparative impacts on audio quality and network behavior. | |
| dc.identifier.citation | Mekiker, B., & Wittie, M. P. (2024, October). Evaluating Text-to-Speech and Audio Codec Performance for Voice Communication in Resource-Constrained Networks. In 2024 20th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob) (pp. 312-317). IEEE. | |
| dc.identifier.doi | 10.1109/WIMOB61911.2024.10770530 | |
| dc.identifier.issn | 2160-4894 | |
| dc.identifier.uri | https://scholarworks.montana.edu/handle/1/19544 | |
| dc.language.iso | en_US | |
| dc.publisher | IEEE | |
| dc.rights | Copyright IEEE 2025 | |
| dc.rights.uri | https://www.ieee.org/publications/rights | |
| dc.subject | TTS | |
| dc.subject | text-to-speech | |
| dc.subject | audio codecs | |
| dc.subject | CLIP | |
| dc.subject | voice communication | |
| dc.subject | resource-constrained networks | |
| dc.title | Evaluating Text-to-Speech and Audio Codec Performance for Voice Communication in Resource-Constrained Networks | |
| dc.type | Article | |
| mus.citation.extentfirstpage | 1 | |
| mus.citation.extentlastpage | 6 | |
| mus.citation.journaltitle | 2024 20th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob) | |
| mus.relation.college | College of Engineering | |
| mus.relation.department | Computer Science | |
| mus.relation.university | Montana State University - Bozeman |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- mekiker-text-to-speech-audio-codec-performance-2025.pdf
- Size:
- 11.23 MB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 825 B
- Format:
- Item-specific license agreed upon to submission
- Description: