Deep Learning Based Speech Quality Prediction

CHF 137.25
Auf Lager
SKU
9T073H7B8GG
Stock 1 Verfügbar
Geliefert zwischen Mi., 26.11.2025 und Do., 27.11.2025

Details

This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitability of different deep learning architectures for this task. The author then shows how the resulting model outperforms traditional speech quality models and provides additional information about the cause of a quality impairment through the prediction of the speech quality dimensions of noisiness, coloration, discontinuity, and loudness.


Presents how to apply deep learning methods for the task of speech quality prediction Includes a model that outperforms traditional speech quality models Presents an in-depth analysis and comparison of different deep learning

Autorentext
Gabriel Mittag received his B.Sc. and M.Sc. degree in electrical and electronic engineering at the Technische Universität Berlin. During his master's degree he spent two semesters at the RMIT University in Melbourne, Australia and focused primarily on biomedical and speech signal processing. From 2016 he was employed as research assistant at the Quality and Usability Lab at the TU Berlin, where he finished his PhD on the machine learning based prediction of speech quality. In May 2021, Gabriel Mittag started as Machine Learning Scientist at Microsoft in Redmond, WA, USA.

Inhalt

  1. Introduction.- 2. Quality Assessment of Transmitted Speech.- 3. Neural Network Architectures for Speech Quality Prediction.- 4. Double-Ended Speech Quality Prediction Using Siamese Networks.- 5. Prediction of Speech Quality Dimensions With Multi-Task Learning.- 6. Bias-Aware Loss for Training From Multiple Datasets.- 7. NISQA A Single-Ended Speech Quality Model.- 8. Conclusions.- A. Dataset Condition Tables.- B. Train and Validation Dataset Dimension Histograms.- References.

Weitere Informationen

  • Allgemeine Informationen
    • GTIN 09783030914813
    • Lesemotiv Verstehen
    • Genre Electrical Engineering
    • Auflage 1st edition 2022
    • Sprache Englisch
    • Anzahl Seiten 180
    • Herausgeber Springer International Publishing
    • Größe H235mm x B155mm x T11mm
    • Jahr 2023
    • EAN 9783030914813
    • Format Kartonierter Einband
    • ISBN 303091481X
    • Veröffentlichung 26.02.2023
    • Titel Deep Learning Based Speech Quality Prediction
    • Autor Gabriel Mittag
    • Untertitel T-Labs Series in Telecommunication Services
    • Gewicht 283g

Bewertungen

Schreiben Sie eine Bewertung
Nur registrierte Benutzer können Bewertungen schreiben. Bitte loggen Sie sich ein oder erstellen Sie ein Konto.
Made with ♥ in Switzerland | ©2025 Avento by Gametime AG
Gametime AG | Hohlstrasse 216 | 8004 Zürich | Schweiz | UID: CHE-112.967.470