Quality of Synthetic Speech

CHF 141.15
Auf Lager
SKU
DG6KJL12G5A
Stock 1 Verfügbar
Geliefert zwischen Fr., 27.02.2026 und Mo., 02.03.2026

Details

This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.

Developes a set of five universal perceptual quality dimensions for TTS signals Introduces a test protocol for the assessment of the five dimensions in a listening test Investigates factors influencing the five perceptual quality dimensions Presents different approaches towards instrumental quality assessment of synthetic speech. Examines the integration of an instrumental quality assessment model into a TTS system for quality improvement Includes supplementary material: sn.pub/extras

Inhalt

Introduction.- Speech Synthesis.- Auditory and Instrumental Quality Evaluation Metrics.- Perceptual Quality Dimensions.- Influencing Factors on Perceptual Quality.- Instrumental Quality Assessment.- Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System.- Conclusions.

Weitere Informationen

  • Allgemeine Informationen
    • GTIN 09789811037337
    • Lesemotiv Verstehen
    • Genre Electrical Engineering
    • Auflage 1st edition 2017
    • Sprache Englisch
    • Anzahl Seiten 176
    • Herausgeber Springer Nature Singapore
    • Größe H241mm x B160mm x T16mm
    • Jahr 2017
    • EAN 9789811037337
    • Format Fester Einband
    • ISBN 9811037337
    • Veröffentlichung 18.04.2017
    • Titel Quality of Synthetic Speech
    • Autor Florian Hinterleitner
    • Untertitel Perceptual Dimensions, Influencing Factors, and Instrumental Assessment
    • Gewicht 436g

Bewertungen

Schreiben Sie eine Bewertung
Nur registrierte Benutzer können Bewertungen schreiben. Bitte loggen Sie sich ein oder erstellen Sie ein Konto.
Made with ♥ in Switzerland | ©2025 Avento by Gametime AG
Gametime AG | Hohlstrasse 216 | 8004 Zürich | Schweiz | UID: CHE-112.967.470
Kundenservice: customerservice@avento.shop | Tel: +41 44 248 38 38