Voice Modeling Methods

CHF 107.95
Auf Lager
SKU
7IFH4MO650G
Stock 1 Verfügbar
Geliefert zwischen Fr., 27.02.2026 und Mo., 02.03.2026

Details

Building a voice model means to capture the characteristics of a speaker's voice in a data structure. This data structure is then used by a computer for further processing, such as comparison with other voices. Voice modeling is a vital step in the process of automatic speaker recognition that itself is the foundation of several applied technologies: (a) biometric authentication, (b) speech recognition and (c) multimedia indexing. Current automatic speaker recognition works well under relatively constrained circumstances, such as studio recordings, or when prior knowledge on the number and identity of occurring speakers is available. Under more adverse conditions, such as in feature films or amateur material on the web, the achieved speaker recognition scores drop below a rate that is acceptable for an end user or for further processing. In this book, algorithmic and methodic improvements to the state of the art in automatic speaker recognition are presented. They are accompanied by a capacious software toolkit called "sclib". Additionally, the method of "Eidetic Design" facilitates intuitive algorithm design, development and teaching.

Autorentext

Thilo was born in Lemgo/Germany in the 1980's and still loves themusic of this time. Maybe this is why he choose to analyzeacoustic data in his doctoral studies? When he's not playingmusic or doing research and development, he's probably exploringsome new fun sport together with his wife or thinks about god andhis world.


Klappentext

Building a voice model means to capture the characteristics of a speaker's voice in a data structure. This data structure is then used by a computer for further processing, such as comparison with other voices. Voice modeling is a vital step in the process of automatic speaker recognition that itself is the foundation of several applied technologies: (a) biometric authentication, (b) speech recognition and (c) multimedia indexing. Current automatic speaker recognition works well under relatively constrained circumstances, such as studio recordings, or when prior knowledge on the number and identity of occurring speakers is available. Under more adverse conditions, such as in feature films or amateur material on the web, the achieved speaker recognition scores drop below a rate that is acceptable for an end user or for further processing. In this book, algorithmic and methodic improvements to the state of the art in automatic speaker recognition are presented. They are accompanied by a capacious software toolkit called "sclib". Additionally, the method of "Eidetic Design" facilitates intuitive algorithm design, development and teaching.

Weitere Informationen

  • Allgemeine Informationen
    • GTIN 09783838116327
    • Sprache Englisch
    • Größe H220mm x B150mm x T15mm
    • Jahr 2015
    • EAN 9783838116327
    • Format Kartonierter Einband
    • ISBN 3838116321
    • Veröffentlichung 28.10.2015
    • Titel Voice Modeling Methods
    • Autor Thilo Stadelmann
    • Untertitel for Automatic Speaker Recognition
    • Gewicht 375g
    • Herausgeber Südwestdeutscher Verlag für Hochschulschriften AG Co. KG
    • Anzahl Seiten 240
    • Genre Informatik

Bewertungen

Schreiben Sie eine Bewertung
Nur registrierte Benutzer können Bewertungen schreiben. Bitte loggen Sie sich ein oder erstellen Sie ein Konto.
Made with ♥ in Switzerland | ©2025 Avento by Gametime AG
Gametime AG | Hohlstrasse 216 | 8004 Zürich | Schweiz | UID: CHE-112.967.470
Kundenservice: customerservice@avento.shop | Tel: +41 44 248 38 38