Wir verwenden Cookies und Analyse-Tools, um die Nutzerfreundlichkeit der Internet-Seite zu verbessern und für Marketingzwecke. Wenn Sie fortfahren, diese Seite zu verwenden, nehmen wir an, dass Sie damit einverstanden sind. Zur Datenschutzerklärung.
Who spoke when?
Details
Speaker diarization is the process which detects active speakers and groups those speech signals which has been uttered by the same speaker. Generally we can find two main applications for speaker diarization. Automatic Speech Recognition systems make use of the speaker homogeneous clusters to adapt the acoustic models to be speaker dependent and therefore increase recognition performance. Speaker indexing and rich transcription systems also use the speaker diarization output as one of information extracted from a recording, which allow its automatic indexation and other further processing. In this study a speaker diarization application is developed using multiparty binaural speech recordings to track speaker activity based on interaural time difference (ITD) cues. These cues, for a given speech signal frame, are computed using gammatone filtering and cross-correlation technique. Their values are used to determine which speaker in the recording produce the considered speech fragment. This study has been supervised by Dr. Jon Barker, and defended to fulfill the requirements for the degree of Master in Advanced Computer Science, University of Sheffield, United Kingdom, 2007.
Autorentext
Maral Dadvar trabaja en el Grupo de Interacción con los Medios Humanos de la Universidad de Twente, en los Países Bajos, como investigador de doctorado. Desarrolló un interés en el procesamiento del lenguaje natural cuando implementó la diarización del hablante para su tesis de maestría. Maral tiene una maestría en ciencias informáticas avanzadas de la Universidad de Sheffield, Reino Unido.
Weitere Informationen
- Allgemeine Informationen
- GTIN 09783844386288
- Sprache Englisch
- Genre Anwendungs-Software
- Größe H220mm x B150mm x T5mm
- Jahr 2011
- EAN 9783844386288
- Format Kartonierter Einband
- ISBN 3844386289
- Veröffentlichung 01.07.2011
- Titel Who spoke when?
- Autor Maral Dadvar
- Untertitel Audio-based speaker location estimation for diarization
- Gewicht 119g
- Herausgeber LAP LAMBERT Academic Publishing
- Anzahl Seiten 68