Privacy-Preserving Machine Learning for Speech Processing
Details
This thesis discusses the privacy issues in speech-based applications such as biometric authentication, surveillance, and external speech processing services. Author Manas A. Pathak presents solutions for privacy-preserving speech processing applications such as speaker verification, speaker identification and speech recognition. The author also introduces some of the tools from cryptography and machine learning and current techniques for improving the efficiency and scalability of the presented solutions. Experiments with prototype implementations of the solutions for execution time and accuracy on standardized speech datasets are also included in the text. Using the framework proposed may now make it possible for a surveillance agency to listen for a known terrorist without being able to hear conversation from non-targeted, innocent civilians.
Nominated as outstanding PhD thesis from Carnegie Mellon University Develops an efficient computational framework, making it possible to create speech processing applications such as voice biometrics, mining and speech recognition that are privacy-preserving Presents a technology solution, which would allow a user to utilize an IVR system without fear that the system could learn undesired information, such as gender or nationality, or be able to record and edit voice Includes supplementary material: sn.pub/extras
Autorentext
Dr. Manas A. Pathak received the BTech degree in computer science from Visvesvaraya National Institute of Technology, Nagpur, India, in 2006, and the MS and PhD degrees from the Language Technologies Institute at Carnegie Mellon University (CMU) in 2009 and 2012 respectively. He is currently working as a research scientist at Adchemy, Inc. His research interests include intersection of data privacy, machine learning, speech processing.
Klappentext
This thesis discusses the privacy issues in speech-based applications, including biometric authentication, surveillance, and external speech processing services. Manas A. Pathak presents solutions for privacy-preserving speech processing applications such as speaker verification, speaker identification, and speech recognition.
The thesis introduces tools from cryptography and machine learning and current techniques for improving the efficiency and scalability of the presented solutions, as well as experiments with prototype implementations of the solutions for execution time and accuracy on standardized speech datasets. Using the framework proposed may make it possible for a surveillance agency to listen for a known terrorist, without being able to hear conversation from non-targeted, innocent civilians.
Inhalt
Thesis Overview.- Speech Processing Background.- Privacy Background.- Overview of Speaker Verification with Privacy.- Privacy-Preserving Speaker Verification Using Gaussian Mixture Models.- Privacy-Preserving Speaker Verification as String Comparison.- Overview of Speaker Indentification with Privacy.- Privacy-Preserving Speaker Identification Using Gausian Mixture Models.- Privacy-Preserving Speaker Identification as String Comparison.- Overview of Speech Recognition with Privacy.- Privacy-Preserving Isolated-Word Recognition.- Thesis Conclusion.- Future Work.- Differentially Private Gaussian Mixture Models.
Weitere Informationen
- Allgemeine Informationen
- GTIN 09781461446385
- Genre Elektrotechnik
- Auflage 2013
- Sprache Englisch
- Lesemotiv Verstehen
- Anzahl Seiten 160
- Größe H241mm x B160mm x T12mm
- Jahr 2012
- EAN 9781461446385
- Format Fester Einband
- ISBN 1461446384
- Veröffentlichung 25.10.2012
- Titel Privacy-Preserving Machine Learning for Speech Processing
- Autor Manas A. Pathak
- Untertitel Springer Theses
- Gewicht 412g
- Herausgeber Springer New York