Speech Enhancement: Algorithm and Architecture
Details
This thesis presents algorithm and architecture for
simultaneous diagonalization of matrices. As an
example, a subspace-based speech enhancement problem
is considered, where in the covariance matrices of
the speech and noise are diagonalized simultaneously.
In order to compare the system performance of the
proposed algorithm, objective measurements of speech
enhancement is shown in terms of the signal to noise
ratio and mean bark spectral distortion at various
noise levels. In addition, an innovative subband
analysis technique for subspace-based time-domain
constrained speech enhancement technique is proposed.
The proposed technique analyses the signal in its
subbands to build accurate estimates of the
covariance matrices of speech and noise, exploiting
the inherent low varying characteristics of speech
and noise signals in narrow bands. The subband
approach also decreases the computation time by
reducing the order of the matrices to be
simultaneously diagonalized. Further, architecture is
proposed to implement the simultaneous
diagonalization scheme. The architecture is
implemented on a Xilinx FPGA. FPGA resource
utilization re-enforces on the practicability of the
design.
Autorentext
I graduated from University of Windsor with a Bachelor of Applied Science in Electrical Engineering in 2005. I earned my Magisteriate in Applied Science from the Department of E.C.E, Concordia University, under the guidance of Prof. M.N.S Swamy. Since then I am working at the Paradigm Advanced Research Center (P.A.R.C), Paradigm Electronics, Canada.
Klappentext
This thesis presents algorithm and architecture for simultaneous diagonalization of matrices. As an example, a subspace-based speech enhancement problem is considered, where in the covariance matrices of the speech and noise are diagonalized simultaneously. In order to compare the system performance of the proposed algorithm, objective measurements of speech enhancement is shown in terms of the signal to noise ratio and mean bark spectral distortion at various noise levels. In addition, an innovative subband analysis technique for subspace-based time-domain constrained speech enhancement technique is proposed. The proposed technique analyses the signal in its subbands to build accurate estimates of the covariance matrices of speech and noise, exploiting the inherent low varying characteristics of speech and noise signals in narrow bands. The subband approach also decreases the computation time by reducing the order of the matrices to be simultaneously diagonalized. Further, architecture is proposed to implement the simultaneous diagonalization scheme. The architecture is implemented on a Xilinx FPGA. FPGA resource utilization re-enforces on the practicability of the design.
Weitere Informationen
- Allgemeine Informationen
- GTIN 09783639098372
- Anzahl Seiten 152
- Genre Wärme- und Energietechnik
- Herausgeber VDM Verlag Dr. Müller e.K.
- Größe H220mm x B220mm
- Jahr 2013
- EAN 9783639098372
- Format Kartonierter Einband (Kt)
- ISBN 978-3-639-09837-2
- Titel Speech Enhancement: Algorithm and Architecture
- Autor Pavel Sinha
- Untertitel Algorithm and Architecture for Simultaneous Diagonalization of Matrices Applied to Subspace-Based Speech Enhancement
- Sprache Deutsch