This book discusses large margin and kernel methods for speech
and speaker recognition
Speech and Speaker Recognition: Large Margin and Kernel
Methods is a collation of research in the recent advances in
large margin and kernel methods, as applied to the field of speech
and speaker recognition. It presents theoretical and practical
foundations of these methods, from support vector machines to large
margin methods for structured learning. It also provides examples
of large margin based acoustic modelling for continuous speech
recognizers, where the grounds for practical large margin sequence
learning are set. Large margin methods for discriminative language
modelling and text independent speaker verification are also
addressed in this book.
Key Features:
* Provides an up-to-date snapshot of the current state of
research in this field
* Covers important aspects of extending the binary support vector
machine to speech and speaker recognition applications
* Discusses large margin and kernel method algorithms for
sequence prediction required for acoustic modeling
* Reviews past and present work on discriminative training of
language models, and describes different large margin algorithms
for the application of part-of-speech tagging
* Surveys recent work on the use of kernel approaches to
text-independent speaker verification, and introduces the main
concepts and algorithms
* Surveys recent work on kernel approaches to learning a
similarity matrix from data
This book will be of interest to researchers, practitioners,
engineers, and scientists in speech processing and machine learning
fields.
About the author
Dr Joseph Keshet, IDIAP, Switzerland
Dr Keshet received his B.Sc. and M.Sc. in electrical engineering from the Tel-Aviv University, Tel-Aviv, Israel, in 1994 and 2002, respectively. He got his Ph.D. from the Hebrew University of Jerusalem, Israel in 2007. From 1994 to 2002, he was with the Israeli Defense Forces (Intelligence Corps), where he was in charge of advanced research activities in the fields of speech coding. Since 2007, he is a research scientist in speech recognition at IDIAP Research Institute, Martigny, Switzerland.
Dr Samy Bengio, Google, California, US
Dr Bengio received his M.Sc. and Ph.D. degrees in Computer Science from University of Montreal in 1989 and 1993 respectively. Between 1999 and 2006, he was a senior researcher in statistical machine learning at IDIAP Research Institute, where he supervised Ph D students and postdoctoral fellows working on many areas of machine learning. He is the author/co-author of more than 160 international publications, including 30 journal papers. He has organized several international workshops (such as the MLMI series) and been in the organization committee of several well known conferences (such as NIPS). Since early 2007, he is a research scientist in machine learning at Google, in Mountain View, California.