Wolfgang Minker kirjat ja teokset - sivu 2

Semi-Supervised and Unsupervised Machine Learning

Amparo Albalate; Wolfgang Minker

ISTE Ltd and John Wiley Sons Inc

2010

sidottu

This book provides a detailed and up-to-date overview on classification and data mining methods. The first part is focused on supervised classification algorithms and their applications, including recent research on the combination of classifiers. The second part deals with unsupervised data mining and knowledge discovery, with special attention to text mining. Discovering the underlying structure on a data set has been a key research topic associated to unsupervised techniques with multiple applications and challenges, from web-content mining to the inference of cancer subtypes in genomic microarray data. Among those, the book focuses on a new application for dialog systems which can be thereby made adaptable and portable to different domains. Clustering evaluation metrics and new approaches, such as the ensembles of clustering algorithms, are also described.

Vertaile hintoja

Bandwidth Extension of Speech Signals

Bernd Iser; Gerhard Schmidt; Wolfgang Minker

Springer-Verlag New York Inc.

2010

nidottu

Vertaile hintoja

Bandwidth Extension of Speech Signals describes the theory and methods for quality enhancement of clean speech signals and distorted speech signals such as those that have undergone a band limitation, for instance, in a telephone network. Problems and the respective solutions are discussed for the different approaches. The different approaches are evaluated and a real-time implementation of the most promising approach is presented. The book includes topics related to speech coding, pattern- / speech recognition, speech enhancement, statistics and digital signal processing in general.

Vertaile hintoja

Time-Domain Beamforming and Blind Source Separation

Julien Bourgeois; Wolfgang Minker

Springer-Verlag New York Inc.

2010

nidottu

Vertaile hintoja

The development of computer and telecommunication technologies led to a revolutioninthewaythatpeopleworkandcommunicatewitheachother.One of the results is that large amount of information will increasingly be held in a form that is natural for users, as speech in natural language. In the presented work, we investigate the speech signal capture problem, which includes the separation of multiple interfering speakers using microphone arrays. Adaptive beamforming is a classical approach which has been developed since the seventies. However it requires a double-talk detector (DTD) that interrupts the adaptation when the target is active, since otherwise target cancelation occurs. The fact that several speakers may be active simulta- ouslymakesthisdetectiondi?cult,andifadditionalbackgroundnoiseoccurs, even less reliable. Our proposed approaches address this separation problem using continuous, uninterrupted adaptive algorithms. The advantage seems twofold:Firstly,thealgorithmdevelopmentismuchsimplersincenodetection mechanism needs to be designed and no threshold is to be tuned. Secondly, the performance may be improved due to the adaptation during periods of double-talk. In the ?rst part of the book, we investigate a modi?cation of the widely usedNLMSalgorithm,termedImplicitLMS(ILMS),whichimplicitlyincludes an adaptation control and does not require any threshold. Experimental ev- uations reveal that ILMS mitigates the target signal cancelation substantially with the distributed microphone array. However, in the more di?cult case of the compact microphone array, this algorithm does not su?ciently reduce the target signal cancelation. In this case, more sophisticated blind source se- ration techniques (BSS) seem necessary.

Vertaile hintoja

Incorporating Knowledge Sources into Statistical Speech Recognition

Sakriani Sakti; Konstantin Markov; Satoshi Nakamura; Wolfgang Minker

Springer-Verlag New York Inc.

2010

nidottu

Vertaile hintoja

Incorporating Knowledge Sources into Statistical Speech Recognition addresses the problem of developing efficient automatic speech recognition (ASR) systems, which maintain a balance between utilizing a wide knowledge of speech variability, while keeping the training / recognition effort feasible and improving speech recognition performance. The book provides an efficient general framework to incorporate additional knowledge sources into state-of-the-art statistical ASR systems. It can be applied to many existing ASR problems with their respective model-based likelihood functions in flexible ways.

Vertaile hintoja

Proactive Spoken Dialogue Interaction in Multi-Party Environments

Petra-Maria Strauss; Wolfgang Minker

Springer-Verlag New York Inc.

2010

sidottu

Vertaile hintoja

Proactive Spoken Dialogue Interaction in Multi-Party Environments describes spoken dialogue systems that act as independent dialogue partners in the conversation with and between users. The resulting novel characteristics such as proactiveness and multi-party capabilities pose new challenges on the dialogue management component of such a system and require the use and administration of an extensive dialogue history. In order to assist the proactive spoken dialogue systems development, a comprehensive data collection seems mandatory and may be performed in a Wizard-of-Oz environment. Such an environment builds also the appropriate basis for an extensive usability and acceptance evaluation. Proactive Spoken Dialogue Interaction in Multi-Party Environments is a useful reference for students and researchers in speech processing.

Vertaile hintoja

Handling Emotions in Human-Computer Dialogues

Johannes Pittermann; Angela Pittermann; Wolfgang Minker

Springer

2009

sidottu

Vertaile hintoja

In this book, a novel approach that combines speech-based emotion recognition with adaptive human-computer dialogue modeling is described. With the robust recognition of emotions from speech signals as their goal, the authors analyze the effectiveness of using a plain emotion recognizer, a speech-emotion recognizer combining speech and emotion recognition, and multiple speech-emotion recognizers at the same time. The semi-stochastic dialogue model employed relates user emotion management to the corresponding dialogue interaction history and allows the device to adapt itself to the context, including altering the stylistic realization of its speech. This comprehensive volume begins by introducing spoken language dialogue systems and providing an overview of human emotions, theories, categorization and emotional speech. It moves on to cover the adaptive semi-stochastic dialogue model and the basic concepts of speech-emotion recognition. Finally, the authors show how speech-emotion recognizers can be optimized, and how an adaptive dialogue manager can be implemented. The book, with its novel methods to perform robust speech-based emotion recognition at low complexity, will be of interest to a variety of readers involved in human-computer interaction.

Vertaile hintoja

Time-Domain Beamforming and Blind Source Separation

Julien Bourgeois; Wolfgang Minker

Springer-Verlag New York Inc.

2009

sidottu

Vertaile hintoja

The development of computer and telecommunication technologies led to a revolutioninthewaythatpeopleworkandcommunicatewitheachother.One of the results is that large amount of information will increasingly be held in a form that is natural for users, as speech in natural language. In the presented work, we investigate the speech signal capture problem, which includes the separation of multiple interfering speakers using microphone arrays. Adaptive beamforming is a classical approach which has been developed since the seventies. However it requires a double-talk detector (DTD) that interrupts the adaptation when the target is active, since otherwise target cancelation occurs. The fact that several speakers may be active simulta- ouslymakesthisdetectiondi?cult,andifadditionalbackgroundnoiseoccurs, even less reliable. Our proposed approaches address this separation problem using continuous, uninterrupted adaptive algorithms. The advantage seems twofold:Firstly,thealgorithmdevelopmentismuchsimplersincenodetection mechanism needs to be designed and no threshold is to be tuned. Secondly, the performance may be improved due to the adaptation during periods of double-talk. In the ?rst part of the book, we investigate a modi?cation of the widely usedNLMSalgorithm,termedImplicitLMS(ILMS),whichimplicitlyincludes an adaptation control and does not require any threshold. Experimental ev- uations reveal that ILMS mitigates the target signal cancelation substantially with the distributed microphone array. However, in the more di?cult case of the compact microphone array, this algorithm does not su?ciently reduce the target signal cancelation. In this case, more sophisticated blind source se- ration techniques (BSS) seem necessary.

Vertaile hintoja

Incorporating Knowledge Sources into Statistical Speech Recognition

Sakriani Sakti; Konstantin Markov; Satoshi Nakamura; Wolfgang Minker

Springer-Verlag New York Inc.

2009

sidottu

Vertaile hintoja

Incorporating Knowledge Sources into Statistical Speech Recognition addresses the problem of developing efficient automatic speech recognition (ASR) systems, which maintain a balance between utilizing a wide knowledge of speech variability, while keeping the training / recognition effort feasible and improving speech recognition performance. The book provides an efficient general framework to incorporate additional knowledge sources into state-of-the-art statistical ASR systems. It can be applied to many existing ASR problems with their respective model-based likelihood functions in flexible ways.

Vertaile hintoja

Bandwidth Extension of Speech Signals

Bernd Iser; Gerhard Schmidt; Wolfgang Minker

Springer-Verlag New York Inc.

2008

sidottu

Vertaile hintoja

Bandwidth Extension of Speech Signals describes the theory and methods for quality enhancement of clean speech signals and distorted speech signals such as those that have undergone a band limitation, for instance, in a telephone network. Problems and the respective solutions are discussed for the different approaches. The different approaches are evaluated and a real-time implementation of the most promising approach is presented. The book includes topics related to speech coding, pattern- / speech recognition, speech enhancement, statistics and digital signal processing in general.

Vertaile hintoja

Speech and Human-Machine Dialog

Wolfgang Minker; Samir Bennacef

Springer-Verlag New York Inc.

2004

sidottu

Vertaile hintoja

Speech and Human-Machine Dialog focuses on the dialog management component of a spoken language dialog system. Spoken language dialog systems provide a natural interface between humans and computers. These systems are of special interest for interactive applications, and they integrate several technologies including speech recognition, natural language understanding, dialog management and speech synthesis. Due to the conjunction of several factors throughout the past few years, humans are significantly changing their behavior vis-à-vis machines. In particular, the use of speech technologies will become normal in the professional domain, and in everyday life. The performance of speech recognition components has also significantly improved. This book includes various examples that illustrate the different functionalities of the dialog model in a representative application for train travel information retrieval (train time tables, prices and ticket reservation). Speech and Human-Machine Dialog is designed for a professional audience, composed of researchers and practitioners in industry. This book is also suitable as a secondary text for graduate-level students in computer science and engineering.

Vertaile hintoja

Stochastically-Based Semantic Analysis

Wolfgang Minker; Alex Waibel; Joseph Mariani

Springer

1999

sidottu

Vertaile hintoja

Stochastically-Based Semantic Analysis investigates the problem of automatic natural language understanding in a spoken language dialog system. The focus is on the design of a stochastic parser and its evaluation with respect to a conventional rule-based method. Stochastically-Based Semantic Analysis will be of most interest to researchers in artificial intelligence, especially those in natural language processing, computational linguistics, and speech recognition. It will also appeal to practicing engineers who work in the area of interactive speech systems.

Vertaile hintoja