Speech Recognition

Speech Recognition

Chapters in the first part of the book cover all the essential speech processing techniques for building robust, automatic speech recognition systems: the representation for speech signals and the methods for speech-features extraction, acoustic and language modeling, efficient algorithms for searching the hypothesis space, and multimodal approaches to speech recognition.

Description
The last part of the book is devoted to other speech processing applications that can use the information from automatic speech recognition for speaker identification and tracking, for prosody modeling in emotion-detection systems and in other speech processing applications that are able to operate in real-world environments, like mobile communication services and smart homes.

Contents

  • A Family of Stereo-Based Stochastic Mapping Algorithms for Noisy Speech Recognition
  • Histogram Equalization for Robust Speech Recognition
  • Employment of Spectral Voicing Information for Speech and Speaker Recognition in Noisy Conditions
  • Time-Frequency Masking: Linking Blind Source Separation and Robust Speech Recognition
  • Dereverberation and Denoising Techniques for ASR Applications
  • Feature Transformation Based on Generalization of Linear Discriminant Analysis
  • Algorithms for Joint Evaluation of Multiple Speech Patterns for Automatic Speech Recognition
  • Overcoming HMM Time and Parameter Independence Assumptions for ASR
  • Practical Issues of Building Robust HMM Models Using HTK and SPHINX Systems
  • Statistical Language Modeling for Automatic Speech Recognition of Agglutinative Languages
  • Discovery of Words: towards a Computational Model of Language Acquisition
  • Automatic Speech Recognition via N-Best Rescoring using Logistic Regression
  • Knowledge Resources in Automatic Speech Recognition and Understanding for Romanian Language
  • Construction of a Noise-Robust Body-Conducted Speech Recognition System
  • Adaptive Decision Fusion for Audio-Visual Speech Recognition
  • Multi-Stream Asynchrony Modeling for Audio Visual Speech Recognition
  • Normalization and Transformation Techniques for Robust Speaker Recognition
  • Speaker Vector-Based Speaker Recognition with Phonetic Modeling
  • Novel Approaches to Speaker Clustering for Speaker Diarization in Audio Broadcast News Data
  • Gender Classification in Emotional Speech
  • Recognition of Paralinguistic Information using Prosodic Features Related to Intonation and Voice Quality
  • Psychological Motivated Multi-Stage Emotion Classification Exploiting Voice Quality Features
  • A Weighted Discrete KNN Method for Mandarin Speech and Emotion Recognition
  • Motion-Tracking and Speech Recognition for Hands-Free Mouse-Pointer Manipulation
  • Arabic Dialectical Speech Recognition in Mobile Communication Services
  • Ultimate Trends in Integrated Systems to Enhance Automatic Speech Recognition Performance
  • Speech Recognition for Smart Homes
  • Silicon Technologies for Speaker Independent Speech Processing and Recognition Systems in Noisy Environments
  • Voice Activated Appliances for Severely Disabled Persons
  • System Request Utterance Detection Based on Acoustic and Linguistic Features

Book Details

Author(s): France Mihelic and Janez Zibert.
Publisher: InTech
Format(s): PDF
File size: 35.28 MB
Number of pages: 550
Link: Download or read online.








Leave a Reply