This book constitutes the thoroughly refereed proceedings of the 5th International Symposium on Chinese Spoken Language Processing, ISCSLP 2006, held in Singapore in December 2006, co-located with ICCPOL 2006, the 21st International Conference on Computer Processing of Oriental Languages. Coverage includes speech science, acoustic modeling for automatic speech recognition, speech data mining, and machine translation of speech.
Contents
Plenary.- Interactive Computer Aids for Acquiring Proficiency in Mandarin.- The Affective and Pragmatic Coding of Prosody.- Challenges in Machine Translation.- Automatic Indexing and Retrieval of Large Broadcast News Video Collections - The TRECVID Experience.
Tutorial.- An HMM-Based Approach to Flexible Speech Synthesis.- Text Information Extraction and Retrieval.
Topics in Speech Science.- Mechanisms of Question Intonation in Mandarin.- Comparison of Perceived Prosodic Boundaries and Global Characteristics of Voice Fundamental Frequency Contours in Mandarin Speech.- Linguistic Markings of Units in Spontaneous Mandarin.- Phonetic and Phonological Analysis of Focal Accents of Disyllabic Words in Standard Chinese.- Focus, Lexical Stress and Boundary Tone: Interaction of Three Prosodic Features.
Speech Analysis.- A Robust Voice Activity Detection Based on Noise Eigenspace Projection.- Pitch Mean Based Frequency Warping.- A Study of Knowledge-Based Features for Obstruent Detection and Classification in Continuous Mandarin Speech.- Speaker-and-Environment Change Detection in Broadcast News Using Maximum Divergence Common Component GMM.- UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection.- Design of Cubic Spline Wavelet for Open Set Speaker Classification in Marathi.
Speech Synthesis and Generation.- Rhythmic Organization of Mandarin Utterances - A Two-Stage Process.- Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification.- Prosodic Words Prediction from Lexicon Words with CRF and TBL Joint Method.- Prosodic Word Prediction Using a Maximum Entropy Approach.- Predicting Prosody from Text.- Nonlinear Emotional Prosody Generation and Annotation.- A Unified Framework for Text Analysis in Chinese TTS.- Speech Synthesis Based on a Physiological Articulatory Model.- An HMM-Based Mandarin Chinese Text-To-Speech System.- HMM-Based Emotional Speech Synthesis Using Average Emotion Model.- A Hakka Text-To-Speech System.
Speech Enhancement.- Adaptive Null-Forming Algorithm with Auditory Sub-bands.- Multi-channel Noise Reduction in Noisy Environments.
Acoustic Modeling for Automatic Speech Recognition.- Minimum Phone Error (MPE) Model and Feature Training on Mandarin Broadcast News Task.- State-Dependent Phoneme-Based Model Merging for Dialectal Chinese Speech Recognition.- Non-uniform Kernel Allocation Based Parsimonious HMM.- Consistent Modeling of the Static and Time-Derivative Cepstrums for Speech Recognition Using HSPTM.
Robust Speech Recognition.- Vector Autoregressive Model for Missing Feature Reconstruction.- Auditory Contrast Spectrum for Robust Speech Recognition.- Signal Trajectory Based Noise Compensation for Robust Speech Recognition.- An HMM Compensation Approach Using Unscented Transformation for Noisy Speech Recognition.- Noisy Speech Recognition Performance of Discriminative HMMs.- Distributed Speech Recognition of Mandarin Digits String.
Speech Adaptation/Normalization.- Unsupervised Speaker Adaptation Using Reference Speaker Weighting.- Automatic Construction of Regression Class Tree for MLLR Via Model-Based Hierarchical Clustering.
General Topics in Speech Recognition.- A Minimum Boundary Error Framework for Automatic Phonetic Segmentation.
Large Vocabulary Continuous Speech Recognition.- Advances in Mandarin Broadcast Speech Transcription at IBM Under the DARPA GALE Program.- Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-Based Consensus Networks.- All-Path Decoding Algorithm for Segmental Based Speech Recognition.- Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models.- On Using Entropy Information to Improve Posterior Probability-Based Confidence Measures.- Vietnamese Automatic Speech Recognition: The FLaVoR Approach.
Multilingual Recognition and Identification.- Language Identification by Using Syllable-Based Duration Classification on Code-Switching Speech.
Speaker Recognition and Characterization.- CCC Speaker Recognition Evaluation 2006: Overview, Methods, Data, Results and Perspective.- The IIR Submission to CSLP 2006 Speaker Recognition Evaluation.- A Novel Alternative Hypothesis Characterization Using Kernel Classifiers for LLR-Based Speaker Verification.- Speaker Verification Using Complementary Information from Vocal Source and Vocal Tract.- ISCSLP SR Evaluation, UVA-CS_es System Description. A System Based on ANNs.- Evaluation of EMD-Based Speaker Recognition Using ISCSLP2006 Chinese Speaker Recognition Evaluation Corpus.- Integrating Complementary Features with a Confidence Measure for Speaker Identification.- Discriminative Transformation for Sufficient Adaptation in Text-Independent Speaker Verification.- Fusion of Acoustic and Tokenization Features for Speaker Recognition.
Spoken Language Understanding.- Contextual Maximum Entropy Model for Edit Disfluency Detection of Spontaneous Speech.
Human Language Acquisition, Development and Learning.- Automatic Detection of Tone Mispronunciation in Mandarin.- Towards Automatic Tone Correction in Non-native Mandarin.
Spoken and Multimodal Dialog Systems.- A Corpus-Based Approach for Cooperative Response Generation in a Dialog System.- A Cantonese Speech-Driven Talking Face Using Translingual Audio-to-Visual Conversion.- The Implementation of Service Enabling with Spoken Language of a Multi-modal System Ozone.- Spoken Correction for Chinese Text Entry.
Speech Data Mining and Document Retrieval.- Extractive Chinese Spoken Document Summarization Using Probabilistic Ranking Models.- Meeting Segmentation Using Two-Layer Cascaded Subband Filters.- A Multi-layered Summarization System for Multi-media Archives by Understanding and Structuring of Chinese Spoken Documents.- Initial Experiments on Automatic Story Segmentation in Chinese Spoken Documents Using Lexical Cohesion of Extracted Named Entities.
Machine Translation of Speech.- Some Improvements in Phrase-Based Statistical Machine Translation.- Automatic Spoken Language Translation Template Acquisition Based on Boosting Structure Extraction and Alignment.
Spoken Language Resources and Annotation.- HKUST/MTS: A Very Large Scale Mandarin Telephone Speech Corpus.- The Paradigm for Creating Multi-lingual Text-To-Speech Voice Databases.- Multilingual Speech Corpora for TTS System Development.- Construct Trilingual Parallel Corpus on Demand.- The Contribution of Lexical Resources to Natural Language Processing of CJK Languages.- Multilingual Spoken Language Corpus Development for Communication Research.- Development of Multi-lingual Spoken Corpora of Indian Languages.