Looking into the past: Power spectral representation of periodic signals, sampling theories and fundamental frequency estimation for remaking speech
Hideki Kawahara
A Tutorial on How to Construct and Improve Automatic Pronunciation Proficiency Evaluation System - take PSC test as an example
Yu Hu, Si Wei, Guoping Hu
Speech-To-Speech Translation Technologies for Real-World Applications
Yuqing Gao
What Can Speech Researchers Bring to Music Processing?
Shigeki
Speech and Search: Bridging The Gap
Vincent Vanhoucke
Towards Robust Speech Recognition: Structured Modeling, Irrelevant Variability Normalization and Unsupervised Online Adaptation
Qiang Huo
Simultaneous Phrasing, Prosody, and Acoustic Model Training for Text-to-Speech Conversion
Keiichiro Oura, Yoshihiko Nankaku, Tomoki Toda, Keiichi Tokuda, Rannierry Maia, Shinsuke Sakai, Satoshi Nakamura
Cross-Stream Dependency Modeling for HMM-based Speech Synthesis
Zhen-Hua Ling, Wei Zhang, Ren-Hua Wang
Cross-Lingual Speaker Adaptation for HMM-based Speech Synthesis
Yi-Jian Wu, Simon King, Keiichi Tokuda
HMM-Based Mixed-Language (Mandarin-English) Speech Synthesis
Yao Qian, Hou-Wei Cao, Frank K. Soong
Improving HMM-based Speech Synthesis by Reducing Over-smoothing Problems
Meng Zhang, Jian-Hua Tao, Hui-Bin Jia, Xia Wang
Pronunciation Space Models for Pronunciation Evaluation
Si Wei, Yi-Qian Pan, Guo-Ping Hu, Yu Hu, Ren-Hua Wang
Decision Fusion for Improving Mispronunciation Detection Using Language Transfer Knowledge and Phoneme-dependent Pronunciation Scoring
W. K. Lo, Alissa M. Harrison, Helen Meng, Lan Wang
Mandarin Learning Using Speech and Language Technologies: A Translation Game in The Travel Domain
Yu-Shi Xu, Stephanie Seneff
Word Order Correction for Language Transfer Using Relative Position Language Modeling
Chao-Hong Liu, Chung-Hsien Wu, Matthew Harris
Improving Automatic Evaluation of Mandarin Pronunciation with Speaker Adaptive Training (SAT) and MLLR Speaker Adaption
Chao Huang, Feng Zhang, Frank K. Soong
Automatic Assessment of Language Proficiency Through Shadowing
Dean Luo, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose
Improvements on Mel-frequency Cepstrum Minimum-mean-square-error Noise Suppressor for Robust Speech Recognition
Dong Yu, Li Deng, Jian Wu, Yi-Fan Gong, Alex Acero
Effect of Feature Smoothing for Robust Speech Recognition
Xiong Xiao, Eng Siong Chng, Hai-Zhou Li
Reference Eigen-environment and Speaker Weighting for Robust Speech Recognition
Yuan-Fu Liao, Hung-Hsiang Fang, Chih-Min Yang
Evaluation of A Feature Compensation Approach Using High-order Vector Taylor Series Approximation of An Explicit Distortion Model on Aurora2, Aurora3, and Aurora4 Tasks
Jun Du, Qiang Huo, Yu Hu
Deriving MFCC Parameters from The Dynamic Spectrum for Robust Speech Recognition
Neng-Heng Zheng, Xia Li, Hou-Wei Cao, Tan Lee, P. C. Ching
Discriminative Output Coding Features for Speech Recognition
Omid Dehzangi, Bin Ma, Eng Siong Chng, Hai-Zhou Li
Double Gauss Based Unsupervised Score Normalization in Speaker Verification
Wu Guo, Li-Rong Dai, Ren-Hua Wang
Discriminative Feedback Adaptation for GMM-UBM Speaker Verification
Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang
Using Pseudo-key for Language Recognition System Design
Han-Wu Sun, Bin Ma, Hai-Zhou Li
Self-organized Clustering for Feature Mapping in Language Recognition
Chang-Huai You, Kong-Aik Lee, Bin Ma, Hai-Zhou Li
An Efficient Feature Selection Method for Speaker Recognition
Han-Wu Sun, Bin Ma, Hai-Zhou Li
PLSA Based Topic Mixture Language Modeling Approach
Shuan-Hu Bai, Hai-Zhou Li
The Improved TS-base Approaches with Interference Compensation and Their Evaluations for Speech Enhancement
Jun-Feng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yoiti Suzuki
Pitch Tracking for Model-based Speech Separation
S. W. Lee, Frank K. Soong, P. C. Ching, Tan Lee
Improved Linear Discriminant Analysis Considering Empirical Pairwise Classification Error Rates
Hung-Shin Lee, Berlin Chen
Citybrowser II: A Multimodal Restaurant Guide in Mandarin
Jing-Jing Liu, Yu-Shi Xu, Stephanie Seneff, Victor Zue
Evaluation and Analysis of Minimum Phone Error Training and Its Modified Versions for Large Vocabulary Mandarin Speech Recognition
Yung-Jen Cheng, Che-Kuang Lin, Lin-Shan Lee
A Two-stage Algorithm for Multi-speaker Identification System
Yong Guan, Wen-Ju Liu
What's in The F0 of Mandarin Speech--Tones, Intonation and Beyond
Chiu-Yu Tseng, Zhao-Yu Su
A Perceptual Study of Approximated Cantonese Tone Contours
Yu-Jia Li, Tan Lee
A New Prosodic Strength Calculation Method for Prosody Reduction Modeling
Hong-Lei Cong, Zhi-Yong Wu, Lian-Hong Cai, Helen M. Meng
Prosody Study with Context-dependent Acoustic Models
Yue-Ning Hu, Min Chu
Intonational Prominence of "SHI...(DE)" Construction in Standard Chinese
Yuan Jia, Ai-Jun Li, Zi-Yu Xiong
Entropy-based Analysis of The Prosodic Features of Chinese Dialects
Raymond W. M. Ng, Tan Lee
Frequency Modulation Technique for Prosodic Modification
Jin-Fu Ni, Shinsuke Sakai, Tohru Shimizu, Satoshi Nakamura
Modeling and Generating Tone Contour with Phrase Intonation for Chinese Mandarin Speech
Zhizheng Wu, Yao Qian, Frank K. Soong
A Three-stage Text Normalization Strategy For Mandarin Text-to-speech Systems
Tao Zhou, Yuan Dong, De-Zhi Huang, Wu Liu, Hai-La Wang
Multi-Layer F0 Modeling For HMM-Based Speech Synthesis
Cheng-Cheng Wang, Zhen-Hua Ling, Bu-Fan Zhang, Li-Rong Dai
Predicting Spectral and Prosodic Parameters for Unit Selection in Speech Synthesis
Ming-Hui Dong, Hai-Zhou Li
Heteronym Verification for Mandarin Speech Synthesis
Heng Lu, Zhen-Hua Ling, Si Wei, Yu Hu, Li-Rong Dai, Ren-Hua Wang
Investigation on Adaptation Using Different Discriminative Training Criteria Based Linear Regression and MAP
Bo Zhu, Zhi-Jie Yan, Yu Hu, Zhi-Guo Wang, Li-Rong Dai, Ren-Hua Wang
Utilization of Huge Written Text Corpora for Conversational Speech Recognition
Xin-Hui Hu, Hirofumi Yamamoto, Jin-Song Zhang, Keiji Yasuda, You-Zheng Wu, Hideki Kashioka
Position Information for Language Modeling in Speech Recognition
Hsuan-Sheng Chiu, Guan-Yu Chen, Chun-Jen Lee, Berlin Chen
An Investigation of Phonological Feature Systems Used in Detection-based ASR
I-Fan Chen, Hsin-Min Wang
An HMM Compensation Approach for Dynamic Features Using Unscented Transformation and Its Application to Noisy Speech Recognition
Yu Hu, Qiang Huo
Mandarin Language Understanding in Dialogue Context
Yu-Shi Xu, Jing-Jing Liu, Stephanie Seneff
Pronunciation Error Detection for Computer Assisted Pronunciation Teaching in Mandarin
Min-Siong Liang, Ren-Yuan Lyu, Yuang-Chin Chiang, Jing-Fung Chen
A Two-stage Multi-feature Integration Approach to Unsupervised Speaker Change Detection in Real-time News Broadcasting
Lei Xie, Guang-Sen Wang
Automatic Prosody Boundary Labeling of Mandarin Using Both Text and Acoustic Information
Chong-Jia Ni, Wen-Ju Liu, Bo Xu
Subword Latent Semantic Analysis for TextTiling-based Automatic Story Segmentation of Chinese Broadcast News
Yu-Lian Yang, Lei Xie
Multipitch Detection Based on Weighted Summary Correlogram
Xue-Liang Zhang, Wen-Ju Liu, Peng Li, Bo Xu
Efficient System Combination for Syllable-confusion-network-based Chinese Spoken Term Detection
Jie Gao, Jian Shao, Qing-Wei Zhao, Yong-Hong Yan
The Use of Dynamic Deformable Templates for Lip Tracking in An Audio-visual Corpus with Large Variations in Head Pose, Face Illumination and Lip Shapes
Zhi-Yong Wu, Ji-Ying Wu, Helen M. Meng
Microphone Array Post-filter Based on Auditory Filtering
Peng Li, Feng-Chai Liao, Ning Cheng, Bo Xu, Wen-Ju Liu
Exploring Tone Variations in Chinese Dialects Using Context Dependent Tone Models
Wei Guo, Min Chu
A Trellis Based Fast Lattice Generating Algorithm
Wei Li, Ji Wu, Zhi-Guo Wang
Order Adaptation of The Fractional Fourier Transform Using The Intraframe Pitch Change Rate for Speech Recognition
Hui Yin, Climent Nadeu, Volker Hohmann, Xiang Xie, Jing-Ming Kuang
Large Vocabulary Continuous Speech Recognition in Uyghur: Data Preparation and Experimental Results
Nasirjan Tursun, Wushour Silamu
An Improvement for Training Efficiency of Semi-tied Covariance
Si-Bao Chen, Yu Hu, Bin Luo, Ren-Hua Wang
Improved Semi-parametric Mean Trajectory Model Using Discriminatively Trained Centroids
Ran Xu, Jie-Lin Pan, Yong-Hong Yan
Local Mismatch Phone for Confidence Measure in Standard and Accented Chinese Speech Recognition
Wen-Xiao Cao, Yi Liu, Fang Zheng
A Combined Task Analysis Method for Data Selection in Mandarin Isolated Word Recognition System
Zhi-Yang He, Zhi-Guo Wang, Wei Li, Ji Wu
Mandarin Speech Recognition For Nonnative Speakers Based on Pronunciation Dictionary Adaption
Jian Yang, Pei-Shan Wu, Dan Xu
A New Similarity Measure Between HMMs
Yih-Ru Wang
Recognition of Syllable-contracted Words in Spontaneous Speech Using Word Expansion and Duration Information
Wei-Bin Liang, Chung-Hsien Wu, Yu-Kai Kang
Exploiting Non-target Region Information for Confidence Measure Based on Bayesian Information Criterion
Cong Liu, Yu Hu, Xiong-Guo Lei, Zhi-Guo Wang, Li-Rong Dai, Ren-Hua Wang
Simplified Deformation Compensation for Emotional Speaker Recognition
Ying-Chun Yang, Tian Wu, Hong-Bin Lv
Interfusing The Confused Region Score of Speaker Verification Systems
Yan-Hua Long, Wu Guo, Li-Rong Dai
Parallel Phone Recognizer Based MLLR Speaker Recognition
Eryu Wang, Wu Guo, Li-Rong Dai
Eigenchannel Compensation and Symmetric Score for A Robust Text-independent Speaker Verification
Yuan Dong, Jian Zhao, Xian-Yu Zhao, Liang Lu, Ji-Qing Liu, Hai-La Wang
A Sample and Feature Selection Scheme for GMM-SVM Based Language Recognition
Yan Song, Li-Rong Dai
Speaker Recognition Using A Kind of Novel Phonotactic Information
Xiang Zhang, Xiang Xiao, Hai-Peng Wang, Hong-Bin Suo, Qing-Wei Zhao, Yong-Hong Yan
The Adaptation Schemes in PR-SVM Based Language Recognition
Bing Xu, Yan Song, Li-Rong Dai
Mandarin Tone Perception with Temporal Envelope and Periodicity Cues from Different Frequency Regions
Meng Yuan, Tan Lee, Sigfrid D. Soli
Prosodic Variation in Cantonese-English Code-mixed Speech
Wen-Tao Gu, Tan Lee, P. C. Ching
Word Alignment Based on Multi-grain Model
Yan-Qing He, Yu Zhou, Cheng-Qing Zong
Word Reordering Alignment for Combination of Statistical Machine Translation Systems
Mao-Xi Li, Cheng-Qing Zong
An EMD Based Approach to Transliteration Unit Alignment Between English and Chinese
Mu-Yun Yang, Shu-Jie Liu, Sheng Li, Ju-Feng Li, Tie-Jun Zhao, Hao-Liang Qi
Analysis and Modeling of Affective Audio Visual Speech Based on PAD Emotion Space
Shen Zhang, Ying-Jin Xu, Jia Jia, Lian-Hong Cai
Noise Reduction Based on Random Matrix Theory
Xu-Gang Lu, S. Matsuda, T. Shimizu, S. Nakamura
Language Model Adaptation for Relevance Feedback in Information Retrieval
Ying-Lang Chang, Jen-Tzung Chien
Predicting and Tagging Dialog-act Using MDP and SVM
Ke-Yan Zhou, Cheng-Qing Zong, Hua Wu, Hai-Feng Wang
A Synchronous Method for Automatic Scoring of Language Learning
Bin Dong, Yong-Hong Yan
Using Reference to Tune Language Model for Detection of Reading Miscues
Chang-Liang Liu, Fu-Ping Pan, Feng-Pei Ge, Bin Dong, Yong-Hong Yan
How Syllables Group in Chinese
Mao-Lin Wang, Yi Xu
Prosodic Modeling for Isolated Mandarin Words and Its Application
Hung-Kuang Shih, Chen-Yu Chiang, Yih-Ru Wang, Sin-Horng Chen
A CSI and Rate-Distortion Based Packet Loss Recovery Algorithm for VoIP
Zhong-Bo Li, Sheng-Hui Zhao, Jing Wang, Jing-Ming Kuang
Mandarin Stops Classification Based on Random Forest Approach
Chi-Yueh Lin, Hsiao-Chuan Wang
A Pitch Synchronous Method for Speech Modification
Chih-Ting Kuo, Hsiao-Chuan Wang
Speech Database Compacted for An Embedded Mandarin TTS System
Qing Guo, Bin Wang, Nobuyuki Katae
Prosody Modification on Mixed-language Speech Synthesis
Yi Zhang, Jian-Hua Tao
A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin
Fang-Zhou Liu, Hui-Bin Jia, Jian-Hua Tao
Tone Evaluation of Chinese Continuous Speech Based on Prosodic Words
Yi-Qian Pan, Si Wei, Ren-Hua Wang
The Pitch Analysis of Imperative Sentences in Standard Chinese
Jia Sun, Ji-Lun Lu, Ai-Jun Li, Yuan Jia