default search action
IEEE Transactions on Audio, Speech & Language Processing, Volume 15
Volume 15, Number 1, January 2007
- Paris Smaragdis:
Convolutive Speech Bases and Their Application to Supervised Speech Separation. 1-12 - Li Deng, Leo J. Lee, Hagai Attias, Alex Acero:
Adaptive Kalman Filtering and Smoothing for Tracking Vocal Tract Resonances Using a Continuous-Valued Hidden Dynamic Model. 13-23 - Ben Milner, Xu Shao:
Prediction of Fundamental Frequency and Voicing From Mel-Frequency Cepstral Coefficients for Unconstrained Speech Reconstruction. 24-33 - Patrick A. Naylor, Anastasis Kounoudes, Jón Guðnason, Mike Brookes:
Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm. 34-43 - Farshad Lahouti, Amir K. Khandani:
Soft Reconstruction of Speech in the Presence of Noise and Packet Loss. 44-56 - Sean A. Ramprashad:
Sparse Bit-Allocations Based on Partial Ordering Schemes With Application to Speech and Audio Coding. 57-69 - Taesu Kim, Hagai Thomas Attias, Soo-Young Lee, Te-Won Lee:
Blind Source Separation Exploiting Higher-Order Frequency Dependencies. 70-79 - Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi:
Harmonicity-Based Blind Dereverberation for Single-Channel Speech Signals. 80-95 - Bertrand Rivet, Laurent Girin, Christian Jutten:
Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures. 96-108 - Guangji Shi, Parham Aarabi, Hui Jiang:
Phase-Based Dual-Microphone Speech Enhancement Using A Prior Speech Model. 109-118 - Gwo-hwa Ju, Lin-Shan Lee:
A Perceptually Constrained GSVD-Based Approach for Enhancing Speech Corrupted by Colored Noise. 119-134 - Steven J. Rennie, Parham Aarabi, Brendan J. Frey:
Variational Probabilistic Speech Separation Using Microphone Arrays. 135-149 - Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics. 150-161 - Christian Raymond, Frédéric Béchet, Nathalie Camelin, Renato de Mori, Géraldine Damnati:
Sequential Decision Strategies for Machine Interpretation of Speech. 162-171 - Scott Axelrod, Vaibhava Goel, Ramesh A. Gopinath, Peder A. Olsen, Karthik Visweswariah:
Discriminative Estimation of Subspace Constrained Gaussian Mixture Models for Speech Recognition. 172-189 - Rajesh M. Hegde, Hema A. Murthy, Venkata Ramana Rao Gadde:
Significance of the Modified Group Delay Feature in Speech Recognition. 190-202 - Erik McDermott, Timothy J. Hazen, Jonathan Le Roux, Atsushi Nakamura, Shigeru Katagiri:
Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error. 203-223 - Satya Dharanipragada, Umit H. Yapanel, Bhaskar D. Rao:
Robust Feature Extraction for Continuous Speech Recognition Using the MVDR Spectrum Estimation Method. 224-234 - Michael L. Seltzer, Alex Acero:
Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition. 235-245 - Joe Frankel, Simon King:
Speech Recognition Using Linear Dynamic Models. 246-256 - Chia-Ping Chen, Jeff A. Bilmes:
MVA Processing of Speech Features. 257-270 - Haizhou Li, Bin Ma, Chin-Hui Lee:
A Vector Space Modeling Approach to Spoken Language Identification. 271-284 - Peter Day, Asoke K. Nandi:
Robust Text-Independent Speaker Verification Using Genetic Programming. 285-295 - Youngim Jung, Ae-sun Yoon, Hyuk-Chul Kwon:
Grapheme-to-Phoneme Conversion of Arabic Numeral Expressions for Embedded TTS Systems. 296-309 - Jan H. Plasberg, W. Bastiaan Kleijn:
The Sensitivity Matrix: Using Advanced Auditory Models in Speech and Audio Processing. 310-319 - Ixone Arroabarren, Alfonso Carlosena:
Voice Production Mechanisms of Vocal Vibrato in Male Singers. 320-332 - Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno:
Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression. 333-345 - Kishan Thambiratnam, Sridha Sridharan:
Rapid Yet Accurate Speech Indexing Using Dynamic Match Lattice Spotting. 346-357 - Paris Smaragdis, Petros Boufounos:
Position and Trajectory Learning for Microphone Arrays. 358-368
Volume 15, Number 2, February 2007
- Yannis Agiomyrgiannakis, Yannis Stylianou:
Conditional Vector Quantization for Speech Coding. 377-386 - Sorin Dusan, James L. Flanagan, Amod Karve, Mridul Balaraman:
Speech Compression by Polynomial Approximation. 387-395 - Guoning Hu, DeLiang Wang:
Auditory Segmentation Based on Onset and Offset Analysis. 396-405 - Richard C. Hendriks, Richard Heusdens, Jesper Jensen:
An MMSE Estimator for Speech Enhancement Under a Combined Stochastic-Deterministic Speech Model. 406-415 - Yoshifumi Nagata, Toyota Fujioka, Masato Abe:
Two-Dimensional DOA Estimation of Sound Sources Based on Weighted Wiener Gain Exploiting Two-Directional Microphones. 416-429 - Marc Delcroix, Takafumi Hikichi, Masato Miyoshi:
Precise Dereverberation Using Multichannel Linear Prediction. 430-440 - Sriram Srinivasan, Jonas Samuelsson, W. Bastiaan Kleijn:
Codebook-Based Bayesian Speech Enhancement for Nonstationary Environments. 441-452 - Rongqing Huang, John H. L. Hansen, Pongtep Angkititrakul:
Dialect/Accent Classification Using Unrestricted Audio. 453-464 - Murat Akbacak, John H. L. Hansen:
Environmental Sniffing: Noise Knowledge Estimation for Robust Speech Systems. 465-477 - Jian Wu, Qiang Huo:
A Study of Minimum Classification Error (MCE) Linear Regression for Supervised Adaptation of MCE-Trained Continuous-Density Hidden Markov Models. 478-488 - Paul D. Teal:
Tracking Wide-Band Targets Having Significant Doppler Shift. 489-497 - Pongtep Angkititrakul, John H. L. Hansen:
Discriminative In-Set/Out-of-Set Speaker Recognition. 498-508 - Darko Kirovski, Zeph Landau:
Generalized Lempel-Ziv Compression for Audio. 509-518 - Tin Lay Nwe, Haizhou Li:
Exploring Vibrato-Motivated Acoustic Features for Singer Identification. 519-530 - Nicola Laurenti, Giovanni De Poli, Daniele Montagner:
A Nonlinear Method for Stochastic Spectrum Estimation in the Modeling of Musical Sounds. 531-541 - Sunil Bharitkar, Chris Kyriakakis:
Visualization of Multiple Listener Room Acoustic Equalization With the Sammon Map. 542-551 - Damian T. Murphy, Mark Beeson:
The KW-Boundary Hybrid Digital Waveguide Mesh for Room Acoustics Applications. 552-564 - Ramani Duraiswami, Dmitry N. Zotkin, Nail A. Gumerov:
Fast Evaluation of the Room Transfer Function Using Multipole Expansion. 565-576 - Jack Mullen, David M. Howard, Damian T. Murphy:
Real-Time Dynamic Articulations in the 2-D Waveguide Mesh Vocal Tract Model. 577-585 - Xu Sun, Sen M. Kuo:
Active Narrowband Noise Control Systems Using Cascading Adaptive Filters. 586-592 - Muhammad Tahir Akhtar, Masahide Abe, Masayuki Kawamata:
On Active Noise Control Systems With Online Acoustic Feedback Path Modeling. 593-600 - Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez, Iain McCowan:
Audiovisual Probabilistic Tracking of Multiple Speakers in Meetings. 601-616 - Simon Doclo, Marc Moonen:
Superdirective Beamforming Robust Against Microphone Mismatch. 617-631 - Chang-Heon Lee, Sung-Kyo Jung, Hong-Goo Kang:
Applying a Speaker-Dependent Speech Compression Technique to Concatenative TTS Synthesizers. 632-640 - K.-S. Lee:
Statistical Approach for Voice Personality Transformation. 641-651 - Xiaodong Cui, Abeer Alwan:
Robust Speaker Adaptation by Weighted Model Averaging Based on the Minimum Description Length Criterion. 652-660 - M.-Y. Tsai, F.-C. Chou, L.-S. Lee:
Pronunciation Modeling With Reduced Confusion for Mandarin Chinese Using a Three-Stage Framework. 661-675 - Qin Yan, Saeed Vaseghi, Dimitrios Rentzos, Ching-Hsiang Ho:
Analysis and Synthesis of Formant Spaces of British, Australian, and American Accents. 676-689 - Dagen Wang, Shrikanth S. Narayanan:
An Acoustic Measure for Word Prominence in Spontaneous Speech. 690-701 - Zhiyun Li, Ramani Duraiswami:
Flexible and Optimal Design of Spherical Microphone Arrays for Beamforming. 702-714 - Mirko Knaak, Shoko Araki, Shoji Makino:
Geometrically Constrained Independent Component Analysis. 715-726 - I. Balmages, Boaz Rafaely:
Open-Sphere Designs for Spherical Microphone Arrays. 727-732 - Peter Jancovic:
Fast Algorithm for Calculation of the Union-Based Probability. 732-734 - Young-Ik Kim, Rhee Man Kil:
Estimation of Interaural Time Differences Based on Zero-Crossings in Noisy Multisource Environments. 734-743
Volume 15, Number 3, March 2007
- Pradeepa Yahampath, Paul Rondeau:
Multiple-Description Predictive-Vector Quantization With Applications to Low Bit-Rate Speech Coding Over Networks. 749-755 - Ethan Robert Duni, Bhaskar D. Rao:
High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models. 756-769 - Ethan Robert Duni, Bhaskar D. Rao:
A High-Rate Optimal Transform Coder With Gaussian Mixture Companders. 770-783 - Brian Kan-Wing Mak, Roger Wend-Huu Hsiao:
Kernel Eigenspace-Based MLLR Adaptation. 784-795 - Bertrand Rivet, Laurent Girin, Christian Jutten:
Log-Rayleigh Distribution: A Simple and Efficient Statistical Representation of Log-Spectral Coefficients. 796-802 - Patricia Scanlon, Daniel P. W. Ellis, Richard B. Reilly:
Using Broad Phonetic Group Experts for Improved Speech Recognition. 803-812 - Barbara Resch, Mattias Nilsson, L. Anders Ekman, W. Bastiaan Kleijn:
Estimation of the Instantaneous Pitch of Speech. 813-822 - Francesco Gianfelici, Giorgio Biagetti, Paolo Crippa, Claudio Turchetti:
Multicomponent AM-FM Representations: An Asymptotically Exact Approach. 823-837 - Dima Ruinskiy, Yizhar Lavner:
An Effective Algorithm for Automatic Detection and Exact Demarcation of Breath Sounds in Speech and Song Signals. 838-850 - Laurent Girin, Mohammad Firouzmand, Sylvain Marchand:
Perceptual Long-Term Variable-Rate Sinusoidal Modeling of Speech. 851-861 - Jesper Jensen, Richard Heusdens:
Improved Subspace-Based Single-Channel Speech Enhancement Using Generalized Super-Gaussian Priors. 862-872 - Juho Kontio, Laura Laaksonen, Paavo Alku:
Neural Network-Based Artificial Bandwidth Expansion of Speech. 873-881 - David Yuheng Zhao, W. Bastiaan Kleijn:
HMM-Based Gain Modeling for Enhancement of Speech in Noise. 882-892 - M. Khademul Islam Molla, Keikichi Hirose:
Single-Mixture Audio Source Separation by Subspace Decomposition of Hilbert Spectrum. 893-900 - Karsten Vandborg Sørensen, Søren Vang Andersen:
Rayleigh Mixture Model-Based Hidden Markov Modeling and Estimation of Noise in Noisy Speech Signals. 901-917 - Richard C. Hendriks, Rainer Martin:
MAP Estimators for Speech Enhancement Under Normal and Rayleigh Inverse Gaussian Distributions. 918-927 - Nikos Chatzichrisafis, Vassilios Diakoloukas, Vassilios Digalakis, Costas Harizakis:
Gaussian Mixture Clustering and Language Adaptation for the Development of a New Language Speech Recognition System. 928-938 - Ghinwa F. Choueiter, James R. Glass:
An Implementation of Rational Wavelets and Filter Design for Phonetic Classification. 939-948 - Esther Klabbers, Jan P. H. van Santen, Alexander Kain:
The Contribution of Various Sources of Spectral Mismatch to Audible Discontinuities in a Diphone Database. 949-956 - Jerome R. Bellegarda:
Globally Optimal Training of Unit Boundaries in Unit Selection Text-to-Speech Synthesis. 957-965 - Pim Korten, Jesper Jensen, Richard Heusdens:
High-Resolution Spherical Quantization of Sinusoidal Parameters. 966-981 - Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama:
A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering. 982-994 - Johannes Nix, Volker Hohmann:
Combined Estimation of Spectral Envelopes and Sound Source Direction of Concurrent Voices by Multidimensional Statistical Filtering. 995-1008 - Matthew E. P. Davies, Mark D. Plumbley:
Context-Dependent Beat Tracking of Musical Audio. 1009-1020 - Leevi Peltola, Cumhur Erkut, Perry R. Cook, Vesa Välimäki:
Synthesis of Hand Clapping Sounds. 1021-1029 - Jean-Marc Valin:
On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk. 1030-1034 - James D. Gordy, Rafik A. Goubran:
Statistical Analysis of Doubletalk Detection for Calibration and Performance Evaluation. 1035-1043 - Felix Albu, Martin Bouchard, Yuriy V. Zakharov:
Pseudo-Affine Projection Algorithms for Multichannel Active Noise Control. 1044-1052 - Jacob Benesty, Jingdong Chen, Yiteng Huang, Jacek Dmochowski:
On Microphone-Array Beamforming From a MIMO Acoustic Signal Processing Perspective. 1053-1065 - Tuomas Virtanen:
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria. 1066-1074 - Carlos Busso, Zhigang Deng, Michael Grimm, Ulrich Neumann, Shrikanth S. Narayanan:
Rigid Head Motion in Expressive Speech Animation: Analysis and Synthesis. 1075-1086 - Chen Yang, Frank K. Soong, Tan Lee:
Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR. 1087-1097 - Luis Buera, Eduardo Lleida, Antonio Miguel, Alfonso Ortega, Oscar Saz:
Cepstral Vector Normalization Based on Stereo Data for Robust Speech Recognition. 1098-1113 - Xianyu Zhao, Zhijian Ou:
Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition. 1114-1122
Volume 15, Number 4, May 2007
- Rasool Tahmasbi, Sadegh Rezaei:
A Soft Voice Activity Detection Using GARCH Filter and Variance Gamma Distribution. 1129-1134 - Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Alain de Cheveigné, Shigeki Sagayama:
Single and Multiple F0 Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments. 1135-1145 - Thomas Eriksson, Frank Norden:
Memory-Based Vector Quantization of LSF Parameters by a Power Series Approximation. 1146-1155 - Bengt J. Borgstrom, Mihaela van der Schaar, Abeer Alwan:
Rate Allocation for Noncollaborative Multiuser Speech Communication Systems Based on Bargaining Theory. 1156-1166 - Milan Jelinek, Redwan Salami:
Wideband Speech Coding Advances in VMR-WB Standard. 1167-1179 - Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller, Panagiotis Tsakalides:
A Spectral Conversion Approach to Single-Channel Speech Enhancement. 1180-1193 - Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan:
Noisy Speech Enhancement Using Harmonic-Noise Model and Codebook-Based Post-Processing. 1194-1203 - Xuechuan Wang, Douglas D. O'Shaughnessy:
Environmental Independent ASR Model Adaptation/Compensation by Bayesian Parametric Representation. 1204-1217 - Peter Birkholz, Dietmar Jackèl, Bernd J. Kröger:
Simulation of Losses Due to Turbulence in the Time-Varying Vocal System. 1218-1226 - Chung-Hsien Wu, Chi-Chun Hsia, Jiun-Fu Chen, Jhing-Fa Wang:
Variable-Length Unit Selection in TTS Using Structural Syntactic Cost. 1227-1235 - Karthikeyan Umapathy, Sridhar Krishnan, R. K. Rao:
Audio Signal Feature Extraction and Classification Using Local Discriminant Bases. 1236-1246 - Graham E. Poliner, Daniel P. W. Ellis, Andreas F. Ehmann, Emilia Gómez, Sebastian Streich, Beesuan Ong:
Melody Transcription From Music Audio: Approaches and Evaluation. 1247-1256 - Harvey D. Thornburg, Randal J. Leistikow, Jonathan Berger:
Melody Extraction and Musical Onset Detection via Probabilistic Models of Framewise STFT Peak Data. 1257-1272 - Emmanuel Vincent, Mark D. Plumbley:
Low Bit-Rate Object Coding of Musical Audio Using Bayesian Harmonic Models. 1273-1282 - Corentin Dubois, Manuel Davy:
Joint Detection and Tracking of Time-Varying Harmonic Components: A Flexible Bayesian Approach. 1283-1295 - H. M. A. Malik, Rashid Ansari, Ashfaq A. Khokhar:
Robust Data Hiding in Audio Using Allpass Filters. 1296-1304 - Yekutiel Avargel, Israel Cohen:
System Identification in the Short-Time Fourier Transform Domain With Crossband Filtering. 1305-1319 - Fredric Lindström, Christian Schüldt, Ingvar Claesson:
An Improvement of the Two-Path Algorithm Transfer Logic for Acoustic Echo Cancellation. 1320-1326 - Jacek Dmochowski, Jacob Benesty, Sofiène Affes:
Direction of Arrival Estimation Using the Parameterized Spatial Correlation Matrix. 1327-1339 - Wolfgang Herbordt, Herbert Buchner, Satoshi Nakamura, Walter Kellermann:
Multichannel Bin-Wise Robust Frequency-Domain Adaptive Filtering and Its Application to Adaptive Beamforming. 1340-1351 - Takaaki Hori, Chiori Hori, Yasuhiro Minami, Atsushi Nakamura:
Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition. 1352-1365 - Xiaodong Cui, Yifan Gong:
A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition. 1366-1376 - Mathias De Wachter, Mike Matton, Kris Demuynck, Patrick Wambacq, Ronald Cools, Dirk Van Compernolle:
Template-Based Continuous Speech Recognition. 1377-1390 - Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Exploiting Temporal Correlation of Speech for Error Robust and Bandwidth Flexible Distributed Speech Recognition. 1391-1403 - Paris Smaragdis, Madhusudana V. S. Shashanka:
A Framework for Secure Speech Recognition. 1404-1413 - Xunying Liu, Mark J. F. Gales:
Automatic Model Complexity Control Using Marginalized Discriminative Growth Functions. 1414-1424 - Yan Han, Johan de Veth, Lou Boves:
Trajectory Clustering for Solving the Trajectory Folding Problem in Automatic Speech Recognition. 1425-1434 - Patrick Kenny, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel:
Joint Factor Analysis Versus Eigenchannels in Speaker Recognition. 1435-1447 - Patrick Kenny, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel:
Speaker and Session Variability in GMM-Based Speaker Verification. 1448-1460 - Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang:
Automatic Speaker Clustering Using a Voice Characteristic Reference Space and Maximum Purity Estimation. 1461-1474 - Yipeng Li, DeLiang Wang:
Separation of Singing Voice From Music Accompaniment for Monaural Recordings. 1475-1487 - Stefan Bilbao, Lauri Savioja, Julius O. Smith III:
Parameterized Finite Difference Schemes for Plates: Stability, the Reduction of Directional Dispersion and Frequency Warping. 1488-1495 - Angel M. Gomez, Antonio M. Peinado, Victoria E. Sánchez, Antonio J. Rubio:
On the Ramsey Class of Interleavers for Robust Speech Recognition in Burst-Like Packet Loss. 1496-1499
Volume 15, Number 5, July 2007
- Scott C. Douglas, Malay Gupta, Hiroshi Sawada, Shoji Makino:
Spatio-Temporal FastICA Algorithms for the Blind Separation of Convolutive Mixtures. 1511-1520 - Intae Lee, Te-Won Lee:
On the Assumption of Spherical Symmetry and Sparseness for the Frequency-Domain Speech Model. 1521-1528 - Ernst Warsitz, Reinhold Haeb-Umbach:
Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition. 1529-1539 - Abdeldjalil Aïssa-El-Bey, Karim Abed-Meraim, Yves Grenier:
Blind Separation of Underdetermined Convolutive Mixtures Using Their Time-Frequency Representation. 1540-1550 - Zhaoshui He, Shengli Xie, Shuxue Ding, Andrzej Cichocki:
Convolutive Blind Source Separation in the Frequency Domain Based on Sparse Representation. 1551-1563 - Alexey Ozerov, Pierrick Philippe, Frédéric Bimbot, Rémi Gribonval:
Adaptation of Bayesian Models for Single-Channel Source Separation and its Application to Voice/Music Separation in Popular Songs. 1564-1578 - Ken'ichi Furuya, Akitoshi Kataoka:
Robust Speech Dereverberation Using Multichannel Blind Deconvolution With Spectral Subtraction. 1579-1591 - Hiroshi Sawada, Shoko Araki, Ryo Mukai, Shoji Makino:
Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation. 1592-1604 - Oscal T.-C. Chen, Chia-Hsiung Liu:
Content-Dependent Watermarking Scheme in Compressed Speech With Identifying Manner and Location of Attacks. 1605-1616 - Vesa Siivola, Teemu Hirsimäki, Sami Virpioja:
On Growing and Pruning Kneser-Ney Smoothed N-Gram Models. 1617-1624 - Mathieu Lagrange, Sylvain Marchand, Jean-Bernard Rault:
Enhancing the Tracking of Partials for the Sinusoidal Modeling of Polyphonic Sounds. 1625-1634 - Mads Græsbøll Christensen, Andreas Jakobsson, Søren Holdt Jensen:
Joint High-Resolution Fundamental Frequency and Order Estimation. 1635-1644 - Xinglei Zhu, Gerald Beauregard, Lonce L. Wyse:
Real-Time Signal Estimation From Modified Short-Time Fourier Transform Magnitude Spectra. 1645-1653 - Anders Meng, Peter Ahrendt, Jan Larsen, Lars Kai Hansen:
Temporal Feature Integration for Music Genre Classification. 1654-1664 - Masahiro Yukawa, Konstantinos Slavakis, Isao Yamada:
Adaptive Parallel Quadratic-Metric Projection Algorithms. 1665-1680 - Andy W. H. Khong, Patrick A. Naylor:
Selective-Tap Adaptive Filtering With Performance Analysis for Identification of Time-Varying Systems. 1681-1695 - Guillaume Lathoud, Jean-Marc Odobez:
Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers. 1696-1710 - Ji Ming, Timothy J. Hazen, James R. Glass, Douglas A. Reynolds:
Robust Speaker Recognition in Noisy Conditions. 1711-1723 - Mark D. Skowronski, John G. Harris:
Noise-Robust Automatic Speech Recognition Using a Predictive Echo State Network. 1724-1730 - Mohamed Afify, Olivier Siohan:
Comments on Vocal Tract Length Normalization Equals Linear Transformation in Cepstral Space. 1731-1732
Volume 15, Number 6, August 2007
- Jan S. Erkelens, Richard C. Hendriks, Richard Heusdens, Jesper Jensen:
Minimum Mean-Square Error Estimation of Discrete Fourier Coefficients With Generalized Gamma Priors. 1741-1752 - Chang Huai You, Susanto Rahardja, Soo Ngee Koh:
Audible Noise Reduction in Eigendomain for Speech Enhancement. 1753-1765 - Aarthi M. Reddy, Bhiksha Raj:
Soft Mask Methods for Single-Channel Speaker Separation. 1766-1776 - Ann Spriet, Geert Rombouts, Marc Moonen, Jan Wouters:
Combined Feedback and Noise Suppression in Hearing Aids. 1777-1790 - Marc Delcroix, Takafumi Hikichi, Masato Miyoshi:
Dereverberation and Denoising Using Multichannel Linear Prediction. 1791-1801 - Woojay Jeon, Biing-Hwang Juang:
Speech Analysis in a Model of the Central Auditory System. 1802-1817 - Nikolaos Mitianoudis, Tania Stathaki:
Batch and Online Underdetermined Source Separation Using Laplacian Mixture Models. 1818-1832 - Maurizio Mancini, Roberto Bresin, Catherine Pelachaud:
A Virtual Head Driven by Music Expressivity. 1833-1841 - Shantanu Chakrabartty, Yunbin Deng, Gert Cauwenberghs:
Robust Speech Feature Extraction by Growth Transformation in Reproducing Kernel Hilbert Space. 1842-1849 - Bertrand Mesot, David Barber:
Switching Linear Dynamical Systems for Noise Robust Speech Recognition. 1850-1858 - Amit S. Malegaonkar, Aladdin M. Ariyaeeinia, P. Sivakumaran:
Efficient Speaker Change Detection Using Adapted Gaussian Mixture Models. 1859-1869 - Yuan-Fu Liao, Zi-He Chen, Yau-Tarng Juang:
Latent Prosody Analysis for Robust Speaker Identification. 1870-1883 - Wai Nang Chan, Nengheng Zheng, Tan Lee:
Discrimination Power of Vocal Source and Vocal Tract Related Features for Speaker Segmentation. 1884-1892 - Wei Wu, Thomas Fang Zheng, Mingxing Xu, Frank K. Soong:
A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification. 1893-1903 - Jean-Luc Rouas:
Automatic Prosodic Variations Modeling for Language and Dialect Discrimination. 1904-1911 - Peter Taraba:
Kneser-Ney Smoothing With a Correcting Transformation for Small Data Sets. 1912-1921 - Darko Kirovski, Fabien A. P. Petitcolas, Zeph Landau:
The Replacement Attack. 1922-1931 - Kai Yu, Mark J. F. Gales:
Bayesian Adaptive Inference and Adaptive Training. 1932-1943
Volume 15, Number 7, September 2007
- Mark A. Przybocki, Alvin F. Martin, Audrey N. Le:
NIST Speaker Recognition Evaluations Utilizing the Mixer Corpora - 2004, 2005, 2006. 1951-1959 - Benoit G. B. Fauve, Driss Matrouf, Nicolas Scheffer, Jean-François Bonastre, John S. D. Mason:
State-of-the-Art Performance in Text-Independent Speaker Verification Through Open-Source Software. 1960-1968 - Fabio Castaldo, Daniele Colibro, Emanuele Dalmasso, Pietro Laface, Claudio Vair:
Compensation of Nuisance Factors for Speaker and Language Recognition. 1969-1978 - Lukás Burget, Pavel Matejka, Petr Schwarz, Ondrej Glembek, Jan Cernocký:
Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System. 1979-1986 - Andreas Stolcke, Sachin S. Kajarekar, Luciana Ferrer, E. Shrinberg:
Speaker Recognition With Session Variability Normalization Based on MLLR Adaptation Transforms. 1987-1998 - Shou-Chun Yin, Richard C. Rose, Patrick Kenny:
A Joint Factor Analysis Approach to Progressive Model Adaptation in Text-Independent Speaker Verification. 1999-2010 - Xavier Anguera, Chuck Wooters, Javier Hernando:
Acoustic Beamforming for Speaker Diarization of Meetings. 2011-2022 - Qin Jin, Tanja Schultz, Alex Waibel:
Far-Field Speaker Recognition. 2023-2032 - Hagai Aronowitz, David Burshtein:
Efficient Speaker Recognition Using Approximated Cross Entropy (ACE). 2033-2043 - Vinod Prakash, John H. L. Hansen:
In-Set/Out-of-Set Speaker Recognition Under Sparse Enrollment. 2044-2052 - Bin Ma, Haizhou Li, Rong Tong:
Spoken Language Recognition Using Ensemble Classifiers. 2053-2062 - Yosef A. Solewicz, Moshe Koppel:
UsingPost-Classifiers to Enhance Fusion of Low- and High-Level Speaker Recognition. 2063-2071 - Niko Brümmer, Lukás Burget, Jan Cernocký, Ondrej Glembek, Frantisek Grézl, Martin Karafiát, David A. van Leeuwen, Pavel Matejka, Petr Schwarz, Albert Strasheim:
Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006. 2072-2084 - William M. Campbell, Joseph P. Campbell, Terry P. Gleason, Douglas A. Reynolds, Wade Shen:
Speaker Verification Using Support Vector Machines and High-Level Features. 2085-2094 - Najim Dehak, Pierre Dumouchel, Patrick Kenny:
Modeling Prosodic Features With Joint Factor Analysis for Speaker Verification. 2095-2103 - Joaquin Gonzalez-Rodriguez, P. Rose, Daniel Ramos, Doroteo T. Toledano, Javier Ortega-Garcia:
Emulating DNA: Rigorous Quantification of Evidential Weight in Transparent and Testable Forensic Speaker Recognition. 2104-2115 - Jason D. Williams, S. Young:
Scaling POMDPs for Spoken Dialog Management. 2116-2129 - Soundararajan Srinivasan, DeLiang L. Wang:
Transforming Binary Uncertainties for Robust Speech Recognition. 2130-2140 - J. Usher, Jacob Benesty:
Enhancement of Spatial Sound Quality: A New Reverberation-Extraction Audio Upmixer. 2141-2150 - Cheng-Yuan Lin, Jyh-Shing Roger Jang:
Automatic Phonetic Segmentation by Score Predictive Model for the Corpora of Mandarin Singing Voices. 2151-2159 - Rusheng Hu, Yunxin Zhao:
Knowledge-Based Adaptive Decision Tree State Tying for Conversational Speech Recognition. 2160-2168
Volume 15, Number 8, November 2007
- Javier Ramírez, José C. Segura, Juan Manuel Górriz, Luz García:
Improved Voice Activity Detection Using Contextual Multiple Hypothesis Testing for Robust Speech Recognition. 2177-2189 - Dagen Wang, Shrikanth S. Narayanan:
Robust Speech Rate Estimation for Spontaneous Speech. 2190-2201 - Seung Seop Park, Nam Soo Kim:
On Using Multiple Models for Automatic Speech Segmentation. 2202-2212 - Robert I. Damper, Tasanawan Soonklang:
Subjective Evaluation of Techniques for Proper Name Pronunciation. 2213-2221 - Tomoki Toda, Alan W. Black, Keiichi Tokuda:
Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory. 2222-2235 - Te Li, Susanto Rahardja, Rongshan Yu, Soo Ngee Koh:
On Integer MDCT for Perceptual Audio Coding. 2236-2248 - Enrique Alexandre, Lucas Cuadra, Manuel Rosa-Zurera, Francisco López-Ferreras:
Feature Selection for Sound Classification in Hearing Aids Through Restricted Search Driven by Genetic Algorithms. 2249-2256 - Hari Krishna Maganti, Daniel Gatica-Perez, Iain McCowan:
Speech Enhancement and Recognition in Meetings With an Audio-Visual Sensor Array. 2257-2269 - Xiangyang Wang, Wei Qi, Panpan Niu:
A New Adaptive Digital Audio Watermarking Based on Support Vector Regression. 2270-2277 - Leslie S. Smith, Steve Collins:
Determining ITDs Using Two Microphones on a Flat Panel During Onset Intervals With a Biologically Inspired Spike-Based Technique. 2278-2286 - Harsha I. K. Rao, V. John Mathews, Young-Cheol Park:
A Minimax Approach for the Joint Design of Acoustic Crosstalk Cancellation Filters. 2287-2298 - Mohammad H. Radfar, Richard M. Dansereau:
Single-Channel Speech Separation Using Soft Mask Filtering. 2299-2310 - Jingyi Zhang, Wai Lok Woo, Satnam Singh Dlay:
Blind Source Separation of Postnonlinear Convolutive Mixture. 2311-2330 - Carlos Busso, Shrikanth S. Narayanan:
Interrelation Between Speech and Facial Gestures in Emotional Utterances: A Single Subject Study. 2331-2347 - Ari Abramson, Israel Cohen:
Simultaneous Detection and Estimation Approach for Speech Enhancement. 2348-2359 - Zohra Yermeche, Nedelko Grbic, Ingvar Claesson:
Blind Subband Beamforming With Time-Delay Constraints for Moving Source Speech Enhancement. 2360-2372 - Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, Dan Chazan:
A Large Margin Algorithm for Speech-to-Phoneme and Music-to-Score Alignment. 2373-2382 - Xinwei Li, Hui Jiang:
Solving Large-Margin Hidden Markov Model Estimation via Semidefinite Programming. 2383-2392 - Jinyu Li, Ming Yuan, Chin-Hui Lee:
Approximate Test Risk Bound Minimization Through Soft Margin Estimation. 2393-2404 - Mohamed Afify, Xinwei Li, Hui Jiang:
Statistical Analysis of Minimum Classification Error Learning for Gaussian and Hidden Markov Model Classifiers. 2405-2417 - Srinivasan Umesh, Rohit Sinha:
A Study of Filter Bank Smoothing in MFCC Features for Recognition of Children's Speech. 2418-2430 - Haitian Xu, Paul Dalsgaard, Zheng-Hua Tan, Børge Lindberg:
Noise Condition-Dependent Training Based on Noise Classification and SNR Estimation. 2431-2443 - Rongqing Huang, John H. L. Hansen:
Unsupervised Discriminative Training With Application to Dialect Classification. 2444-2453 - Shizhen Wang, Xiaodong Cui, Abeer Alwan:
Speaker Adaptation With Limited Data Using Regression-Tree-Based Spectral Peak Alignment. 2454-2464 - Jérôme Louradour, Khalid Daoudi, Francis R. Bach:
Feature Space Mahalanobis Sequence Kernels: Application to SVM Speaker Verification. 2465-2475 - Minho Jin, Frank K. Soong, Chang Dong Yoo:
A Syllable Lattice Approach to Speaker Verification. 2476-2484 - Mohamed Chibani, Roch Lefebvre, Philippe Gournay:
Fast Recovery for a CELP-Like Speech Codec After a Frame Erasure. 2485-2495 - Bernd Geiser, Peter Jax, Peter Vary, Hervé Taddei, Stefan Schandl, Martin Gartner, Cyril Guillaume, Stéphane Ragot:
Bandwidth Extension for Hierarchical Speech and Audio Coding in ITU-T Rec. G.729.1. 2496-2509 - Jacek Dmochowski, Jacob Benesty, Sofiène Affes:
A Generalized Steered Response Power Method for Computationally Viable Source Localization. 2510-2526 - Ken'ichi Kumatani, Tobias Gehrig, Uwe Mayer, Emilian Stoimenov, John W. McDonough, Matthias Wölfel:
Adaptive Beamforming With a Minimum Mutual Information Criterion. 2527-2541 - K. C. Ho, Ming Sun:
An Accurate Algebraic Closed-Form Solution for Energy-Based Source Localization. 2542-2550 - Chien-Lin Huang, Chung-Hsien Wu:
Spoken Document Retrieval Using Multilevel Knowledge and Semantic Verification. 2551-2560 - Toon van Waterschoot, Marc Moonen:
A Pole-Zero Placement Technique for Designing Second-Order IIR Parametric Equalizer Filters. 2561-2565
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.