ICASSP 2021: Toronto, ON, Canada
- IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021. IEEE 2021, ISBN 978-1-7281-7606-2
- Yi Luo, Zhuo Chen, Cong Han, Chenda Li, Tianyan Zhou, Nima Mesgarani:
Rethinking The Separation Layers In Speech Separation Networks. 1-5 - Xiaoyu Liu, Jordi Pons:
On Permutation Invariant Training For Speech Source Separation. 6-10 - Zhong-Qiu Wang, DeLiang Wang:
Count And Separate: Incorporating Speaker Counting For Continuous Speaker Separation. 11-15 - Yi Luo, Cong Han, Nima Mesgarani:
Ultra-Lightweight Speech Separation Via Group Communication. 16-20 - Cem Subakan, Mirco Ravanelli, Samuele Cornell, Mirko Bronzi, Jianyuan Zhong:
Attention Is All You Need In Speech Separation. 21-25 - Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor:
Multichannel Overlapping Speaker Segmentation Using Multiple Hypothesis Tracking Of Acoustic And Spatial Features. 26-30 - Zhepei Wang, Ritwik Giri, Umut Isik, Jean-Marc Valin, Arvindh Krishnaswamy:
Semi-Supervised Singing Voice Separation With Noisy Self-Training. 31-35 - Giorgia Cantisani, Slim Essid, Gaël Richard:
Neuro-Steered Music Source Separation With EEG-Based Auditory Attention Decoding And Contrastive-NMF. 36-40 - Yixuan Zhang, Yuzhou Liu, DeLiang Wang:
Complex Ratio Masking For Singing Voice Separation. 41-45 - Yun-Ning Hung, Gordon Wichern, Jonathan Le Roux:
Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision. 46-50 - Ryosuke Sawata, Stefan Uhlich, Shusuke Takahashi, Yuki Mitsufuji:
All For One And One For All: Improving Music Separation By Bridging Networks. 51-55 - Yongwei Gao, Xingjian Du, Bilei Zhu, Xiaoheng Sun, Wei Li, Zejun Ma:
An HRNet-BLSTM Model With Two-Stage Training For Singing Melody Extraction. 56-60 - Satwinder Singh, Ruili Wang, Yuanhang Qiu:
DeepF0: End-To-End Fundamental Frequency Estimation for Music and Speech Signals. 61-65 - Marco A. Martínez Ramírez, Oliver Wang, Paris Smaragdis, Nicholas J. Bryan:
Differentiable Signal Processing With Black-Box Audio Effects. 66-70 - Christian J. Steinmetz, Jordi Pons, Santiago Pascual, Joan Serrà:
Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects. 71-75 - Jiatong Shi, Shuai Guo, Nan Huo, Yuekai Zhang, Qin Jin:
Sequence-To-Sequence Singing Voice Synthesis With Perceptual Entropy Loss. 76-80 - Junghyun Koo, Seungryeol Paik, Kyogu Lee:
Reverb Conversion Of Mixed Vocal Tracks Using An End-To-End Convolutional Deep Neural Network. 81-85 - Bo-Wei Tseng, Yih-Liang Shen, Tai-Shih Chi:
Extending Music Based On Emotion And Tonality Via Generative Adversarial Network. 86-90 - William Vickers, Ben Milner, Robert Lee:
Improving The Robustness Of Right Whale Detection In Noisy Conditions Using Denoising Autoencoders And Augmented Training. 91-95 - Ondrej Cífka, Alexey Ozerov, Umut Simsekli, Gaël Richard:
Self-Supervised VQ-VAE for One-Shot Music Style Transfer. 96-100 - Hongwei Song, Jiqing Han, Shiwen Deng, Zhihao Du:
Capturing Temporal Dependencies Through Future Prediction for CNN-Based Audio Classifiers. 101-105 - T. J. Tsai:
Segmental DTW: A Parallelizable Alternative to Dynamic Time Warping. 106-110 - Keitaro Tanaka, Ryo Nishikimi, Yoshiaki Bando, Kazuyoshi Yoshii, Shigeo Morishima:
Pitch-Timbre Disentanglement Of Musical Instrument Sounds Based On VAE-Based Metric Learning. 111-115 - Robert Ayrapetian, Philip Hilmes, Mohamed Mansour, Trausti Kristjansson, Carlo Murgia:
Asynchronous Acoustic Echo Cancellation Over Wireless Channels. 116-120 - Mhd Modar Halimeh, Thomas Haubner, Annika Briegleb, Alexander Schmidt, Walter Kellermann:
Combining Adaptive Filtering And Complex-Valued Deep Postfiltering For Acoustic Echo Cancellation. 121-125 - Amir Ivry, Israel Cohen, Baruch Berdugo:
Deep Residual Echo Suppression With A Tunable Tradeoff Between Signal Distortion And Echo Suppression. 126-130 - Saeed Bagheri, Daniele Giacobello:
Robust STFT Domain Multi-Channel Acoustic Echo Cancellation with Adaptive Decorrelation of the Reference Signals. 131-135 - Meng Guo:
A Method for Determining Periodically Time-Varying Bias and Its Applications in Acoustic Feedback Cancellation. 136-140 - Ziteng Wang, Yueyue Na, Zhang Liu, Biao Tian, Qiang Fu:
Weighted Recursive Least Square Filter and Neural Network Based Residual Echo Suppression for the AEC-Challenge. 141-145 - Renhua Peng, Linjuan Cheng, Chengshi Zheng, Xiaodong Li:
ICASSP 2021 Acoustic Echo Cancellation Challenge: Integrated Adaptive Echo Cancellation with Time Alignment and Deep Learning-Based Residual Echo Plus Noise Suppression. 146-150 - Kusha Sridhar, Ross Cutler, Ando Saabas, Tanel Pärnamaa, Markus Loide, Hannes Gamper, Sebastian Braun, Robert Aichner, Sriram Srinivasan:
ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results. 151-155 - Jan Franzen, Ernst Seidel, Tim Fingscheidt:
AEC in A Netshell: on Target and Topology Choices for FCRN Acoustic Echo Cancellation. 156-160 - Jesper Brunnström, Shoichi Koyama:
Kernel-Interpolation-Based Filtered-X Least Mean Square for Spatial Active Noise Control In Time Domain. 161-165 - Jian Xu, Kean Chen, Yunhe Li:
Wave-Domain Optimization of Secondary Source Placement Free From Information of Error Sensor Positions. 166-170 - Woo-Sung Choi, Minseok Kim, Jaehwa Chung, Soonyoung Jung:
LaSAFT: Latent Source Attentive Frequency Transformation For Conditioned Source Separation. 171-175 - Robin Scheibler, Masahito Togami:
Surrogate Source Model Learning for Determined Source Separation. 176-180 - Han Li, Kean Chen, Bernhard U. Seeber:
Auditory Filterbanks Benefit Universal Sound Source Separation. 181-185 - Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey:
What's all the Fuss about Free Universal Sound Separation Data? 186-190 - Shota Inoue, Hirokazu Kameoka, Li Li, Shoji Makino:
SepNet: A Deep Separation Matrix Prediction Network for Multichannel Audio Source Separation. 191-195 - Pranay Manocha, Zeyu Jin, Richard Zhang, Adam Finkelstein:
CDPAM: Contrastive Learning for Perceptual Audio Similarity. 196-200 - Soichiro Oyabu, Daichi Kitamura, Kohei Yatabe:
Linear Multichannel Blind Source Separation based on Time-Frequency Mask Obtained by Harmonic/Percussive Sound Separation. 201-205 - Daniel Arteaga, Jordi Pons:
Multichannel-based Learning for Audio Object Extraction. 206-210 - Ali Aroudi, Sebastian Braun:
DBNet: DOA-Driven Beamforming Network for End-to-End Reverberant Sound Source Separation. 211-215 - Taishi Nakashima, Robin Scheibler, Masahito Togami, Nobutaka Ono:
Joint Dereverberation and Separation With Iterative Source Steering. 216-220 - Ingvi Örnolfsson, Torsten Dau, Ning Ma, Tobias May:
Exploiting Non-Negative Matrix Factorization for Binaural Sound Localization in the Presence of Directional Interference. 221-225 - Jirí Málek, Jakub Janský, Tomás Kounovský, Zbynek Koldovský, Jindrich Zdánský:
Blind Extraction of Moving Audio Source in a Challenging Environment Supported by Speaker Identification Via X-Vectors. 226-230 - Ashvala Vinay, Alexander Lerch, Grace Leslie:
Mind the Beat: Detecting Audio Onsets from EEG Recordings of Music Listening. 231-235 - Mojtaba Heydari, Zhiyao Duan:
Don't Look Back: An Online Beat Tracking Method Using RNN and Enhanced Particle Filtering. 236-240 - Xingjian Du, Bilei Zhu, Qiuqiang Kong, Zejun Ma:
Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling. 241-245 - I-Chieh Wei, Chih-Wei Wu, Li Su:
Improving Automatic Drum Transcription Using Large-Scale Audio-to-MIDI Aligned Data. 246-250 - Shuai Yu, Xiaoheng Sun, Yi Yu, Wei Li:
Frequency-Temporal Attention Network for Singing Melody Extraction. 251-255 - Yuki Hiramatsu, Go Shibata, Ryo Nishikimi, Eita Nakamura, Kazuyoshi Yoshii:
Statistical Correction of Transcribed Melody Notes Based on Probabilistic Integration of a Music Language Model and a Transcription Error Model. 256-260 - Sebastian Rosenzweig, Frank Scherbaum, Meinard Müller:
Reliability Assessment of Singing Voice F0-Estimates Using Multiple Algorithms. 261-265 - Sakya Basak, Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi:
End-to-End Lyrics Recognition with Voice to Singing Style Transfer. 266-270 - Lenny Renault, Andrea Vaglio, Romain Hennequin:
Singing Language Identification Using a Deep Phonotactic Approach. 271-275 - Jun-You Wang, Jyh-Shing Roger Jang:
On the Preparation and Validation of a Large-Scale Dataset of Singing Transcription. 276-280 - Lele Liu, Veronica Morfi, Emmanouil Benetos:
Joint Multi-Pitch Detection and Score Transcription for Polyphonic Piano Music. 281-285 - Yuan Wang, Shigeki Tanaka, Keita Yokoyama, Hsin-Tai Wu, Yi Fang:
Karaoke Key Recommendation Via Personalized Competence-Based Rating Prediction. 286-290 - Afagh Farhadi, Skyler G. Jennings, Elizabeth A. Strickland, Laurel H. Carney:
A Closed-Loop Gain-Control Feedback Model for The Medial Efferent System of The Descending Auditory Pathway. 291-295 - Zehai Tu, Ning Ma, Jon Barker:
DHASP: Differentiable Hearing Aid Speech Processing. 296-300 - Anil M. Nagathil, Florian Göbel, Alexandru Nelus, Ian C. Bruce:
Computationally Efficient DNN-Based Approximation of an Auditory Model for Applications in Speech Processing. 301-305 - Hideki Kawahara, Kohei Yatabe:
Cascaded All-Pass Filters with Randomized Center Frequencies and Phase Polarity for Acoustic and Speech Measurement and Data Augmentation. 306-310 - Danni Ma, Neville Ryant, Mark Liberman:
Probing Acoustic Representations for Phonetic Properties. 311-315 - Zhuohuang Zhang, Piyush Vyas, Xuan Dong, Donald S. Williamson:
An End-To-End Non-Intrusive Model for Subjective and Objective Real-World Speech Assessment Using a Multi-Task Framework. 316-320 - Yu Wang, Nicholas J. Bryan, Mark Cartwright, Juan Pablo Bello, Justin Salamon:
Few-Shot Continual Learning for Audio Classification. 321-325 - Huang Xie, Okko Räsänen, Tuomas Virtanen:
Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections. 326-330 - Hsin-Ping Huang, Krishna C. Puvvada, Ming Sun, Chao Wang:
Unsupervised and Semi-Supervised Few-Shot Acoustic Event Classification. 331-335 - Kota Dohi, Takashi Endo, Harsh Purohit, Ryo Tanabe, Yohei Kawaguchi:
Flow-Based Self-Supervised Density Estimation for Anomalous Sound Detection. 336-340 - Sangwook Park, Ashwin Bellur, David K. Han, Mounya Elhilali:
Self-Training for Sound Event Detection in Audio Mixtures. 341-345 - Shubhr Singh, Helen L. Bear, Emmanouil Benetos:
Prototypical Networks for Domain Adaptation in Acoustic Scene Classification. 346-350 - Helin Wang, Yuexian Zou, Wenwu Wang:
A Global-Local Attention Framework for Weakly Labelled Audio Tagging. 351-355 - Xu Zheng, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai:
An Improved Mean Teacher Based Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection. 356-360 - Léo Cances, Thomas Pellegrini:
Comparison of Deep Co-Training and Mean-Teacher Approaches for Semi-Supervised Audio Tagging. 361-365 - Shawn Hershey, Daniel P. W. Ellis, Eduardo Fonseca, Aren Jansen, Caroline Liu, R. Channing Moore, Manoj Plakal:
The Benefit of Temporally-Strong Labels in Audio Event Classification. 366-370 - Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra:
Unsupervised Contrastive Learning of Sound Event Representations. 371-375 - Chih-Yuan Koh, You-Siang Chen, Yi-Wen Liu, Mingsian R. Bai:
Sound Event Detection by Consistency Training and Pseudo-Labeling With Feature-Pyramid Convolutional Recurrent Neural Networks. 376-380 - Joan Serrà, Jordi Pons, Santiago Pascual:
SESQA: Semi-Supervised Learning for Speech Quality Assessment. 381-385 - Helmer Nylén, Saikat Chatterjee, Sten Ternström:
Detecting Signal Corruptions in Voice Recordings For Speech Therapy. 386-390 - Yichong Leng, Xu Tan, Sheng Zhao, Frank K. Soong, Xiang-Yang Li, Tao Qin:
MBNET: MOS Prediction for Synthesized Speech with Mean-Bias Network. 391-395 - Jana Roßbach, Saskia Röttges, Christopher F. Hauth, Thomas Brand, Bernd T. Meyer:
Non-Intrusive Binaural Prediction of Speech Intelligibility Based on Phoneme Classification. 396-400 - Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines:
Warp-Q: Quality Prediction for Generative Neural Speech Codecs. 401-405 - Ross Cutler, Babak Nadari, Markus Loide, Sten Sootla, Ando Saabas:
Crowdsourcing Approach for Subjective Evaluation of Echo Impairment. 406-410 - Shoichi Koyama, Takashi Amakasu, Natsuki Ueno, Hiroshi Saruwatari:
Amplitude Matching: Majorization-Minimization Algorithm for Sound Field Control Only with Amplitude Constraint. 411-415 - Huanyu Zuo, Thushara D. Abhayapala, Prasanga N. Samarasinghe:
3D Multizone Soundfield Reproduction in a Reverberant Environment Using Intensity Matching Method. 416-420 - Jens Ahrens, Hannes Helmholz, David Lou Alon, Sebastià V. Amengual Garí:
The Far-Field Equatorial Array for Binaural Rendering. 421-425 - Fabrice Katzberg, Marco Maaß, Alfred Mertins:
Spherical Harmonic Representation for Dynamic Sound-Field Measurements. 426-430 - Adrian Herzog, Daniele Mirabilii, Emanuël A. P. Habets:
Direction Preserving Wind Noise Reduction Of B-Format Signals. 431-435 - Robin Scheibler, Masahito Togami:
Refinement of Direction of Arrival Estimators by Majorization-Minimization Optimization on the Array Manifold. 436-440 - Yaxuan Zhou, Hao Jiang, Vamsi Krishna Ithapu:
On the Predictability of HRTFs from Ear Shapes Using Deep Networks. 441-445 - Lior Arbel, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely:
Applied Methods for Sparse Sampling of Head-Related Transfer Functions. 446-450 - Mengfan Zhang, Jui-Hsien Wang, Doug L. James:
Personalized HRTF Modeling Using DNN-Augmented BEM. 451-455 - Fabian Hübner, Wolfgang Mack, Emanuël A. P. Habets:
Efficient Training Data Generation for Phase-Based DOA Estimation. 456-460 - Giovanni Bologni, Richard Heusdens, Jorge Martínez:
Acoustic Reflectors Localization from Stereo Recordings Using Neural Networks. 461-465 - Usama Saqib, Antoine Deleforge, Jesper Rindom Jensen:
Detecting Acoustic Reflectors Using A Robot's Ego-Noise. 466-470 - Ziqi Fan, Vibhav Vineet, Chenshen Lu, T. W. Wu, Kyla A. McMullen:
Prediction of Object Geometry from Acoustic Scattering Using Convolutional Neural Networks. 471-475 - Tom Shlomo, Boaz Rafaely:
Blind Amplitude Estimation of Early Room Reflections Using Alternating Least Squares. 476-480 - Thomas McKenzie, Sebastian J. Schlecht, Ville Pulkki:
Acoustic Analysis and Dataset of Transitions Between Coupled Rooms. 481-485 - Yuying Li, Yuchen Liu, Donald S. Williamson:
On Loss Functions for Deep-Learning Based T60 Estimation. 486-490 - Hideyuki Tachibana:
Towards Listening to 10 People Simultaneously: An Efficient Permutation Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm. 491-495 - Andreas Brendel, Walter Kellermann:
Accelerating Auxiliary Function-Based Independent Vector Analysis. 496-500 - Beat Gfeller, Dominik Roblek, Marco Tagliasacchi:
One-Shot Conditional Audio Filtering of Arbitrary Sounds. 501-505 - Tetsuya Ueda, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Shoko Araki, Shoji Makino:
Low Latency Online Blind Source Separation Based on Joint Optimization with Blind Dereverberation. 506-510 - Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii:
Autoregressive Fast Multichannel Nonnegative Matrix Factorization For Joint Blind Source Separation And Dereverberation. 511-515 - Paul Magron, Pierre-Hugo Vial, Thomas Oberlin, Cédric Févotte:
Phase Recovery with Bregman Divergences for Audio Source Separation. 516-520 - Naoya Takahashi, Shota Inoue, Yuki Mitsufuji:
Adversarial Attacks on Audio Source Separation. 521-525 - Mieszko Fras, Konrad Kowalczyk:
Maximum a Posteriori Estimator for Convolutive Sound Source Separation with Sub-Source Based NTF Model and the Localization Probabilistic Prior on the Mixing Matrix. 526-530 - Efthymios Tzinis, Dimitrios Bralios, Paris Smaragdis:
Unified Gradient Reweighting for Model Biasing with Applications to Source Separation. 531-535 - Andres Ferraro, Yuntae Kim, Soohyeon Lee, Biho Kim, Namjun Jo, Semi Lim, Suyon Lim, Jungtaek Jang, Sehwan Kim, Xavier Serra, Dmitry Bogdanov:
Melon Playlist Dataset: A Public Dataset for Audio-Based Playlist Generation and Music Tagging. 536-540 - Furkan Yesiler, Emilio Molina, Joan Serrà, Emilia Gómez:
Investigating the Efficacy of Music Version Retrieval Systems for Setlist Identification. 541-545 - Kevin Ji, Daniel Yang, T. J. Tsai:
Instrument Classification of Solo Sheet Music Images. 546-550 - Xingjian Du, Zhesong Yu, Bilei Zhu, Xiaoou Chen, Zejun Ma:
ByteCover: Cover Song Identification Via Multi-Loss Training. 551-555 - Ho-Hsiang Wu, Chieh-Chi Kao, Qingming Tang, Ming Sun, Brian McFee, Juan Pablo Bello, Chao Wang:
Multi-Task Self-Supervised Pre-Training for Music Classification. 556-560 - Shreyan Chowdhury, Gerhard Widmer:
Towards Explaining Expressive Qualities in Piano Recordings: Transfer of Explanatory Features Via Acoustic Domain Adaptation. 561-565 - Ju-Chiang Wang, Jordan B. L. Smith, Jitong Chen, Xuchen Song, Yuxuan Wang:
Supervised Chorus Detection for Popular Music Using Convolutional Neural Network and Multi-Task Learning. 566-570 - Ruchit Agrawal, Daniel Wolff, Simon Dixon:
Structure-Aware Audio-to-Score Alignment Using Progressively Dilated Convolutional Neural Networks. 571-575 - Juan Sebastián Gómez Cañón, Estefanía Cano, Ana Gabriela Pandrea, Perfecto Herrera, Emilia Gómez:
Language-Sensitive Music Emotion Recognition Models: are We Really There Yet? 576-580 - Paul Magron, Cédric Févotte:
Leveraging the Structure of Musical Preference in Content-Aware Music Recommendation. 581-585 - Emir Demirel, Sven Ahlbäck, Simon Dixon:
Low Resource Audio-To-Lyrics Alignment from Polyphonic Music Recordings. 586-590 - Minz Won, Sergio Oramas, Oriol Nieto, Fabien Gouyon, Xavier Serra:
Multimodal Metric Learning for Tag-Based Music Retrieval. 591-595 - Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. 596-600 - Paulo Lopez-Meyer, Juan A. del Hoyo Ontiveros, Hong Lu, Georg Stemmer:
Efficient End-to-End Audio Embeddings Generation for Audio Classification on Target Applications. 601-605 - Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events. 606-610 - Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Lam Dang Pham, Philipp Koch, Ian McLoughlin, Alfred Mertins:
Multi-View Audio And Music Classification. 611-615 - Juncheng B. Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze:
Audio-Visual Event Recognition Through the Lens of Adversary. 616-620 - Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu:
DCASENET: An Integrated Pretrained Deep Neural Network for Detecting and Classifying Acoustic Scenes and Events. 621-625 - Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A Curated Dataset of Urban Scenes for Audio-Visual Scene Analysis. 626-630 - Giacomo Ferroni, Nicolas Turpault, Juan Azcarreta, Francesco Tuveri, Romain Serizel, Çagdas Bilen, Sacha Krstulovic:
Improving Sound Event Detection Metrics: Insights from DCASE 2020. 631-635 - Satvik Venkatesh, David Moffat, Alexis Kirke, Gözel Shakeri, Stephen A. Brewster, Jörg Fachner, Helen Odell-Miller, Alex Street, Nicolas Farina, Sube Banerjee, Eduardo Reck Miranda:
Artificially Synthesising Data for Audio Classification and Segmentation to Improve Speech and Music Detection in Radio Broadcast. 636-640 - Weiquan Fan, Xiangmin Xu, Xiaofen Xing, Weidong Chen, Dongyan Huang:
LSSED: A Large-Scale Dataset and Benchmark for Speech Emotion Recognition. 641-645 - Turab Iqbal, Karim Helwani, Arvindh Krishnaswamy, Wenwu Wang:
Enhancing Audio Augmentation Methods with Consistency Learning. 646-650 - Thomas Pellegrini, Timothée Masquelier:
Fast Threshold Optimization for Multi-Label Audio Tagging Using Surrogate Gradient Learning. 651-655 - Sebastian Braun, Hannes Gamper, Chandan K. A. Reddy, Ivan Tashev:
Towards Efficient Models for Real-Time Deep Noise Suppression. 656-660 - Sotaro Nakaoka, Li Li, Shota Inoue, Shoji Makino:
Teacher-Student Learning for Low-Latency Online Speech Enhancement Using Wave-U-Net. 661-665 - Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training. 666-670 - Koen Oostermeijer, Jun Du, Qing Wang, Chin-Hui Lee:
Speech Enhancement Autoencoder with Hierarchical Latent Structure. 671-675 - Huajian Fang, Guillaume Carbajal, Stefan Wermter, Timo Gerkmann:
Variational Autoencoder for Speech Enhancement with a Noise-Aware Encoder. 676-680 - Guillaume Carbajal, Julius Richter, Timo Gerkmann:
Guided Variational Autoencoder for Speech Enhancement with a Supervised Classifier. 681-685 - Satoru Emura, Noboru Harada:
An Extension of Sparse Audio Declipper to Multiple Measurement Vectors. 686-690 - Yunpeng Li, Marco Tagliasacchi, Oleg Rybakov, Victor Ungureanu, Dominik Roblek:
Real-Time Speech Frequency Bandwidth Extension. 691-695 - Jiaqi Su, Yunyun Wang, Adam Finkelstein, Zeyu Jin:
Bandwidth Extension is All You Need. 696-700 - Pavel Záviska, Pavel Rajmic, Ondrej Mokrý:
Audio Dequantization Using (Co)Sparse (Non)Convex Methods. 701-705 - Haici Yang, Kai Zhen, Seungkwon Beack, Minje Kim:
Source-Aware Neural Speech Coding for Noisy Speech Compression. 706-710 - Jonah Casebeer, Vinjai Vale, Umut Isik, Jean-Marc Valin, Ritwik Giri, Arvindh Krishnaswamy:
Enhancing into the Codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders. 711-715 - Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot:
Speech Enhancement with Mixture of Deep Experts with Clean Clustering Pre-Training. 716-720 - Yang Xiang, Liming Shi, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen:
A Novel NMF-HMM Speech Enhancement Algorithm Based on Poisson Mixture Model. 721-725 - Yajing Liu, Xiulian Peng, Zhiwei Xiong, Yan Lu:
Phoneme-Based Distribution Regularization for Speech Enhancement. 726-730 - Carol Chermaz, Dario Leuchtmann, Simon Tanner, Roger Wattenhofer:
Compressed Representation of Cepstral Coefficients via Recurrent Neural Networks for Informed Speech Enhancement. 731-735 - An Zhao, Krishna Subramani, Paris Smaragdis:
Optimizing Short-Time Fourier Transform Parameters via Gradient Descent. 736-740 - Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. 741-745 - Xudong Zhao, Gongping Huang, Jacob Benesty, Jingdong Chen, Israel Cohen:
On the Design of Square Differential Microphone Arrays with a Multistage Structure. 746-750 - Federico Borra, Alberto Bernardini, Ivan Bertuletti, Fabio Antonacci, Augusto Sarti:
Arrays of First-Order Steerable Differential Microphones. 751-755 - Xi Chen, Chao Pan, Jingdong Chen, Jacob Benesty:
Planar Array Geometry Optimization for Region Sound Acquisition. 756-760 - Alexandru Nelus, Rene Glitza, Rainer Martin:
Estimation of Microphone Clusters in Acoustic Sensor Networks Using Unsupervised Federated Learning. 761-765 - Gabriel F. Miller, Andreas Brendel, Walter Kellermann, Sharon Gannot:
Misalignment Recognition in Acoustic Sensor Networks Using a Semi-Supervised Source Estimation Method and Markov Random Fields. 766-770 - Yukoh Wakabayashi, Kouei Yamaoka, Nobutaka Ono:
Rotation-Robust Beamforming Based on Sound Field Interpolation with Regularly Circular Microphone Array. 771-775 - Shiduo Yu, Craig T. Jin, Fabio Antonacci, Augusto Sarti:
Sparse Recovery Beamforming and Upscaling in the Ray Space. 776-780 - Gongping Huang, Yuzhu Wang, Jacob Benesty, Israel Cohen, Jingdong Chen:
Combined Differential Beamforming With Uniform Linear Microphone Arrays. 781-785 - Vincent W. Neo, Christine Evers, Patrick A. Naylor:
Polynomial Matrix Eigenvalue Decomposition of Spherical Harmonics for Speech Enhancement. 786-790 - Jie Zhang:
A Parametric Unconstrained Binaural Beamformer Based Noise Reduction and Spatial Cue Preservation for Hearing-Assistive Devices. 791-795 - Fan Zhang, Chao Pan, Jacob Benesty, Jingdong Chen:
A Simplified Wiener Beamformer Based on Covariance Matrix Modelling. 796-800 - Aleksej Chinaev, Sven Wienand, Gerald Enzner:
Control Architecture of the Double-Cross-Correlation Processor for Sampling-Rate-Offset Estimation in Acoustic Sensor Networks. 801-805 - Yuto Kondo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari:
Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction. 806-810 - Noman Akbar, Glenn Dickins, Mark R. P. Thomas, Prasanga N. Samarasinghe, Thushara D. Abhayapala:
Reducing Modal Error Propagation through Correcting Mismatched Microphone Gains Using RAPID. 811-814 - Yonggang Hu, Prasanga N. Samarasinghe, Sharon Gannot, Thushara D. Abhayapala:
Evaluation and Comparison of Three Source Direction-of-Arrival Estimators Using Relative Harmonic Coefficients. 815-819 - Michael Günther, Haitham Afifi, Andreas Brendel, Holger Karl, Walter Kellermann:
Network-Aware Optimal Microphone Channel Selection in Wireless Acoustic Sensor Networks. 820-824 - Bing Yang, Xiaofei Li, Hong Liu:
Supervised Direct-Path Relative Transfer Function Learning for Binaural Sound Source Localization. 825-829 - Yang Liu, Alexandros Neophytou, Sunando Sengupta, Eric Sommerlade:
Cross-Modal Spectrum Transformation Network for Acoustic Scene Classification. 830-834 - Ziheng Lin, Yanxiong Li, Zhangjin Huang, Wenhao Zhang, Yufeng Tan, Yichun Chen, Qianhua He:
Domestic Activities Clustering From Audio Recordings Using Convolutional Capsule Autoencoder Network. 835-839 - Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Sound Event Detection and Separation: A Benchmark on DESED Synthetic Soundscapes. 840-844 - Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. 845-849 - Simyung Chang, Hyoungwoo Park, Janghoon Cho, Hyunsin Park, Sungrack Yun, Kyuwoong Hwang:
Subspectral Normalization for Neural Audio Data Processing. 850-854 - Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen:
Slow-Fast Auditory Streams for Audio Recognition. 855-859 - Keisuke Imoto, Sakiko Mishima, Yumi Arai, Reishi Kondo:
Impact of Sound Duration and Inactive Frames on Sound Event Detection Performance. 860-864 - Jan Baumann, Patrick Meyer, Timo Lohrenz, Alexander Roy, Michael Papendieck, Tim Fingscheidt:
A New DCASE 2017 Rare Sound Event Detection Benchmark Under Equal Training Data: CRNN With Multi-Width Kernels. 865-869 - Jaejun Lee, Donmoon Lee, Hyeong-Seok Choi, Kyogu Lee:
Room Adaptive Conditioning Method for Sound Event Classification in Reverberant Environments. 870-874 - Noriyuki Tonami, Keisuke Imoto, Yuki Okamoto, Takahiro Fukumori, Yoichi Yamashita:
Sound Event Detection Based on Curriculum Learning Considering Learning Difficulty of Events. 875-879 - Christopher Ick, Brian McFee:
Sound Event Detection in Urban Audio with Single and Multi-Rate PCEN. 880-884 - Yin Cao, Turab Iqbal, Qiuqiang Kong, Fengyan An, Wenwu Wang, Mark D. Plumbley:
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection. 885-889 - Shahan Nercessian, Andy M. Sarroff, Kurt James Werner:
Lightweight and Interpretable Neural Modeling of an Audio Distortion Effect Using Hyperconditioned Differentiable Biquads. 890-894 - Chih-Hsiang Huang, Po-Hao Wu, Yi-Wen Liu, Shan-Hung Wu:
Attacking and Defending Behind A Psychoacoustics-Based Captcha. 895-899 - JinHong Lu, Tianhang Liu, Shuzhuang Xu, Hiroshi Shimodaira:
Double-DCCCAE: Estimation of Body Gestures From Speech Waveform. 900-904 - Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu:
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. 905-909 - Jian Luo, Jianzong Wang, Ning Cheng, Jing Xiao:
Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition. 910-914 - Kazuki Shimada, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji:
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection. 915-919 - Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset. 920-924 - Eesung Kim, Jae-Jin Jeon, Hyeji Seo:
U-Convolution Based Residual Echo Suppression with Multiple Encoders. 925-929 - You Wang, Chuyao Feng, David V. Anderson:
A Multi-Channel Temporal Attention Convolutional Neural Network Model for Environmental Sound Classification. 930-934 - Thi Ngoc Tho Nguyen, Ngoc Khanh Nguyen, Huy Phan, Lam Pham, Kenneth Ooi, Douglas L. Jones, Woon-Seng Gan:
A General Network Architecture for Sound Event Localization and Detection Using Transfer Learning and Recurrent Neural Network. 935-939 - Hongsen He, Jingdong Chen, Jacob Benesty, Yi Yu:
Robust Recursive Least M-Estimate Adaptive Filter for the Identification of Low-Rank Acoustic Systems. 940-944 - Thomas Haubner, Andreas Brendel, Mohamed Elminshawi, Walter Kellermann:
Noise-Robust Adaptation Control for Supervised Acoustic System Identification Exploiting a Noise Dictionary. 945-949 - Matteo Acerbi, Raffaele Malvermi, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti, Roberto Corradi:
Interpolation of Irregularly Sampled Frequency Response Functions Using Convolutional Neural Networks. 950-954 - Heinrich W. Löllmann, Andreas Brendel, Walter Kellermann:
Effective Rank-Based Estimation of the Coherent-to-Diffuse Power Ratio. 955-959 - Orchisama Das, Paul Calamia, Sebastià V. Amengual Garí:
Room Impulse Response Interpolation from a Sparse Set of Measurements Using a Modal Architecture. 960-964 - Alastair H. Moore, Rebecca R. Vos, Patrick A. Naylor, Mike Brookes:
Processing Pipelines for Efficient, Physically-Accurate Simulation of Microphone Array Signals in Dynamic Sound Scenes. 965-969 - Hadi Habibzadeh, Olivia Zhou, James J. S. Norton, Theresa M. Vaughan, Daphney-Stavroula Zois:
A Classifier for Improving Cause and Effect in SSVEP-based BCIs for Individuals with Complex Communication Disorders. 970-974 - Boyuan Feng, Yuke Wang, Yufei Ding:
SAGA: Sparse Adversarial Attack on EEG-Based Brain Computer Interface. 975-979 - Marie-Constance Corsi, Florian Yger, Sylvain Chevallier, Camille Noûs:
Riemannian Geometry on Connectivity for Clinical BCI. 980-984 - Winko W. An, Barbara G. Shinn-Cunningham, Hannes Gamper, Dimitra Emmanouilidou, David Johnston, Mihai Jalobeanu, Edward Cutrell, Andrew D. Wilson, Kuan-Jung Chiang, Ivan Tashev:
Decoding Music Attention from "EEG Headphones": A User-Friendly Auditory Brain-Computer Interface. 985-989 - Sunhee Hwang, Sungho Park, Dohyung Kim, Jewook Lee, Hyeran Byun:
Mitigating Inter-Subject Brain Signal Variability for EEG-Based Driver Fatigue State Classification. 990-994 - Pradeep Kumar, Erik J. Scheme:
A Deep Spatio-Temporal Model for EEG-Based Imagined Speech Recognition. 995-999 - Bahman Abdi-Sargezeh, Antonio Valentín, Gonzalo Alarcón, Saeid Sanei:
Incorporating Uncertainty In Data Labeling Into Detection of Brain Interictal Epileptiform Discharges From EEG Using Weighted Optimization. 1000-1004 - Mikko Impiö, Mehmet Yamaç, Jenni Raitoharju:
Multi-Level Reversible Encryption for ECG Signals Using Compressive Sensing. 1005-1009 - Minh C. Tran, Phi Anh Phan, Douglas C. Crockett, Federico Formenti, John N. Cronin, Stephen J. Payne, Andrew D. Farmery:
Validating the Inspired Sinewave Technique to Measure Lung Heterogeneity Compared to Atelectasis & Over-Distended Volume in Computed Tomography Images. 1010-1014 - Nasimuddin Ahmed, Shivam Singhal, Varsha Sharma, Sakyajit Bhattacharya, Aniruddha Sinha, Avik Ghose:
A Patient-Invariant Model for Freezing of Gait Detection Aided by Wavelet Decomposition. 1015-1019 - Liu Yang, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric:
Identification of Uterine Contractions by An Ensemble of Gaussian Processes. 1020-1024 - Bin Wang, Chang Liu, Chuanyan Hu, Xudong Liu, Jun Cao:
Arrhythmia Classification with Heartbeat-Aware Transformer. 1025-1029 - Alejandro Cohen, Nir Shlezinger, Amit Solomon, Yonina C. Eldar, Muriel Médard:
Multi-Level Group Testing with Application to One-Shot Pooled COVID-19 Tests. 1030-1034 - Mahmoud Al Ismail, Soham Deshmukh, Rita Singh:
Detection of Covid-19 Through the Analysis of Vocal Fold Oscillations. 1035-1039 - Shahin Heidarian, Parnian Afshar, Arash Mohammadi, Moezedin Javad Rafiee, Anastasia Oikonomou, Konstantinos N. Plataniotis, Farnoosh Naderkhani:
CT-CAPS: Feature Extraction-Based Automated Framework for Covid-19 Disease Identification From Chest CT Scans Using Capsule Networks. 1040-1044 - Yifan Jiang, Han Chen, Hanseok Ko, David K. Han:
Few-Shot Learning for CT Scan Based Covid-19 Diagnosis. 1045-1049 - Huimin Huang, Ming Cai, Lanfen Lin, Jing Zheng, Xiongwei Mao, Xiaohan Qian, Zhiyi Peng, Jianying Zhou, Yutaro Iwamoto, Xian-Hua Han, Yen-Wei Chen, Ruofeng Tong:
Graph-Based Pyramid Global Context Reasoning With a Saliency-Aware Projection for Covid-19 Lung Infections Segmentation. 1050-1054 - Soham Deshmukh, Mahmoud Al Ismail, Rita Singh:
Interpreting Glottal Flow Dynamics for Detecting Covid-19 From Voice. 1055-1059 - Daniel Iglesias Morís, Joaquim de Moura, Jorge Novo, Marcos Ortega:
Cycle Generative Adversarial Network Approaches to Produce Novel Portable Chest X-Rays Images for Covid-19 Diagnosis. 1060-1064 - Seyed Saman Saboksayr, Gonzalo Mateos, Müjdat Çetin:
EEG-Based Emotion Classification Using Graph Signal Processing. 1065-1069 - Tamanna T. K. Munia, Selin Aviyente:
Granger Causality Based Directional Phase-Amplitude Coupling Measure. 1070-1074 - Giulia Cisotto:
REPAC: Reliable Estimation of Phase-Amplitude Coupling in Brain Networks. 1075-1079 - Maria Sayu Yamamoto, Florian Yger, Sylvain Chevallier:
Subspace Oddity - Optimization on Product of Stiefel Manifolds for EEG Data. 1080-1084 - Erdem Varol, Julien Boussard, Nishchal Dethe, Olivier Winter, Anne E. Urai, International Brain Laboratory, Anne Churchland, Nick Steinmetz, Liam Paninski:
Decentralized Motion Inference and Registration of Neuropixel Data. 1085-1089 - Bo Jiang, Yiyi Yu, Hamid Krim, Spencer L. Smith:
Dynamic Graph Learning Based on Graph Laplacian. 1090-1094 - Syed Ahmed Pasha, Victor Solo:
Mutual Information Flows in a Bivariate Point Process. 1095-1099 - Karim Armanious, Sherif Abdulatif, Wenbin Shi, Tobias Hepp, Sergios Gatidis, Bin Yang:
Uncertainty-Based Biological Age Estimation of Brain MRI Scans. 1100-1104 - Jia-Yang Song, Miao-Ying Qi, Dun-Pei Lv, Chao-Ying Zhang, Qiu-Hua Lin, Vince D. Calhoun:
Sparse Representation of Complex-Valued fMRI Data Based on Hard Thresholding of Spatial Source Phase. 1105-1109 - Yue Han, Qiu-Hua Lin, Li-Dan Kuang, Xiao-Feng Gong, Fengyu Cong, Vince D. Calhoun:
Tucker Decomposition for Extracting Shared and Individual Spatial Maps from Multi-Subject Resting-State fMRI Data. 1110-1114 - Simon Geirnaert, Tom Francart, Alexander Bertrand:
Riemannian Geometry-Based Decoding of the Directional Focus of Auditory Attention Using EEG. 1115-1119 - Wei Chen, Qiuli Wang, Sheng Huang, Xiaohong Zhang, Yucong Li, Chen Liu:
DFDM: A Deep Feature Decoupling Module for Lung Nodule Segmentation. 1120-1124 - Jiawei Zhang, Yanchun Zhang, Xiaowei Xu:
Pyramid U-Net for Retinal Vessel Segmentation. 1125-1129 - Xiaojiang Long, Wei Chen, Qiuli Wang, Xiaohong Zhang, Chen Liu, Yucong Li, Jiuquan Zhang:
A Probabilistic Model for Segmentation of Ambiguous 3D Lung Nodule. 1130-1134 - Zhiqiang Xie, Enmei Tu, Hao Zheng, Yun Gu, Jie Yang:
Semi-Supervised Skin Lesion Segmentation with Learning Model Confidence. 1135-1139 - Xiangjiang Wu, Xuanya Li, Kai Hu, Zhineng Chen, Xieping Gao:
A Hybrid Feature Enhancement Method for Gland Segmentation In Histopathology Images. 1140-1144 - Annika Liebgott, Charlotte Lorenz, Sergios Gatidis, Viet Chau Vu, Konstantin Nikolaou, Bin Yang:
Automated Multi-Organ Segmentation in PET Images Using Cascaded Training of a 3D U-Net and Convolutional Autoencoder. 1145-1149 - Burhaneddin Yaman, Seyed Amir Hossein Hosseini, Steen Moeller, Mehmet Akçakaya:
Improved Supervised Training of Physics-Guided Deep Learning Image Reconstruction with Multi-Masking. 1150-1154 - Jingshuai Liu, Mehrdad Yaghoobi:
Fine-Grained MRI Reconstruction Using Attentive Selection Generative Adversarial Networks. 1155-1159 - Hemant Kumar Aggarwal, Aniket Pramanik, Mathews Jacob:
Ensure: Ensemble Stein's Unbiased Risk Estimator for Unsupervised Learning. 1160-1164 - Narges Mohammadi, Marvin M. Doyley, Müjdat Çetin:
Ultrasound Elasticity Imaging Using Physics-Based Models and Learning-Based Plug-and-Play Priors. 1165-1169 - Yinbing Tian, Shibiao Xu, Li Guo, Fu'ze Cong:
A Periodic Frame Learning Approach for Accurate Landmark Localization in M-Mode Echocardiography. 1170-1174 - Madhuri Nagare, Roman Melnyk, Obaidullah Rahman, Ken D. Sauer, Charles A. Bouman:
A Bias-Reducing Loss Function for CT Image Denoising. 1175-1179 - Xiao Kang, Xingbo Liu, Xiushan Nie, Yilong Yin:
Learning Binary Semantic Embedding for Breast Histology Image Classification and Retrieval. 1180-1184 - Changlu Guo, Márton Szemenyei, Yangtao Hu, Wenle Wang, Wei Zhou, Yugen Yi:
Channel Attention Residual U-Net for Retinal Vessel Segmentation. 1185-1189 - Tristan Sylvain, Francis Dutil, Tess Berthier, Lisa Di-Jorio, Margaux Luck, R. Devon Hjelm, Yoshua Bengio:
CMIM: Cross-Modal Information Maximization For Medical Imaging. 1190-1194 - Rui Zhao, Zixun Huang, Tianshan Liu, Frank H. F. Leung, Sai Ho Ling, De Yang, Timothy Tin-Yan Lee, Daniel Pak-Kong Lun, Yong-Ping Zheng, Kin-Man Lam:
Structure-Enhanced Attentive Learning For Spine Segmentation From Ultrasound Volume Projection Images. 1195-1199 - Zhijin Liang, Junkang Zhang, Cheolhong An:
Foveal Avascular Zone Segmentation of OCTA Images Using Deep Learning Approach with Unsupervised Vessel Segmentation. 1200-1204 - Angelo Genovese, Mahdi S. Hosseini, Vincenzo Piuri, Konstantinos N. Plataniotis, Fabio Scotti:
Acute Lymphoblastic Leukemia Detection Based on Adaptive Unsharpening and Deep Learning. 1205-1209 - Yiming Lei, Hongming Shan, Junping Zhang:
Meta Ordinal Weighting Net For Improving Lung Nodule Classification. 1210-1214 - Jingqin Li, Kun Wang, Dan Yang, Xiaohong Zhang, Chen Liu:
Deepnodule: Multi-Task Learning of Segmentation Bootstrap for Pulmonary Nodule Detection. 1215-1219 - Jiannan Liu, Jie Li, Fanyong Xue, Chentao Wu:
Dense Attention Module for Accurate Pulmonary Nodule Detection. 1220-1224 - Zhe Xu, Jiangpeng Yan, Jie Luo, Xiu Li, Jayender Jagadeesan:
Unsupervised Multimodal Image Registration with Adaptative Gradient Guidance. 1225-1229 - Meng Jia, Matthew Kyan:
Improving Intraoperative Liver Registration in Image-Guided Surgery with Learning-Based Reconstruction. 1230-1234 - Xinxin Shan, Ying Wen:
A New Framework Based on Transfer Learning for Cross-Database Pneumonia Detection. 1235-1239 - Chao Li, Boyang Chen, Ziping Zhao, Nicholas Cummins, Björn W. Schuller:
Hierarchical Attention-Based Temporal Convolutional Networks for EEG-Based Emotion Recognition. 1240-1244 - Jaswanth Reddy Katthi, Sriram Ganapathy:
Deep Multiway Canonical Correlation Analysis For Multi-Subject EEG Normalization. 1245-1249 - Puneet Mathur, Trisha Mittal, Dinesh Manocha:
Dynamic Graph Modeling Of Simultaneous EEG And Eye-Tracking Data For Reading Task Identification. 1250-1254 - Aaqib Saeed, David Grangier, Olivier Pietquin, Neil Zeghidour:
Learning From Heterogeneous EEG Signals with Differentiable Channel Reordering. 1255-1259 - Chi Nok Enoch Kan, Richard J. Povinelli, Dong Hye Ye:
Enhancing Multi-Channel EEG Classification with Gramian Temporal Generative Adversarial Networks. 1260-1264 - Haoming Zhang, Chen Wei, Mingqi Zhao, Quanying Liu, Haiyan Wu:
A Novel Convolutional Neural Network Model to Remove Muscle Artifacts from EEG. 1265-1269 - Alexander William Wong, Amir Salimi, Abram Hindle, Sunil Vasu Kalmady, Padma Kaul:
Multilabel 12-Lead Electrocardiogram Classification Using Beat to Sequence Autoencoders. 1270-1274 - Wenjie Song, Jiqing Han, Hongwei Song:
Contrastive Embedding Learning Method for Respiratory Sound Classification. 1275-1279 - Pei-Chun Chang, Jia-Ren Chang, Po-Yu Chen, Li-Kai Cheng, Jen-Chuen Hsieh, Hsin-Yen Yu, Li-Fen Chen, Yong-Sheng Chen:
Decoding Neural Representations of Rhythmic Sounds From Magnetoencephalography. 1280-1284 - Jian Guan, Wenbo Wang, Pengming Feng, Xinxin Wang, Wenwu Wang:
Low-Dimensional Denoising Embedding Transformer for ECG Classification. 1285-1289 - Qinfeng Xiao, Jing Wang, Jianan Ye, Hongjun Zhang, Yuyan Bu, Yiqiong Zhang, Hao Wu:
Self-Supervised Learning for Sleep Stage Classification with Predictive and Discriminative Contrastive Coding. 1290-1294 - Chuanqi Han, Fang Yu, Peng Wang, Ruoran Huang, Xi Huang, Li Cui:
Length No Longer Matters: A Real Length Adaptive Arrhythmia Classification Model with Multi-Scale Convolution. 1295-1299 - Elahe Rahimian, Soheil Zabihi, Amir Asif, Seyed Farokh Atashzar, Arash Mohammadi:
Few-Shot Learning for Decoding Surface Electromyography for Hand Gesture Recognition. 1300-1304 - Upasana Tiwari, Swapnil Bhosale, Rupayan Chakraborty, Sunil Kumar Kopparapu:
Deep Lung Auscultation Using Acoustic Biomarkers for Abnormal Respiratory Sound Event Detection. 1305-1309 - Maryam Hosseini, Luca Celotti, Eric Plourde:
Speaker-Independent Brain Enhanced Speech Denoising. 1310-1314 - Shreyasi Datta, Chandan K. Karmakar, Punit Rathore, Marimuthu Palaniswami:
Shapelet Based Visual Assessment of Cluster Tendency in Analyzing Complex Upper Limb Motion. 1315-1319 - Ryosuke Sawata, Takahiro Ogawa, Miki Haseyama:
Human-Centered Favorite Music Classification Using EEG-Based Individual Music Preference Via Deep Time-Series CCA. 1320-1324 - Mingyue Niu, Jianhua Tao, Bin Liu:
Multi-Scale and Multi-Region Facial Discriminative Representation for Automatic Depression Level Prediction. 1325-1329 - Zeeshan Ahmad, Anika Tabassum, Ling Guan, Naimul Mefraz Khan:
ECG Heart-Beat Classification Using Multimodal Image Fusion. 1330-1334 - Takaaki Higashi, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:
Estimation of Visual Features of Viewed Image From Individual and Shared Brain Information Based on FMRI Data Using Probabilistic Generative Model. 1335-1339 - Jianxiong Zhou, Zhongyu Jiang, Jang-Hee Yoo, Jenq-Neng Hwang:
Hierarchical Pose Classification for Infant Action Analysis and Mental Development Assessment. 1340-1344 - Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik, Mathew Magimai-Doss:
On The Relationship Between Speech-Based Breathing Signal Prediction Evaluation Measures and Breathing Parameters Estimation. 1345-1349 - Jianhong Cheng, Jin Liu, Meilin Jiang, Hailin Yue, Lin Wu, Jianxin Wang:
Prediction of EGFR Mutation Status in Lung Adenocarcinoma Using Multi-Source Feature Representations. 1350-1354 - Taeheon Lee, Jeonghwan Hwang, Honggu Lee:
Training Neural Networks with Domain Pattern-Aware Auxiliary Task for Sleep Staging. 1355-1359 - Yusuke Akamatsu, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:
Classification of Expert-Novice Level Using Eye Tracking And Motion Data via Conditional Multimodal Variational Autoencoder. 1360-1364 - Fang Yu, Chuanqi Han, Pengcheng Wang, Xi Huang, Li Cui:
Gate Trimming: One-Shot Channel Pruning for Efficient Convolutional Neural Networks. 1365-1369 - Christopher A. Metzler, Gordon Wetzstein:
Deep S3PR: Simultaneous Source Separation and Phase Retrieval Using Deep Generative Models. 1370-1374 - Zhenbo Shi, Wei Yang, Zhenbo Xu, Zhi Chen, Yingjie Li, Haoran Zhu, Liusheng Huang:
Adversarial Attacks on Object Detectors with Limited Perturbations. 1375-1379 - Rakib Hyder, Hassan Mansour, Yanting Ma, Petros T. Boufounos, Pu Wang:
A Consensus Equilibrium Solution For Deep Image Prior Powered By Red. 1380-1384 - Ruangrawee Kitichotkul, Christopher A. Metzler, Frank Ong, Gordon Wetzstein:
Suremap: Predicting Uncertainty in CNN-Based Image Reconstructions Using Stein's Unbiased Risk Estimate. 1385-1389 - Zhengyu Chen, Donglin Wang:
Multi-Initialization Meta-Learning with Domain Adaptation. 1390-1394 - Jiaming Liu, Yu Sun, Weijie Gan, Xiaojian Xu, Brendt Wohlberg, Ulugbek S. Kamilov:
Stochastic Deep Unfolding for Imaging Inverse Problems. 1395-1399 - Laixi Shi, Dehong Liu, Masaki Umeda, Norihiko Hana:
Fusion-Based Digital Image Correlation Framework for Strain Measurement. 1400-1404 - Kaiyi Yang, Narong Borijindargoon, Boon Poh Ng, Saiprasad Ravishankar, Bihan Wen:
Learning Sparsifying Transforms for Image Reconstruction in Electrical Impedance Tomography. 1405-1409 - Christopher A. Metzler, Gordon Wetzstein:
D-VDAMP: Denoising-Based Approximate Message Passing for Compressive MRI. 1410-1414 - Byung Hyun Lee, Se Young Chun:
Empirically Accelerating Scaled Gradient Projection Using Deep Neural Network for Inverse Problems in Image Processing. 1415-1419 - Boqiang Fan, Samarjit Das:
Synthetic Aperture Acoustic Imaging with Deep Generative Model Based Source Distribution Prior. 1420-1424 - Chaobing Zheng, Zhengguo Li, Yuwen Li, Shiqian Wu:
Non-Local Single Image De-Raining Without Decomposition. 1425-1429 - Takashi Isobe, Fang Zhu, Shengjin Wang:
Frame-Rate-Aware Aggregation for Efficient Video Super-Resolution. 1430-1434 - Rentao Wan, Jinjia Zhou, Bowen Huang, Hui Zeng, Yibo Fan:
Measurement Coding Framework with Adjacent Pixels Based Measurement Matrix for Compressively Sensed Images. 1435-1439 - Yanting Ma, Petros T. Boufounos, Hassan Mansour, Shuchin Aeron:
Multiview Sensing with Unknown Permutations: an Optimal Transport Approach. 1440-1444 - Yuhu Chang, Changyang He, Yingying Zhao, Tun Lu, Ning Gu:
A High-Frame-Rate Eye-Tracking Framework for Mobile Devices. 1445-1449 - Ali Ghofrani, Rahil Mahdian Toroghi, Seyed Mojtaba Tabatabaie:
Catiloc: Camera Image Transformer for Indoor Localization. 1450-1454 - Zi-Yao Zhang, Odysseas A. Pappas, Alin Achim:
SAR Image Autofocusing Using Wirtinger Calculus and Cauchy Regularization. 1455-1459 - Luciano C. Ayres, Sérgio J. M. de Almeida, José C. M. Bermudez, Ricardo Augusto Borsoi:
A Homogeneity-Based Multiscale Hyperspectral Image Representation for Sparse Spectral Unmixing. 1460-1464 - Jisheng Li, Qi Dai, Jiangtao Wen:
Learning to Estimate Kernel Scale and Orientation of Defocus Blur with Asymmetric Coded Aperture. 1465-1469 - Jorge Bacca, Tatiana Gelvez, Henry Arguello:
Transmittance Regularizer for Binary coded Aperture Design in a Computational Imaging end-to-end Approach. 1470-1474 - Demetris Lappas, Vasileios Argyriou, Dimitrios Makris:
Fourier Transformation Autoencoders for Anomaly Detection. 1475-1479 - Kazuki Naganuma, Saori Takeyama, Shunsuke Ono:
Zero-Gradient Constraints for Destriping of Remote-Sensing Data. 1480-1484 - Zhiguo Li, Yuan Yuan, Dandan Ma:
Selection Based on Statistical Characteristics for Object Detection. 1485-1489 - Tianyuan Wang, Can Ma, Haoshan Su, Weiping Wang:
CSPN: Multi-Scale Cascade Spatial Pyramid Network for Object Detection. 1490-1494 - Shuyong Gao, Qianyu Guo, Wei Zhang, Wenqiang Zhang, Zhongwei Ji:
Dual-Stream Network Based On Global Guidance for Salient Object Detection. 1495-1499 - Tianyuan Wang, Can Ma, Haoshan Su, Weiping Wang:
SSFENet: Spatial and Semantic Feature Enhancement Network for Object Detection. 1500-1504 - Kristian Fischer, Felix Fleckenstein, Christian Herglotz, André Kaup:
Saliency-Driven Versatile Video Coding for Neural Object Detection. 1505-1509 - Shuyu Miao, Rui Feng:
Object-Oriented Relational Distillation for Object Detection. 1510-1514 - Kateryna Chumachenko, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj:
Ensembling Object Detectors for Image and Video Data Analysis. 1515-1519 - Qing-Yang Shen, Tian-Guo Huang, Peng-Xin Ding, Jia He:
Training Real-Time Panoramic Object Detectors with Virtual Dataset. 1520-1524 - Lv Tang, Bo Li, Yanliang Wu, Bo Xiao, Shouhong Ding:
Fast: Feature Aggregation for Detecting Salient Object in Real-Time. 1525-1529 - Wanli Ma, Alin Achim, Oktay Karakus:
Exploiting the Dual-Tree Complex Wavelet Transform for Ship Wake Detection in SAR Imagery. 1530-1534 - Zhinan Cai, Zhiyu Jiang, Yuan Yuan:
Task-Related Self-Supervised Learning For Remote Sensing Image Change Detection. 1535-1539 - Makoto Okuda, Shin'ichi Satoh, Yoichi Sato, Yutaka Kidawara:
Unsupervised Common Particular Object Discovery and Localization by Analyzing a Match Graph. 1540-1544 - Madeleine Barowsky, Alexander Mariona, Flávio P. Calmon:
Predictive Coding for Lossless Dataset Compression. 1545-1549 - Weijia Zhu, Jizheng Xu, Li Zhang, Yue Wang:
Adaptive Dual Tree Structure For Screen Content Coding. 1550-1554 - Mingze Ding, Jiahui Li, Mengyao Ma, Xiaopeng Fan:
SNR-Adaptive Deep Joint Source-Channel Coding for Wireless Image Transmission. 1555-1559 - Gabriel B. Sant'Anna, Luiz Henrique Cancellier, Ismael Seidel, Mateus Grellert, José Luís Güntzel:
Relying on a Rate Constraint to Reduce Motion Estimation Complexity. 1560-1564 - Andy Regensky, Christian Herglotz, André Kaup:
A Novel Viewport-Adaptive Motion Compensation Technique for Fisheye Video. 1565-1569 - Alban Marie, Navid Mahmoudian Bidgoli, Thomas Maugey, Aline Roumy:
Rate-Distortion Optimized Motion Estimation for on-the-Sphere Compression of 360 Videos. 1570-1574 - Bohan Li, Jingning Han, Yaowu Xu:
Adaptive GOP Size Decision for Multi-Pass Video Coding Based on Hidden Markov Model. 1575-1579 - Yize Jin, Liang Zhao, Xin Zhao, Shan Liu, Alan C. Bovik:
Improved Intra Mode Coding Beyond AV1. 1580-1584 - Xinyao Chen, Yiwei Zhang, Yanghao Li, Jiangtao Wen:
Decision Tree Based Inter Partition Termination For AV1 Encoding. 1585-1589 - Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi Youvalari, Esa Rahtu:
Image Coding For Machines: an End-To-End Learned Approach. 1590-1594 - Shihui Zhao, Shuyuan Yang, Zhi Liu, Zhixi Feng, Xu Liu:
Sparse Flow Adversarial Model For Robust Image Compression. 1595-1599 - Lee Prangnell, Victor Sanchez:
HVS-Based Perceptual Color Compression of Image Data. 1600-1604 - Yalei Lv, Tao Dai, Bin Chen, Jian Lu, Shu-Tao Xia, Jingchao Cao:
HOCA: Higher-Order Channel Attention for Single Image Super-Resolution. 1605-1609 - Anqi Liu, Sumei Li, Yongli Chang:
Image Super-Resolution Using Multi-Resolution Attention Network. 1610-1614 - Zhihong Pan, Baopu Li:
Real Image Super-Resolution Using Token Based Contextual Attention. 1615-1619 - Jun Xiao, Wenqi Jia, Kin-Man Lam:
Feature Redundancy Mining: Deep Light-Weight Image Super-Resolution Model. 1620-1624 - Risheng Wang, Tao Lei, Wenzheng Zhou, Qi Wang, Hongying Meng, Asoke K. Nandi:
Lightweight Non-Local Network for Image Super-Resolution. 1625-1629 - Zhonghan Niu, Xi-Peng Lin, An-Ni Yu, Yang-Hao Zhou, Yu-Bin Yang:
Lightweight and Accurate Single Image Super-Resolution with Channel Segregation Network. 1630-1634 - Angel Villar-Corrales, Franziska Schirrmacher, Christian Riess:
Deep Learning Architectural Designs for Super-Resolution Of Noisy Images. 1635-1639 - Andrew Gigie, Achanna Anil Kumar, Angshul Majumdar, Kriti Kumar, M. Girish Chandra:
Joint Coupled Transform Learning Framework for Multimodal Image Super-Resolution. 1640-1644 - Qiang Li, Qi Wang, Xuelong Li:
Hyperspectral Image Super-Resolution Via Adjacent Spectral Fusion Strategy. 1645-1649 - Miguel Heredia Conde:
Raw Data Processing for Practical Time-of-Flight Super-Resolution. 1650-1654 - Jun Xia, Guanghua Tan, Yi Xiao, Fangqiang Xu, Chi-Sing Leung:
Edge-Aware Multi-Scale Progressive Colorization. 1655-1659 - Kangbo Sun, Jie Zhu:
Learning Representation of Multi-Scale Object for Fine-Grained Image Retrieval. 1660-1664 - Yu Sang, Jinguang Sun, Si-Miao Wang, Heng Qi, Keqiu Li:
Super-Resolution and Infection Edge Detection Co-Guided Learning for Covid-19 CT Segmentation. 1665-1669 - Weidong He, Yangjinan Hu, Lulu Wang, Zhongshi He, Jinglong Du:
Gating Feature Dense Network for Single Anisotropic MR Image Super-Resolution. 1670-1674 - Yankai Wang, Dawei Yang, Wei Zhang, Zhe Jiang, Wenqiang Zhang:
Adaptable Ensemble Distillation. 1675-1679 - Akshay Rangamani, Nam H. Nguyen, Abhishek Kumar, Dzung T. Phan, Sang (Peter) Chin, Trac D. Tran:
A Scale Invariant Measure of Flatness for Deep Network Minima. 1680-1684 - Zhixiao Fu, Xinyuan Chen, Jianfeng Dong, Shouling Ji:
Multi-Order Adversarial Representation Learning for Composed Query Image Retrieval. 1685-1689 - Zhengbo Luo, Sei-ichiro Kamata, Zitang Sun, Weilian Zhou:
Deep Neural Networks with Flexible Complexity While Training Based on Neural Ordinary Differential Equations. 1690-1694 - Adrian Bulat, Enrique Sánchez-Lozano, Georgios Tzimiropoulos:
Improving Memory Banks for Unsupervised Learning with Large Mini-Batch, Consistency and Hard Negative Mining. 1695-1699 - Defu Liu, Guowu Yang, Jinzhao Wu, Jiayi Zhao, Fengmao Lv:
Robust Binary Loss for Multi-Category Classification with Label Noise. 1700-1704 - Zengsheng Kuang, Xian Fang, Ruixun Zhang, Xiuli Shao, Hongpeng Wang:
A Plug and Play Fast Intersection Over Union Loss for Boundary Box Regression. 1705-1709 - Sheng-Jhe Huang, Jen-Tzung Chien:
Attribute Decomposition for Flow-Based Domain Mapping. 1710-1714 - Mahesh Sudhakar, Sam Sattarzadeh, Konstantinos N. Plataniotis, Jongseong Jang, Yeonjeong Jeong, Hyunwoo Kim:
Ada-Sise: Adaptive Semantic Input Sampling for Efficient Explanation of Convolutional Neural Networks. 1715-1719 - Hao Pan, Zhongdi Chao, Jiang Qian, Bojin Zhuang, Shaojun Wang, Jing Xiao:
Network Pruning Using Linear Dependency Analysis on Feature Maps. 1720-1724 - Fangming Zhong, Guangze Wang, Zhikui Chen, Xu Yuan, Feng Xia:
Multiple-Input Multiple-Output Fusion Network for Generalized Zero-Shot Learning. 1725-1729 - Kun Yan, Lingbo Liu, Jun Hou, Ping Wang:
Representative Local Feature Mining for Few-Shot Learning. 1730-1734 - Zeyang Zhu, Xin Lin:
KAN: Knowledge-Augmented Networks for Few-Shot Learning. 1735-1739 - Kun Yan, Zied Bouraoui, Ping Wang, Shoaib Jameel, Steven Schockaert:
Few-Shot Image Classification with Multi-Facet Prototypes. 1740-1744 - Da Chen, Yuefeng Chen, Yuhong Li, Feng Mao, Yuan He, Hui Xue:
Self-Supervised Learning for Few-Shot Image Classification. 1745-1749 - Chun-Chih Teng, Pin-Yu Chen, Wei-Chen Chiu:
Domain Adaptation for Learning Generator From Paired Few-Shot Data. 1750-1754 - Furen Zhuang, Pierre Moulin:
Deep Semi-Supervised Metric Learning Via Identification of Manifold Memberships. 1755-1759 - Jian Wang, Zhichao Zhang, Dongmei Huang, Wei Song, Quanmiao Wei, Xinyue Li:
A Ranked Similarity Loss Function with Pair Weighting for Deep Metric Learning. 1760-1764 - Ting-Yao Hu, Alexander G. Hauptmann:
Statistical Distance Metric Learning for Image Set Retrieval. 1765-1769 - Yinong Zhu, Yong Feng, Mingliang Zhou, Baohua Qiang, Leong Hou U, Jiajie Zhu:
Distribution-Aware Hierarchical Weighting Method for Deep Metric Learning. 1770-1774 - Sam Sattarzadeh, Mahesh Sudhakar, Konstantinos N. Plataniotis, Jongseong Jang, Yeonjeong Jeong, Hyunwoo Kim:
Integrated Grad-Cam: Sensitivity-Aware Visual Explanation of Deep Convolutional Networks Via Integrated Gradient-Based Scoring. 1775-1779 - Taiga Kashima, Ryuichiro Hataya, Hideki Nakayama:
Visualizing Association in Exemplar-Based Classification. 1780-1784 - Zitang Sun, Ruojing Wang, Zhengbo Luo, Weili Chen:
HFGCNET: High-Frequency Graph Reasoning for Finer Semantic Image Segmentation. 1785-1789 - Hugo Gangloff, Jean-Baptiste Courbot, Emmanuel Monfrini, Christophe Collet:
Unsupervised Image Segmentation with Spatial Triplet Markov Trees. 1790-1794 - Dong Liang, Bin Kang, Xinyu Liu, Han Sun, Liyan Zhang, Ningzhong Liu:
Cross Scene Video Foreground Segmentation Via Co-Occurrence Probability Oriented Supervised and Unsupervised Model Interaction. 1795-1799 - Jianfeng Cao, Hong Yan:
Instance Segmentation with the Number of Clusters Incorporated in Embedding Learning. 1800-1804 - Lianlei Shan, Xiaobin Li, Weiqiang Wang:
Decouple the High-Frequency and Low-Frequency Information of Images for Semantic Segmentation. 1805-1809 - Zhaoxin Fan, Hongyan Liu, Jun He, Min Zhang, Xiaoyong Du:
MPDNet: A 3D Missing Part Detection Network Based on Point Cloud Segmentation. 1810-1814 - Nan Jiang, Xuehui Yu, Xiaoke Peng, Yuqi Gong, Zhenjun Han:
SM+: Refined Scale Match for Tiny Person Detection. 1815-1819 - Weilian Zhou, Sei-ichiro Kamata, Zhengbo Luo:
Sub-Band Grouping Spectral Feature-Attention Block for Hyperspectral Image Classification. 1820-1824 - Erting Pan, Yong Ma, Xiaoguang Mei, Fan Fan, Jiayi Ma:
Unsupervised Stacked Capsule Autoencoder for Hyperspectral Image Classification. 1825-1829 - Ganghui Fan, Yong Ma, Jun Huang, Xiaoguang Mei, Jiayi Ma:
Robust Graph Autoencoder for Hyperspectral Anomaly Detection. 1830-1834 - Xiaomeng Wu, Yongqing Sun, Akisato Kimura, Kunio Kashino:
Reflectance-Oriented Probabilistic Equalization for Image Enhancement. 1835-1839 - Yijun Liu, Zhengning Wang, Yi Zeng, Hao Zeng, Deming Zhao:
PD-GAN: Perceptual-Details GAN for Extremely Noisy Low Light Image Enhancement. 1840-1844 - Dong Wang, Yunpeng Bai, Bendu Bai, Chanyue Wu, Ying Li:
Heterogeneous Two-Stream Network with Hierarchical Feature Prefusion for Multispectral Pan-Sharpening. 1845-1849 - Chong Mou, Jian Zhang:
Synergic Feature Attention for Image Restoration. 1850-1854 - Jingwen Su, Hujun Yin:
Efficient Multi-Objective GANs for Image Restoration. 1855-1859 - Lanqing Guo, Zhiyuan Zha, Saiprasad Ravishankar, Bihan Wen:
Self-Convolution: A Highly-Efficient Operator for Non-Local Image Restoration. 1860-1864 - Fengchao Xiong, Jun Zhou, Minchao Ye, Jianfeng Lu, Yuntao Qian:
NMF-SAE: An Interpretable Sparse Autoencoder for Hyperspectral Unmixing. 1865-1869 - Chao Zhou, Miguel R. D. Rodrigues:
An ADMM Based Network for Hyperspectral Unmixing Tasks. 1870-1874 - Shuaikai Shi, Min Zhao, Lijun Zhang, Jie Chen:
Variational Autoencoders for Hyperspectral Unmixing with Endmember Variability. 1875-1879 - Yaser Esmaeili Salehani, Ehsan Arabnejad, Saeed Gazor:
Augmented Gaussian Linear Mixture Model for Spectral Variability in Hyperspectral Unmixing. 1880-1884 - Qiwen Jin, Yong Ma, Xiaoguang Mei, Hao Li, Jiayi Ma:
UTDN: An Unsupervised Two-Stream Dirichlet-Net for Hyperspectral Unmixing. 1885-1889 - Yi Yang, Fei Jiang, Hongtao Lu:
Laplacian Regularized Tensor Low-Rank Minimization for Hyperspectral Snapshot Compressive Imaging. 1890-1894 - Roy Miles, Krystian Mikolajczyk:
Compressing Local Descriptor Models for Mobile Applications. 1895-1899 - Zhi Chen, Wei Yang, Zhenbo Xu, Zhenbo Shi, Liusheng Huang:
VK-Net: Category-Level Point Cloud Registration with Unsupervised Rotation Invariant Keypoints. 1900-1904 - Bhavesh Deshpande, Sourabh Hanamsheth, Yawen Lu, Guoyu Lu:
Matching as Color Images: Thermal Image Local Feature Detection and Description. 1905-1909 - Viktoria Heimann, Andreas Spruck, André Kaup:
Frame Rate Up-Conversion Using Key Point Agnostic Frequency-Selective Mesh-to-Grid Resampling. 1910-1914 - Jianwei Ke, Alex J. Watras, Jae-Jun Kim, Hewei Liu, Hongrui Jiang, Yu Hen Hu:
Efficient Real-Time Video Stabilization with a Novel Least Squares Formulation. 1915-1919 - Yuan Hou, Annie A. M. Cuyt, Wen-shin Lee, Deepayan Bhowmik:
Decomposing Textures using Exponential Analysis. 1920-1924 - Hoda Roodaki, Masoud Dehyadegari, Mahdi Nazm Bojnordi:
G-Arrays: Geometric Arrays for Efficient Point Cloud Processing. 1925-1929 - Lisha Wang, Chenglin Li, Wenrui Dai, Junni Zou, Hongkai Xiong:
QoE-Driven and Tile-Based Adaptive Streaming for Point Clouds. 1930-1934 - Ashek Ahmmed, Manoranjan Paul, M. Manzur Murshed, David Taubman:
Dynamic Point Cloud Compression Using A Cuboid Oriented Discrete Cosine Based Motion Model. 1935-1939 - Yangang Cai, Ronggang Wang, Song Gu, Jian Zhang, Wen Gao:
An Adaptive Pyramid Single-View Depth Lookup Table Coding Method. 1940-1944 - Marta Milovanovic, Félix Henry, Marco Cagnazzo, Joël Jung:
Patch Decoder-Side Depth Estimation In MPEG Immersive Video. 1945-1949 - Hongyan Quan, Mingwei Yao, Xiaoxiao Qian:
Geometry Consistency Of Augmented Reality Based On Semantics. 1950-1954 - Tong Zhou, Kun Tian:
What And Where To Focus In Person Search. 1955-1959 - Ning Lv, Xuezhi Xiang, Xinyao Wang, Jie Yang, Rokia Abdein, Abdulmotaleb El-Saddik:
Stable and Effective One-Step Method for Person Search. 1960-1964 - Xi-Peng Lin, Yu-Bin Yang:
An Adaptive Part-Based Model For Person Re-Identification. 1965-1969 - Yukang Gao, Hua Yang:
Crowd Counting Via Multi-Level Regression With Latent Gaussian Maps. 1970-1974 - Ye Tian, Chengzhen Duan, Ruilin Zhang, Zhiwei Wei, Hongpeng Wang:
Lightweight Dual-Task Networks For Crowd Counting In Aerial Images. 1975-1979 - Siyang Pan, Yanyun Zhao, Fei Su, Zhicheng Zhao:
SANet++: Enhanced Scale Aggregation with Densely Connected Feature Fusion for Crowd Counting. 1980-1984 - Zehao Chen, Hua Yang:
Attentive Semantic Exploring for Manipulated Face Detection. 1985-1989 - Bin Cheng, Tao Dai, Bin Chen, Shutao Xia, Xiu Li:
Efficient Face Manipulation Via Deep Feature Disentanglement And Reintegration Net. 1990-1994 - Seogkyu Jeon, Pilhyeon Lee, Kibeom Hong, Hyeran Byun:
Continuous Face Aging Generative Adversarial Networks. 1995-1999 - Nicky Bayat, Vahid Reza Khazaie, Yalda Mohsenzadeh:
Fast Inverse Mapping of Face GANs. 2000-2004 - Jingwei Yan, Boyuan Jiang, Jingjing Wang, Qiang Li, Chunmao Wang, Shiliang Pu:
Multi-Level Adaptive Region of Interest and Graph Learning for Facial Action Unit Recognition. 2005-2009 - Meimei Shang, Fei Gao, Xiang Li, Jingjie Zhu, Lingna Dai:
Bridging Unpaired Facial Photos and Sketches by Line-Drawings. 2010-2014 - Xinwei Xue, Ying Ding, Long Ma, Yi Wang, Risheng Liu, Xin Fan:
Temporal Rain Decomposition with Spatial Structure Guidance for Video Deraining. 2015-2019 - Xinwei Xue, Xiangyu Meng, Long Ma, Risheng Liu, Xin Fan:
GTA-Net: Gradual Temporal Aggregation Network for Fast Video Deraining. 2020-2024 - Zhen Wang, Cong Wang, Zhixun Su, Junyang Chen:
Dense Feature Pyramid Grids Network for Single Image Deraining. 2025-2029 - Youzhao Yang, Hong Lu:
A Fast and Efficient Network for Single Image Deraining. 2030-2034 - Dongdong Ren, Jinbao Li, Meng Han, Minglei Shu:
DNANet: Dense Nested Attention Network for Single Image Dehazing. 2035-2039 - Cong Wang, Yan Huang, Yuexian Zou, Yong Xu:
FWB-Net: Front White Balance Network for Color Shift Correction in Single Image Dehazing Via Atmospheric Light Estimation. 2040-2044 - Tobias Alt, Joachim Weickert:
Learning Integrodifferential Models for Image Denoising. 2045-2049 - Huy Vu, Gene Cheung, Yonina C. Eldar:
Unrolling of Deep Graph Total Variation for Image Denoising. 2050-2054 - Yanghao Li, Bichuan Guo, Jiangtao Wen, Zhen Xia, Shan Liu, Yuxing Han:
Learning Model-Blind Temporal Denoisers without Ground Truths. 2055-2059 - Hangfan Liu, Jian Zhang, Chong Mou:
Image Denoising Based on Correlation Adaptive Sparse Modeling. 2060-2064 - Xiaokun Liu, Long Ma, Risheng Liu, Wei Zhong, Xin Fan, Zhongxuan Luo:
NASA: A Noise-Adaptive and Structure-Aware Learning Framework for Image Deblurring. 2065-2069 - Chen Li, Qi Wang, Shaoteng Liu, Xuelong Li:
Multiple Auxiliary Networks for Single Blind Image Deblurring. 2070-2074 - Xiangfei Liu, Xiushan Nie, Zhen Shen, Yilong Yin:
Joint Learning of Image Aesthetic Quality Assessment and Semantic Recognition Based on Feature Enhancement. 2075-2079 - Junming Chen, Haiqiang Wang, Ge Li, Shan Liu:
Nested Error Map Generation Network for No-Reference Image Quality Assessment. 2080-2084 - Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik:
Regression or classification? New methods to evaluate no-reference picture and video quality models. 2085-2089 - Ci Wang, Mei Li:
Blind Image Quality Evaluator with Scale Robustness. 2090-2094 - Yingjie Feng, Sumei Li, Yongli Chang:
Multi-Scale Feature-Guided Stereoscopic Video Quality Assessment Based on 3D Convolutional Neural Network. 2095-2099 - Fan Meng, Sumei Li, Yongli Chang:
No-Reference Stereoscopic Image Quality Assessment Based on the Human Visual System. 2100-2104 - Yuxing Wang, Yawen Lu, Guoyu Lu:
Stereo Rectification Based on Epipolar Constrained Neural Network. 2105-2109 - Xiaogang Jia, Wei Chen, Zhengfa Liang, Xin Luo, Mingfei Wu, Yusong Tan, Libo Huang:
Multi-Scale Cascade Disparity Refinement Stereo Network. 2110-2114 - Jun Peng, Wangduo Xie, Zijing Huang, Wei Chen, Yong Zhao:
Hierarchical Context Guided Aggregation Network for Stereo Matching. 2115-2119 - Shenglun Chen, Baopu Li, Wei Wang, Hong Zhang, Haojie Li, Zhihui Wang:
Cost Affinity Learning Network for Stereo Matching. 2120-2124 - Naga Sailaja Mahankali, Sumohana S. Channappayya:
Video Quality Prediction Using Voxel-Wise fMRI Models of the Visual Cortex. 2125-2129 - Jianfu Zhang, Zerui Tao, Liqing Zhang, Qibin Zhao:
Tensor Decomposition Via Core Tensor Networks. 2130-2134 - Katrin Renz, Nicolaj C. Stache, Samuel Albanie, Gül Varol:
Sign Language Segmentation with Temporal Convolutional Networks. 2135-2139 - Shuyi Li, Bob Zhang:
An Adaptive Discriminant and Sparsity Feature Descriptor for Finger Vein Recognition. 2140-2144 - Zhizhong Huang, Junping Zhang, Hongming Shan:
Routinggan: Routing Age Progression and Regression with Disentangled Learning. 2145-2149 - Zongyao Li, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Semantic-Aware Unpaired Image-to-Image Translation for Urban Scene Images. 2150-2154 - Rakshith S, Rishabh Khurana, Vibhav Agarwal, Jayesh Rajkumar Vachhani, Bhanodai Guggilla:
Fontnet: On-Device Font Understanding and Prediction Pipeline. 2155-2159 - Viet-Khoa Vo-Ho, Ngan Le, Kashu Yamazaki, Akihiro Sugimoto, Minh-Triet Tran:
Agent-Environment Network for Temporal Action Proposal Generation. 2160-2164 - Zhaoyang Gui, Shanshan Zhang, Kangkan Wang, Jian Yang, Pong Chi Yuen:
Adaptive Multi-Domain Learning for Outdoor 3d Human Pose and Shape Estimation. 2165-2169 - Zhe Zhang, Jie Tang, Gangshan Wu:
Lightweight Human Pose Estimation under Resource-Limited Scenes. 2170-2174 - Jie Mei, Jenq-Neng Hwang, Suzanne Romain, Craig S. Rose, Braden Moore, Kelsey Magrane:
Absolute 3d Pose Estimation and Length Measurement of Severely Deformed Fish from Monocular Videos in Longline Fishing. 2175-2179 - Yuzhuo Ren, Feng Hu:
Camera Calibration with Pose Guidance. 2180-2184 - Rishi Rajesh Shah, Vyas Anirudh Akundy, Zhou Wang:
Real Versus Fake 4k - Authentic Resolution Assessment. 2185-2189 - Wenhan Zhu, Guangtao Zhai, Xiongkuo Min, Xiaokang Yang, Xiao-Ping Zhang:
Perceptual Quality Assessment for Recognizing True and Pseudo 4k Content. 2190-2194 - Li Liu, Da Chen, Minglei Shu, Huazhong Shu, Laurent D. Cohen:
A New Tubular Structure Tracking Algorithm Based On Curvature-Penalized Perceptual Grouping. 2195-2199 - Sibo Wang, Ruize Han, Wei Feng, Song Wang:
Multiple Human Tracking in Non-Specific Coverage with Wearable Cameras. 2200-2204 - Chaoyi Wang, Yang Hua, Tao Song, Zhengui Xue, Ruhui Ma, Neil Robertson, Haibing Guan:
Fine-Grained Pose Temporal Memory Module for Video Pose Estimation and Tracking. 2205-2209 - Minghao Yang, Xukang Zhou, Yangchang Sun, Jinglong Chen, Baohua Qiang:
Drawing Order Recovery from Trajectory Components. 2210-2214 - Na Lv, Ying Wang, Zhiquan Feng, Jingliang Peng:
Deep Hashing for Motion Capture Data Retrieval. 2215-2219 - Liqi Yan, Yiming Cui, Yingjie Victor Chen, Dongfang Liu:
Hierarchical Attention Fusion for Geo-Localization. 2220-2224 - Souvik Kundu, Sairam Sundaresan:
AttentionLite: Towards Efficient Self-Attention Models for Vision. 2225-2229 - Shannan Chen, Qiule Sun, Cunhua Li, Jianxin Zhang, Qiang Zhang:
Attention-Guided Second-Order Pooling Convolutional Networks. 2230-2234 - Qing-Long Zhang, Yu-Bin Yang:
SA-Net: Shuffle Attention for Deep Convolutional Neural Networks. 2235-2239 - Reshmi S. Bhooshan, Suresh K:
An Attention Based Wavelet Convolutional Model for Visual Saliency Detection. 2240-2244 - Shuang Wang, Yun Meng, Yu Gu, Lei Zhang, Xiutiao Ye, Jingxian Tian, Licheng Jiao:
Cascade Attention Fusion for Fine-Grained Image Captioning Based on Multi-Layer LSTM. 2245-2249 - Jinpeng Wang, Bin Chen, Tao Dai, Shu-Tao Xia:
Webly Supervised Deep Attentive Quantization. 2250-2254 - Leena Mathur, Maja J. Mataric:
Unsupervised Audio-Visual Subspace Alignment for High-Stakes Deception Detection. 2255-2259 - Wen-Feng Pang, Qian-Hua He, Yongjian Hu, Yan-Xiong Li:
Violence Detection in Videos Based on Fusing Visual and Audio Information. 2260-2264 - Andreea-Maria Oncescu, João F. Henriques, Yang Liu, Andrew Zisserman, Samuel Albanie:
QUERYD: A Video Dataset with High-Quality Text and Audio Narrations. 2265-2269 - Alkesh Patel, Akanksha Bindal, Hadas Kotek, Christopher Klein, Jason D. Williams:
Generating Natural Questions from Images for Multimodal Assistants. 2270-2274 - Jialang Xu, Yang Luo, Xinyue Chen, Chunbo Luo:
An Adaptive Multi-Scale and Multi-Level Features Fusion Network with Perceptual Loss for Change Detection. 2275-2279 - Samuel Albanie, Gül Varol, Liliane Momeni, Triantafyllos Afouras, Andrew Brown, Chuhan Zhang, Ernesto Coto, Necati Cihan Camgöz, Ben Saunders, Abhishek Dutta, Neil Fox, Richard Bowden, Bencie Woll, Andrew Zisserman:
SeeHear: Signer Diarisation and a New Dataset. 2280-2284 - Kai Katsumata, Hideki Nakayama:
Semantic Image Synthesis from Inaccurate and Coarse Masks. 2285-2289 - Yuan Chang, Yisong Chen, Guoping Wang:
Range Guided Depth Refinement and Uncertainty-Aware Aggregation for View Synthesis. 2290-2294 - Yuan Chang, Tao Peng, Ruhan He, Xinrong Hu, Junping Liu, Zili Zhang, Minghua Jiang:
DP-VTON: Toward Detail-Preserving Image-Based Virtual Try-on Network. 2295-2299 - Dónal Egan, Martin Alain, Aljosa Smolic:
Light Field Style Transfer with Local Angular Consistency. 2300-2304 - Kai Deng, Kun Zhang, Ping Yao, Siyuan Cheng, Peng He:
Skip Attention GAN for Remote Sensing Image Synthesis. 2305-2309 - Libao Zhang, Yanan Liu:
Image Generation Based on Texture Guided VAE-AGAN for Regions of Interest Detection in Remote Sensing Images. 2310-2314 - Qihang Yang, Tao Chen, Jiayuan Fan, Ye Lu, Chongyan Zuo, Qinghua Chi:
EADNet: Efficient Asymmetric Dilated Network For Semantic Segmentation. 2315-2319 - Binjie Mao, Lingfeng Wang, Shiming Xiang, Chunhong Pan:
Ltaf-Net: Learning Task-Aware Adaptive Features and Refining Mask for Few-Shot Semantic Segmentation. 2320-2324 - Hanlin Chen, Qingyong Hu, Jungang Yang, Jing Wu, Yulan Guo:
Cgan-Net: Class-Guided Asymmetric Non-Local Network for Real-Time Semantic Segmentation. 2325-2329 - Kuntao Cao, Xi Huang, Jie Shao:
Aggregation Architecture and all-to-one Network for Real-Time Semantic Segmentation. 2330-2334 - Dong Liang, Yun Du, Han Sun, Liyan Zhang, Ningzhong Liu, Mingqiang Wei:
Nlkd: Using Coarse Annotations For Semantic Segmentation Based on Knowledge Distillation. 2335-2339 - Shengjia Chen, Zhixin Li, Xiwei Yang:
Knowledge Reasoning for Semantic Segmentation. 2340-2344 - Yaxi Yang, Hailin Wang, Haiquan Qiu, Jianjun Wang, Yao Wang:
Non-Convex Sparse Deviation Modeling Via Generative Models. 2345-2349 - Xin Yang, Chunling Yang:
Imrnet: An Iterative Motion Compensation and Residual Reconstruction Network for Video Compressed Sensing. 2350-2354 - Jeong-Won Ha, Jun-Sang Yoo, Jong-Ok Kim:
Deep Color Constancy Using Temporal Gradient Under Ac Light Sources. 2355-2359 - Ronan Fablet, Lucas Drumetz, François Rousseau:
End-to-End Learning of Variational Models and Solvers for the Resolution of Interpolation Problems. 2360-2364 - Fengyin Cao, Ping An, Xinpeng Huang, Chao Yang, Qiang Wu:
Multi-Models Fusion for Light Field Angular Super-Resolution. 2365-2369 - Zhun Sun, Chao Li, Qibin Zhao:
Hide Chopin in the Music: Efficient Information Steganography Via Random Shuffling. 2370-2374 - Yi Zhang, Wei Yang, Zhenbo Xu, Yingjie Li, Zhi Chen, Liusheng Huang:
Pointer Networks for Arbitrary-Shaped Text Spotting. 2375-2379 - Longjiao Zhao, Yu Wang, Jien Kato:
Rotation Invariance Analysis of Local Convolutional Features in Image Retrieval. 2380-2384 - Atharva Kadethankar, Neelam Sinha, Vinayaka Hegde, Abhishek Burman:
Signature Feature Marking Enhanced IRM Framework for Drone Image Analysis in Precision Agriculture. 2385-2389 - Yanting Zhang, Aotian Zheng, Ke Han, Yizhou Wang, Jenq-Neng Hwang:
Vehicle 3d Localization in Road Scenes Via a Monocular Moving Camera. 2390-2394 - Teresa White, Jesse Wheeler, Colton Lindstrom, Randall Christensen, Kevin R. Moon:
Gps-Denied Navigation Using Sar Images And Neural Networks. 2395-2399 - Binyu Zhao, Qianqian Ren, Jinbao Li, Yafeng Zhao:
Attention-Embedded Decomposed Network with Unpaired CT Images Prior for Metal Artifact Reduction. 2400-2404 - Houshun Yu, Li Zhang:
Partial Feature Aggregation Network for Real-Time Object Counting. 2405-2409 - David A. Maluf, Amr Elnakeeb, Matt Silverman:
A Bayesian Inference Approach for Location-Based Micro Motions using Radio Frequency Sensing. 2410-2414 - Yuheng Deng, Wenjun Zhou, Bo Peng, Dong Liang, Shun'ichi Kaneko:
Robust Spatial-Temporal Correlation Model for Background Initialization in Severe Scene. 2415-2419 - Lei Gao, Lin Qi, Ling Guan:
2D-FRFT Based Frequency Shift-Invariant Digital Image Encryption. 2420-2424 - Akshay Kapoor, Jatin Sapra, Zhou Wang:
Capturing Banding in Images: Database Construction and Objective Assessment. 2425-2429 - Qier An, Yuan Shen:
On The Camera Position Dithering In Visual 3d Reconstruction. 2430-2434 - Liyu Wu, Yuexian Zou, Can Zhang:
Long-Short Temporal Modeling for Efficient Action Recognition. 2435-2439 - Bohong Yang, Zijian Wang, Wu Ran, Hong Lu, Yi-Ping Phoebe Chen:
Multi-Directional Convolution Networks with Spatial-Temporal Feature Pyramid Module for Action Recognition. 2440-2444 - Xiaohang Yang, Lingtong Kong, Jie Yang:
Unsupervised Motion Representation Enhanced Network for Action Recognition. 2445-2449 - Wei Wu, Jiale Yu:
An Improved Deep Relation Network for Action Recognition in Still Images. 2450-2454 - Zichen Yang, Di Huang, Jie Qin, Yunhong Wang:
Human-Aware Coarse-to-Fine Online Action Detection. 2455-2459 - Ranyu Ning, Can Zhang, Yuexian Zou:
SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection. 2460-2464 - Zhilin Huang, Chujun Qin, Ruixin Liu, Zhenyu Weng, Yuesheng Zhu:
Semantic-Aware Context Aggregation for Image Inpainting. 2465-2469 - Xue Zhou, Tao Dai, Yong Jiang, Shu-Tao Xia:
Bishift-Net for Image Inpainting. 2470-2474 - Lingtong Kong, Xiaohang Yang, Jie Yang:
OAS-Net: Occlusion Aware Sampling Network for Accurate Optical Flow. 2475-2479 - Yingjie Li, Wei Yang, Zhenbo Xu, Zhi Chen, Zhenbo Shi, Yi Zhang, Liusheng Huang:
Mask4D: 4D Convolution Network for Light Field Occlusion Removal. 2480-2484 - Jianrong Wang, Ge Zhang, Zhenyu Wu, Xuewei Li, Li Liu:
Self-Supervised Depth Estimation Via Implicit Cues from Videos. 2485-2489 - Cho-Ying Wu, Ulrich Neumann:
Scene Completeness-Aware Lidar Depth Completion for Driving Scenario. 2490-2494 - Bahram Lavi, José Nascimento, Anderson Rocha:
Semi-Supervised Feature Embedding for Data Sanitization in Real-World Events. 2495-2499 - Shu Hu, Yuezun Li, Siwei Lyu:
Exposing GAN-Generated Faces Using Inconsistent Corneal Specular Highlights. 2500-2504 - Jiaxin Chen, Xin Liao, Wei Wang, Zheng Qin:
A Features Decoupling Method for Multiple Manipulations Identification in Image Operation Chains. 2505-2509 - Pavel Korshunov, Sébastien Marcel:
Subjective and Objective Evaluation of Deepfake Videos. 2510-2514 - Alexander Schlögl, Tobias Kupek, Rainer Böhme:
Forensicability of Deep Neural Network Inference Pipelines. 2515-2519 - Jianhui Xie, Song Liu, Ruixin Liu, Yinghong Zhang, Yuesheng Zhu:
SERN: Stance Extraction and Reasoning Network for Fake News Detection. 2520-2524 - Yuhao Sun, Xin Liao, Jianfeng Liu:
An Efficient Paper Anti-Counterfeiting Method Based on Microstructure Orientation Estimation. 2525-2529 - Irene Amerini, Aris Anagnostopoulos, Luca Maiano, Lorenzo Ricciardi Celsi:
Learning Double-Compression Video Fingerprints Left From Social-Media Platforms. 2530-2534 - Chiara Albisani, Massimo Iuliani, Alessandro Piva:
Checking PRNU Usability on Modern Devices. 2535-2539 - Thomas Thebaud, Gaël Le Lan, Anthony Larcher:
Handwritten Digits Reconstruction from Unlabelled Embeddings. 2540-2544 - Samet Taspinar, Manoranjan Mohanty, Nasir D. Memon:
Effect of Video Pixel-Binning on Source Attribution of Mixed Media. 2545-2549 - Lingling Lv, Youjun Xiang, Xianfeng Li, Hanye Huang, Rongju Ruan, Xiaoyan Xu, Yuli Fu:
Combining Dynamic Image and Prediction Ensemble for Cross-Domain Face Anti-Spoofing. 2550-2554 - Mingzhu Ma, Gongping Yang, Kuikui Wang, Yuwen Huang, Yilong Yin:
Label-Guided Dictionary Pair Learning for ECG Biometric Recognition. 2555-2559 - Tongqing Zhai, Yiming Li, Ziqi Zhang, Baoyuan Wu, Yong Jiang, Shu-Tao Xia:
Backdoor Attack Against Speaker Verification. 2560-2564 - Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich:
Class-Conditional Defense GAN Against End-To-End Speech Attacks. 2565-2569 - Yiqun Liu, Yi Zeng, Jian Pu, Hongming Shan, Peiyang He, Junping Zhang:
Selfgait: A Spatiotemporal Representation Learning Method for Self-Supervised Gait Recognition. 2570-2574 - Weiyi Zhang, Shuning Zhao, Le Liu, Jianmin Li, Xingliang Cheng, Thomas Fang Zheng, Xiaolin Hu:
Attack on Practical Speaker Verification System Using Universal Adversarial Perturbations. 2575-2579 - Heinz Hofbauer, Yoanna Martínez-Díaz, Simon Kirchgasser, Heydi Méndez-Vázquez, Andreas Uhl:
Highly Efficient Protection of Biometric Face Samples with Selective JPEG2000 Encryption. 2580-2584 - Hatef Otroshi-Shahreza, Sébastien Marcel:
Deep Auto-Encoding and Biohashing for Secure Finger Vein Recognition. 2585-2589 - Jinzhu Yang, Wei Zhou, Wanhui Qian, Jizhong Han, Songlin Hu:
Topic Sequence Embedding for User Identity Linkage from Heterogeneous Behavior Data. 2590-2594 - Daniele Mari, Samuele Giuliano Piazzetta, Sara Bordin, Luca Pajola, Sebastiano Verde, Simone Milani, Mauro Conti:
Looking Through Walls: Inferring Scenes from Video-Surveillance Encrypted Traffic. 2595-2599 - Zhanjiang Chen, H. Vicky Zhao:
Optimal Attacking Strategy Against Online Reputation Systems with Consideration of the Message-Based Persuasion Phenomenon. 2600-2604 - Mohammad Adiban, Arash Safari, Giampiero Salvi:
STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security. 2605-2609 - Michele Cirillo, Mario Di Mauro, Vincenzo Matta, Marco Tambasco:
Application-Layer DDOS Attacks with Multiple Emulation Dictionaries. 2610-2614 - Henri Hentilä, Yanina Y. Shkel, Visa Koivunen:
Secret Key Generation Over Wireless Channels using short Blocklength Multilevel Source Polar Coding. 2615-2619 - Zhifan Xu, Melike Baykal-Gürsoy:
Efficient Network Protection Games Against Multiple Types Of Strategic Attackers. 2620-2624 - Jinyuan Jia, Zheng Dong, Jie Li, Jack W. Stokes:
Detection Of Malicious DNS and Web Servers using Graph-Based Approaches. 2625-2629 - Mengdi Wang, Di Xiao, Jia Liang:
Low Complexity Secure P-Tensor Product Compressed Sensing Reconstruction Outsourcing and Identity Authentication in Cloud. 2630-2634 - Behrooz Razeghi, Sohrab Ferdowsi, Dimche Kostadinov, Flávio P. Calmon, Slava Voloshynovskiy:
Privacy-Preserving near Neighbor Search via Sparse Coding with Ambiguation. 2635-2639 - Zuobin Ying, Shuanglong Cao, Shengmin Xu, Ximeng Liu, Lingjuan Lyu, Cen Chen, Li Wang:
Privacy-Preserving Optimal Insulin Dosing Decision. 2640-2644 - Yulu Jin, Lifeng Lai:
Privacy-Accuracy Trade-Off of Inference as Service. 2645-2649 - Muah Kim, Onur Günlü, Rafael F. Schaefer:
Federated Learning with Local Differential Privacy: Trade-Offs Between Privacy, Utility, and Communication. 2650-2654 - Amin Aminifar, Fazle Rabbi, Yngve Lamo:
Scalable Privacy-Preserving Distributed Extremely Randomized Trees for Structured Data With Multiple Colluding Parties. 2655-2659 - Ecenaz Erdemir, Pier Luigi Dragotti, Deniz Gündüz:
Active Privacy-Utility Trade-Off Against A Hypothesis Testing Adversary. 2660-2664 - Bhanuka Gamage, Adnan Labib, Aisha Joomun, Chern Hong Lim, KokSheik Wong:
Baitradar: A Multi-Model Clickbait Detection Algorithm Using Deep Learning. 2665-2669 - Xiangyu Wang, Jianfeng Ma, Ximeng Liu:
Enabling Efficient and Expressive Spatial Keyword Queries On Encrypted Data. 2670-2674 - Shangyu Xie, Bingyu Liu, Yuan Hong:
Privacy-Preserving Cloud-Based DNN Inference. 2675-2679 - Avital Shafran, Gil Segev, Shmuel Peleg, Yedid Hoshen:
Crypto-Oriented Neural Architecture Design. 2680-2684 - Seok-Jun Bu, Sung-Bae Cho:
Integrating Deep Learning with First-Order Logic Programmed Constraints for Zero-Day Phishing Attack Detection. 2685-2689 - Haibo Cheng, Wenting Li, Ping Wang, Kaitai Liang:
Improved Probabilistic Context-Free Grammars for Passwords Using Word Extraction. 2690-2694 - Tingting Song, Minglin Liu, Weiqi Luo, Peijia Zheng:
Enhancing Image Steganography Via Stego Generation And Selection. 2695-2699 - Shengbei Wang, Weitao Yuan, Zhen Zhang, Jianming Wang, Masashi Unoki:
Synchronous Multi-Bit Audio Watermarking Based on Phase Shifting. 2700-2704 - Xinghong Qin, Shunquan Tan, Weixuan Tang, Bin Li, Jiwu Huang:
Image Steganography Based on Iterative Adversarial Perturbations Onto a Synchronized-Directions Sub-Image. 2705-2709 - Jan Butora, Jessica J. Fridrich:
Extending the Reverse JPEG Compatibility Attack to Double Compressed Images. 2710-2714 - Yuxuan Huang, Xin Cao, Hao-Tian Wu, Yiu-ming Cheung:
Reversible Data Hiding in Jpeg Images for Privacy Protection. 2715-2719 - Xiaoqing Jia, Jie Wang, Yongliang Liu, Xiangui Kang, Yun-Qing Shi:
A Layered Embedding-Based Scheme to Cope with Intra-Frame Distortion Drift In IPM-Based HEVC Steganography. 2720-2724 - Zejiang Hou, Anwar Walid, Sun-Yuan Kung:
Meta-Learning with Attention for Improved Few-Shot Learning. 2725-2729 - Anish Madan, Ranjitha Prasad:
B-Small: A Bayesian Neural Network Approach to Sparse Model-Agnostic Meta-Learning. 2730-2734 - Wen Tang, Emilie Chouzenoux, Jean-Christophe Pesquet, Hamid Krim:
Deep Transform and Metric Learning Networks. 2735-2739 - Pengchao Han, Jihong Park, Shiqiang Wang, Yejun Liu:
Robustness and Diversity Seeking Data-Free Knowledge Distillation. 2740-2744 - Yassir Fathullah, Mark J. F. Gales, Andrey Malinin:
Ensemble Distillation Approaches for Grammatical Error Correction. 2745-2749 - Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Erfan Loweimi, Peter Bell, Steve Renals:
Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers. 2750-2754 - Antônio H. Ribeiro, Thomas B. Schön:
How Convolutional Neural Networks Deal with Aliasing. 2755-2759 - Tianyou Chen, Xiaoguang Hu, Jin Xiao, Guofeng Zhang, Hui Ruan:
Canet: Context-Aware Loss for Descriptor Learning. 2760-2764 - Yan Zhang, Binyu He, Li Sun, Qingli Li:
Progressive Multi-Stage Feature Mix for Person Re-Identification. 2765-2769 - Vivek Sivaraman Narayanaswamy, Jayaraman J. Thiagarajan, Andreas Spanias:
Using Deep Image Priors to Generate Counterfactual Explanations. 2770-2774 - Hojatollah Zamani, Peyman Rostami, Arash Amini, Farokh Marvasti:
Elliptical Shape Recovery from Blurred Pixels Using Deep Learning. 2775-2779 - Eran Goldman, Jacob Goldberger:
Factorized CRF with Batch Normalization Based on the Entire Training Data. 2780-2784 - Zhenhua Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao:
Evolutionary Quantization of Neural Networks with Mixed-Precision. 2785-2789 - Yong Wang, Xiaojing Wang, Xiaoyu He:
Evolving Quantized Neural Networks for Image Classification Using A Multi-Objective Genetic Algorithm. 2790-2794 - Bochen Guan, Jinnian Zhang, William A. Sethares, Richard Kijowski, Fang Liu:
Spectral Domain Convolutional Neural Network. 2795-2799 - Luke Wood, Eric C. Larson:
Parametric Spectral Filters for Fast Converging, Scalable Convolutional Neural Networks. 2800-2804 - Xinyue Liang, Mikael Skoglund, Saikat Chatterjee:
Feature Reuse for a Randomization Based Neural Network. 2805-2809 - Alireza M. Javid, Sandipan Das, Mikael Skoglund, Saikat Chatterjee:
A ReLU Dense Layer to Improve the Performance of Neural Networks. 2810-2814 - Raphaël Achddou, J. Matías Di Martino, Guillermo Sapiro:
Nested Learning for Multi-Level Classification. 2815-2819 - Yu Wang, Shenjie Zhao:
Cross-Modal Representation Reconstruction for Zero-Shot Classification. 2820-2824 - Jisheng Dang, Jun Yang:
HIGCNN: Hierarchical Interleaved Group Convolutional Neural Networks for Point Clouds Analysis. 2825-2829 - Bo Zhang, Wenfeng Li, Qingyuan Li, Weiji Zhuang, Xiangxiang Chu, Yujun Wang:
AutoKWS: Keyword Spotting with Differentiable Architecture Search. 2830-2834 - Yubin Ge, Site Li, Xuyang Li, Fangfang Fan, Wanqing Xie, Jane You, Xiaofeng Liu:
Embedding Semantic Hierarchy in Discrete Optimal Transport for Risk Minimization. 2835-2839 - Panagiotis A. Traganitis, Georgios B. Giannakis:
Identifying Spammers to Boost Crowdsourced Classification. 2840-2844 - Shenfei Pei, Feiping Nie, Rong Wang, Xuelong Li:
A Rank-Constrained Clustering Algorithm with Adaptive Embedding. 2845-2849 - Yulan Deng, Lunke Fei, Shaohua Teng, Wei Zhang, Dongning Liu, Yan Hou:
Towards Efficient Age Estimation by Embedding Potential Gender Features. 2850-2854 - Ismail R. Alkhouri, George K. Atia:
Adversarial Attacks on Coarse-to-Fine Classifiers. 2855-2859 - Xiang Liu, Naiqi Li, Shu-Tao Xia:
GDTW: A Novel Differentiable DTW Loss for Time Series Tasks. 2860-2864 - Illya Degtyarenko, Ivan Deriuga, Andrii Grygoriev, Serhii Polotskyi, Volodymyr Melnyk, Dmytro Zakharchuk, Olga Radyvonenko:
Hierarchical Recurrent Neural Network for Handwritten Strokes Classification. 2865-2869 - Wenyu Zhang, Mohamed Ragab, Ramón Sagarna:
Robust Domain-Free Domain Generalization with Class-Aware Alignment. 2870-2874 - Swatantra Kafle, Geethu Joseph, Pramod K. Varshney:
One-Bit Compressed Sensing Using Untrained Network Prior. 2875-2879 - Rong Fu, Vincent Monardo, Tianyao Huang, Yimin Liu:
Deep Unfolding Network for Block-Sparse Signal Recovery. 2880-2884 - Wei Pu, Chao Zhou, Yonina C. Eldar, Miguel R. D. Rodrigues:
REST: Robust lEarned Shrinkage-Thresholding Network Taming Inverse Problems with Model Mismatch. 2885-2889 - Bahareh Tolooshams, Satish Mulleti, Demba E. Ba, Yonina C. Eldar:
Unfolding Neural Networks for Compressive Multichannel Blind Deconvolution. 2890-2894 - Vinayak Killedar, Praveen Kumar Pokala, Chandra Sekhar Seelamantula:
Sparsity Driven Latent Space Sampling for Generative Prior Based Compressive Sensing. 2895-2899 - Anurag Das, Seyedhooman Sajjadi, Bobak Mortazavi, Theodora Chaspari, Projna Paromita, Laura Ruebush, Nicolaas E. P. Deutz, Ricardo Gutierrez-Osuna:
A Sparse Coding Approach to Automatic Diet Monitoring with Continuous Glucose Monitors. 2900-2904 - Ouafae Karmouda, Jérémie Boulanger, Rémy Boyer:
Speeding Up of Kernel-Based Learning for High-Order Tensors. 2905-2909 - Le Trung Thanh, Karim Abed-Meraim, Nguyen Linh Trung, Adel Hafiane:
A Fast Randomized Adaptive CP Decomposition For Streaming Tensors. 2910-2914 - Athanasios A. Rontogiannis, Paris V. Giampouras, Eleftherios Kofidis:
Rank-Revealing Block-Term Decomposition for Tensor Completion. 2915-2919 - Kriton Konstantinidis, Shengxi Li, Danilo P. Mandic:
Kernel Learning with Tensor Networks. 2920-2924 - Wenqiang Pu, Shahana Ibrahim, Xiao Fu, Mingyi Hong:
Fiber-Sampled Stochastic Mirror Descent for Tensor Decomposition with β-Divergence. 2925-2929 - Ruyuan Qu, Jiaqi He, Hui Feng, Chongbin Xu, Bo Hu:
Regularized Recovery by Multi-Order Partial Hypergraph Total Variation. 2930-2934 - Zhe Feng, Jie Tang, Yishun Dou, Gangshan Wu:
Learning Discriminative Features for Semi-Supervised Anomaly Detection. 2935-2939 - Jiaxiang Tang, Xiang Gao, Wei Hu:
RGLN: Robust Residual Graph Learning Networks via Similarity-Preserving Mapping on Graphs. 2940-2944 - Eric Sun, Liang Lu, Zhong Meng, Yifan Gong:
Sequence-Level Self-Teaching Regularization. 2945-2949 - Sina Alemohammad, Hossein Babaei, Randall Balestriero, Matt Y. Cheung, Ahmed Imtiaz Humayun, Daniel LeJeune, Naiming Liu, Lorenzo Luzi, Jasper Tan, Zichao Wang, Richard G. Baraniuk:
Wearing A Mask: Compressed Representations of Variable-Length Sequences Using Recurrent Neural Tangent Kernels. 2950-2954 - Naiqi Li, Yinghua Gao, Wenjie Li, Yong Jiang, Shu-Tao Xia:
H-GPR: A Hybrid Strategy for Large-Scale Gaussian Process Regression. 2955-2959 - Laia Amorós, Mikko Pitkänen:
Learning Optimal Lattice Codes for MIMO Communications. 2960-2964 - Alexandre Bittar, Philip N. Garner:
A Bayesian Interpretation of the Light Gated Recurrent Unit. 2965-2969 - Charles Séjourné, Romain Couillet, Pierre Comon:
A Large-Dimensional Analysis of Symmetric SNE. 2970-2974 - Alec Koppel, Amrit S. Bedi, Vikram Krishnamurthy:
A Dynamical Systems Perspective on Online Bayesian Nonparametric Estimators with Adaptive Hyperparameters. 2975-2979 - Zixiao Zong, Yanning Shen:
Online Multi-Hop Information Based Kernel Learning Over Graphs. 2980-2984 - Nikos Tsilivis, Anastasios Tsiamis, Petros Maragos:
Sparsity in Max-Plus Algebra and Applications in Multivariate Convex Regression. 2985-2989 - Jose Agustin Barrachina, Chenfang Ren, Christèle Morisseau, Gilles Vieillard, Jean Philippe Ovarlez:
Complex-Valued Vs. Real-Valued Neural Networks for Classification Perspectives: An Example on Non-Circular Data. 2990-2994 - Raphaël Olivier, Bhiksha Raj, Muhammad Shah:
High-Frequency Adversarial Defense for Speech and Audio. 2995-2999 - Jie Pu, Yannis Panagakis, Maja Pantic:
Learning Separable Time-Frequency Filterbanks for Audio Classification. 3000-3004 - Jordi Pons, Santiago Pascual, Giulio Cengarle, Joan Serrà:
Upsampling Artifacts in Neural Audio Synthesis. 3005-3009 - Kleanthis Avramidis, Agelos Kratimenos, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos:
Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms. 3010-3014 - Ke Chen, Beici Liang, Xiaoshuan Ma, Minwei Gu:
Learning Audio Embeddings with User Listening Data for Content-Based Music Recommendation. 3015-3019 - Zixuan Peng, Yu Lu, Shengfeng Pan, Yunfeng Liu:
Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention. 3020-3024 - Sungkyun Chang, Donmoon Lee, Jeongsoo Park, Hyungui Lim, Kyogu Lee, Karam Ko, Yoonchang Han:
Neural Audio Fingerprint for High-Specific Audio Retrieval Based on Contrastive Learning. 3025-3029 - Qiantong Xu, Alexei Baevski, Tatiana Likhomanenko, Paden Tomasello, Alexis Conneau, Ronan Collobert, Gabriel Synnaeve, Michael Auli:
Self-Training and Pre-Training are Complementary for Speech Recognition. 3030-3034 - Sascha Hornauer, Ke Li, Stella X. Yu, Shabnam Ghaffarzadegan, Liu Ren:
Unsupervised Discriminative Learning of Sounds for Audio Event Classification. 3035-3039 - Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. 3040-3044 - Chaitanya Talnikar, Tatiana Likhomanenko, Ronan Collobert, Gabriel Synnaeve:
Joint Masked CPC And CTC Training For ASR. 3045-3049 - Henry Zhou, Alexei Baevski, Michael Auli:
A Comparison of Discrete Latent Variable Models for Speech Representation Learning. 3050-3054 - Yasmin Sarcheshmehpour, M. Leinonen, Alexander Jung:
Federated Learning from Big Data Over Networks. 3055-3059 - Jie Zhao, Xinghua Zhu, Jianzong Wang, Jing Xiao:
Efficient Client Contribution Evaluation for Horizontal Federated Learning. 3060-3064 - Yong Liu, Xinghua Zhu, Jianzong Wang, Jing Xiao:
A Quantitative Metric for Privacy Leakage in Federated Learning. 3065-3069 - (Withdrawn) DP-SIGNSGD: When Efficiency Meets Privacy and Robustness. 3070-3074
- Sai Anuroop Kesanapalli, B. N. Bharath:
Federated Algorithm with Bayesian Approach: Omni-Fedge. 3075-3079 - Dhruv Guliani, Françoise Beaufays, Giovanni Motta:
Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework. 3080-3084 - Kishore Nandury, Anand Mohan, Frederick Weber:
Cross-Silo Federated Training in the Cloud with Diversity Scaling and Semi-Supervised Learning. 3085-3089 - Luong Trung Nguyen, Byonghyo Shim:
Gradual Federated Learning Using Simulated Annealing. 3090-3094 - Elsa Rizk, Stefan Vlaski, Ali H. Sayed:
Optimal Importance Sampling for Federated Learning. 3095-3099 - Anirban Das, Stacy Patterson:
Multi-Tier Federated Learning for Vertically Partitioned Data. 3100-3104 - Yuntao Hu, Ming Chen, Mingzhe Chen, Zhaohui Yang, Mohammad Shikh-Bahaei, H. Vincent Poor, Shuguang Cui:
Energy Minimization for Federated Learning with IRS-Assisted Over-the-Air Computation. 3105-3109 - Divyansh Jhunjhunwala, Advait Gadhikar, Gauri Joshi, Yonina C. Eldar:
Adaptive Quantization of Model Updates for Communication-Efficient Federated Learning. 3110-3114 - Manas Gupta, Arulmurugan Ambikapathi, Savitha Ramasamy:
HebbNet: A Simplified Hebbian Learning Framework to do Biologically Plausible Learning. 3115-3119 - Yiming Li, Yang Zhang, Qingtao Tang, Weipéng Huáng, Yong Jiang, Shu-Tao Xia:
t-k-means: A Robust and Stable k-means Variant. 3120-3124 - Feiping Nie, Wei Chang, Xuelong Li, Jin Xu, Gongfu Li:
Adaptive Feature Weight Learning For Robust Clustering Problem with Sparse Constraint. 3125-3129 - Jiaying Zhou, Xun Xian, Na Li, Jie Ding:
Assisted Learning: Cooperative AI with Autonomy. 3130-3134 - Gersende Fort, Eric Moulines, Hoi-To Wai:
Geom-Spider-EM: Faster Variance Reduced Stochastic Expectation Maximization for Nonconvex Finite-Sum Optimization. 3135-3139 - Arman Zharmagambetov, Miguel Á. Carreira-Perpiñán:
Learning a Tree of Neural Nets. 3140-3144 - Djallel Bouneffouf:
Corrupted Contextual Bandits: Online Learning with Corrupted Context. 3145-3149 - Bo Hu, José C. Príncipe:
Training a Bank of Wiener Models with a Novel Quadratic Mutual Information Cost Function. 3150-3154 - Matías Vera, Leonardo Rey Vega, Pablo Piantanida:
Information and Regularization in Restricted Boltzmann Machines. 3155-3159 - Xi Yu, Shujian Yu, José C. Príncipe:
Deep Deterministic Information Bottleneck with Matrix-Based Entropy Functional. 3160-3164 - Lingtian Feng, Feng Qian, Xin He, Yuqi Fan, Hanpeng Cai, Guangmin Hu:
Transitive Transfer Sparse Coding for Distant Domain. 3165-3169 - Canyu Zhang, Feiping Nie, Zheng Wang, Rong Wang, Xuelong Li:
Fast Local Representation Learning with Adaptive Anchor Graph. 3170-3174 - See Hian Lee, Feng Ji, Wee Peng Tay:
Learning On Heterogeneous Graphs Using High-Order Relations. 3175-3179 - Jianlun Liu, Shaohua Teng, Wei Zhang, Xiaozhao Fang, Lunke Fei, Zhuxiu Zhang:
Incomplete Multi-View Subspace Clustering with Low-Rank Tensor. 3180-3184 - Guowei Wang, Naiyang Guan, Hanjia Ye, Xiaodong Yi, Hang Cheng, Junjie Zhu:
Channel-Wise Mix-Fusion Deep Neural Networks for Zero-Shot Learning. 3185-3189 - Georgios Vasileios Karanikolas, Qin Lu, Georgios B. Giannakis:
Online Unsupervised Learning Using Ensemble Gaussian Processes with Random Features. 3190-3194 - Shuoyang Li, Yuhui Luo, Jonathon A. Chambers, Wenwu Wang:
Dimension Selected Subspace Clustering. 3195-3199 - Chen Yang, Shuyuan Yang:
Deep Ensemble Siamese Network For Incremental Signal Classification. 3200-3204 - Hao Chen, Zengde Deng, Yue Xu, Zhoujun Li:
Non-Recursive Graph Convolutional Networks. 3205-3209 - George Dasoulas, Giannis Nikolentzos, Kevin Scaman, Aladin Virmaux, Michalis Vazirgiannis:
Ego-Based Entropy Measures for Structural Representations on Graphs. 3210-3214 - Pratyusha Das, Antonio Ortega:
Symmetric Sub-graph Spatio-Temporal Graph Convolution and its application in Complex Activity Recognition. 3215-3219 - Negar Heidari, Alexandros Iosifidis:
Progressive Spatio-Temporal Graph Convolutional Network for Skeleton-Based Human Action Recognition. 3220-3224 - Yusuke Arai, Shogo Muramatsu, Hiroyasu Yasuda, Kiyoshi Hayasaka, Yu Otake:
Sparse-Coded Dynamic Mode Decomposition on Graph for Prediction of River Water Level Distribution. 3225-3229 - Yang Li, Gonzalo Mateos:
Graph Frequency Analysis of COVID-19 Incidence to Identify County-Level Contagion Patterns in the United States. 3230-3234 - Gokcan Tatli, Alper T. Erdogan:
Generalized Polytopic Matrix Factorization. 3235-3239 - Trung Vu, Raviv Raich:
Exact Linear Convergence Rate Analysis for Low-Rank Symmetric Matrix Completion via Gradient Descent. 3240-3244 - Quoc-Tung Le, Rémi Gribonval:
Structured Support Exploration for Multilayer Sparse Matrix Factorization. 3245-3249 - Yerlan Idelbayev, Miguel Á. Carreira-Perpiñán:
Optimal Selection of Matrix Shape and Decomposition Scheme for Neural Network Compression. 3250-3254 - Dong Hu, Shashanka Ubaru, Alex Gittens, Kenneth L. Clarkson, Lior Horesh, Vassilis Kalantzis:
Sparse Graph Based Sketching for Fast Numerical Linear Algebra. 3255-3259 - Oren Barkan, Roy Hirsch, Ori Katz, Avi Caciularu, Yoni Weill, Noam Koenigstein:
Cold Start Revisited: A Deep Hybrid Recommender with Cold-Warm Item Harmonization. 3260-3264 - Joshua Vendrow, Jamie Haddock, Elizaveta Rebrova, Deanna Needell:
On a Guided Nonnegative Matrix Factorization. 3265-3269 - Andersen Man Shun Ang, Nicolas Gillis, Arnaud Vandaele, Hans De Sterck:
Nonnegative Unimodal Matrix Factorization. 3270-3274 - Abderrahmane Rahiche, Mohamed Cheriet:
Kernel Orthogonal Nonnegative Matrix Factorization: Application to Multispectral Document Image Decomposition. 3275-3279 - Farouk Yahaya, Matthieu Puigt, Gilles Delmaire, Gilles Roussel:
Random Projection Streams for (Weighted) Nonnegative Matrix Factorization. 3280-3284 - Pascal A. Schirmer, Iosif Mporas:
Multivariate Non-Negative Matrix Factorization with Application to Energy Disaggregation. 3285-3289 - Jen-Tzung Chien, Yi-Hsiang Chen:
Continuous-Time Self-Attention in Neural Differential Equation. 3290-3294 - Ogul Can, Yeti Ziya Gürbüz, Berkin Yildirim, A. Aydin Alatan:
Blind Deinterleaving of Signals in Time Series with Self-Attention Based Soft Min-Cost Flow Learning. 3295-3299 - Tianlei Zhu, Jiawei Li, Xinji Liu, Yong Jiang, Shu-Tao Xia:
Attention on Attention Sparse Dense Convolutional Network for Financial Signal Processing. 3300-3304 - Divyanshu Daiya, Che Lin:
Stock Movement Prediction and Portfolio Management via Multimodal Learning with Transformer. 3305-3309 - Eleonora Grassucci, Danilo Comminiello, Aurelio Uncini:
A Quaternion-Valued Variational Autoencoder. 3310-3314 - Michel Barlaud, Frédéric Guyard:
Learning a Sparse Generative Non-Parametric Supervised Autoencoder. 3315-3319 - Yinghua Gao, Li Shen, Shu-Tao Xia:
DAG-GAN: Causal Structure Learning with Generative Adversarial Nets. 3320-3324 - Xin Guo, Johnny Hong, Tianyi Lin, Nan Yang:
Relaxed Wasserstein with Applications to GANs. 3325-3329 - Zhengyang Wang, Sheng Chen, Wei Yang, Yang Xu:
Environment-Independent Wi-Fi Human Activity Recognition with Adversarial Network. 3330-3334 - Maria Kaselimi, Athanasios Voulodimos, Nikolaos Doulamis, Anastasios D. Doulamis, Eftychios Protopapadakis:
A Robust to Noise Adversarial Recurrent Model for Non-Intrusive Load Monitoring. 3335-3339 - Xiaoyang Qu, Jianzong Wang, Jing Xiao:
Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation. 3340-3344 - Shixiang Zhu, Henry Shaowu Yuchi, Minghe Zhang, Yao Xie:
Sequential Adversarial Anomaly Detection with Deep Fourier Kernel. 3345-3349 - Yuchi Zhang, Yongliang Wang, Yang Dong:
Incorporate Maximum Mean Discrepancy in Recurrent Latent Space for Sequential Generative Model. 3350-3354 - Yiwen Sun, Yulu Wang, Kun Fu, Zheng Wang, Ziang Yan, Changshui Zhang, Jieping Ye:
FMA-ETA: Estimating Travel Time Entirely Based on FFN with Attention. 3355-3359 - Samarth Gupta, Shreyas Chaudhari, Subhojyoti Mukherjee, Gauri Joshi, Osman Yagan:
A Unified Approach to Translate Classical Bandit Algorithms to Structured Bandits. 3360-3364 - Lingda Wang, Huozhi Zhou, Bingcong Li, Lav R. Varshney, Zhizhen Zhao:
Near-Optimal Algorithms for Piecewise-Stationary Cascading Bandits. 3365-3369 - Yasitha Warahena Liyanage, Daphney-Stavroula Zois:
Optimum Feature Ordering for Dynamic Instance-Wise Joint Feature Selection and Classification. 3370-3374 - Wenyu Zhang:
POLA: Online Time Series Prediction by Adaptive Learning Rates. 3375-3379 - Xulong Zhang, Jiale Qian, Yi Yu, Yifu Sun, Wei Li:
Singer Identification Using Deep Timbre Feature Learning with KNN-NET. 3380-3384 - Israel D. Gebru, Dejan Markovic, Alexander Richard, Steven Krenn, Gladstone Alexander Butler, Fernando De la Torre, Yaser Sheikh:
Implicit HRTF Modeling Using Temporal Convolutional Networks. 3385-3389 - Marcelo Bortolozzo, Rodrigo Schramm, Cláudio R. Jung:
Improving the Classification of Rare Chords With Unlabeled Data. 3390-3394 - Pritish Chandna, António Ramires, Xavier Serra, Emilia Gómez:
Loopnet: Musical Loop Synthesis Conditioned on Intuitive Musical Parameters. 3395-3399 - Zalán Borsos, Yunpeng Li, Beat Gfeller, Marco Tagliasacchi:
Micaugment: One-Shot Microphone Style Transfer. 3400-3404 - Eduardo Fernandes Montesuma, Fred Maurice Ngolè Mboula:
Wasserstein Barycenter Transport for Acoustic Adaptation. 3405-3409 - Youngwoo Cho, Minwook Chang, Sanghyeon Lee, Hyoungwoo Lee, Gerard Jounghyun Kim, Jaegul Choo:
Efficient Adversarial Audio Synthesis Via Progressive Upsampling. 3410-3414 - Panagiotis Tzirakis, Anurag Kumar, Jacob Donley:
Multi-Channel Speech Enhancement Using Graph Neural Networks. 3415-3419 - Junzhe Zhu, Raymond A. Yeh, Mark Hasegawa-Johnson:
Multi-Decoder DPRNN: Source Separation for Variable Number of Speakers. 3420-3424 - Guillaume Le Moing, Phongtharin Vinayavekhin, Don Joven Agravante, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana:
Data-Efficient Framework for Real-World Multiple Sound Source 2D Localization. 3425-3429 - Wentao Yu, Steffen Zeiler, Dorothea Kolossa:
Fusing Information Streams in End-to-End Audio-Visual Speech Recognition. 3430-3434 - Navneet Garg, Tharmalingam Ratnarajah:
Cooperative Scenarios for Multi-Agent Reinforcement Learning in Wireless Edge Caching. 3435-3439 - Juan Parras, Santiago Zazo:
Robust Deep Reinforcement Learning for Underwater Navigation with Unknown Disturbances. 3440-3444 - Djallel Bouneffouf, Emmanuelle Claeys:
Online Hyper-Parameter Tuning for the Contextual Bandit. 3445-3449 - Djallel Bouneffouf, Raphaël Féraud, Sohini Upadhyay, Yasaman Khazaeni, Irina Rish:
Double-Linear Thompson Sampling for Context-Attentive Bandits. 3450-3454 - Yao-Chun Chan, Mingchen Li, Samet Oymak:
On the Marginal Benefit of Active Learning: Does Self-Supervision Eat its Cake? 3455-3459 - Thanh Xuan Nguyen, Tung Minh Luu, Trung X. Pham, Sanzhar Rakhimkul, Chang D. Yoo:
Robust MAML: Prioritization Task Buffer with Adaptive Learning Process for Model-Agnostic Meta-Learning. 3460-3464 - Ge Yu, Emre Barut, Chengwei Su:
Introducing Deep Reinforcement Learning to NLU Ranking Tasks. 3465-3469 - Ye Tao, Ying Li, Zhonghai Wu:
Temporal Link Prediction Via Reinforcement Learning. 3470-3474 - Petros Giannakopoulos, Aggelos Pikrakis, Yannis Cotronis:
A Deep Reinforcement Learning Approach To Audio-Based Navigation In A Multi-Speaker Environment. 3475-3479 - Yuntao Liu, Yong Dou, Siqi Shen, Peng Qiao:
Global-Localized Agent Graph Convolution for Multi-Agent Reinforcement Learning. 3480-3484 - Qin Lu, Georgios B. Giannakis:
Gaussian Process Temporal-Difference Learning with Scalability and Worst-Case Performance Guarantees. 3485-3489 - Qifeng Lin, Qing Ling:
Self-Inference Of Others' Policies For Homogeneous Agents In Cooperative Multi-Agent Reinforcement Learning. 3490-3494 - Zalán Borsos, Marco Tagliasacchi, Andreas Krause:
Semi-Supervised Batch Active Learning Via Bilevel Optimization. 3495-3499 - Rami Mowakeaa, Seung-Jun Kim, Darren K. Emge:
Kernel-Based Lifelong Policy Gradient Reinforcement Learning. 3500-3504 - Arash Golibagh Mahyari:
Policy Augmentation: An Exploration Strategy For Faster Convergence of Deep Reinforcement Learning Algorithms. 3505-3509 - Siqi Shen, Yongquan Fu, Huayou Su, Hengyue Pan, Peng Qiao, Yong Dou, Cheng Wang:
Graphcomm: A Graph Neural Network Based Method for Multi-Agent Reinforcement Learning. 3510-3514 - Olivier Vu Thanh, Matthieu Puigt, Farouk Yahaya, Gilles Delmaire, Gilles Roussel:
In Situ Calibration of Cross-Sensitive Sensors in Mobile Sensor Arrays Using Fast Informed Non-Negative Matrix Factorization. 3515-3519 - Lei Zhang, Peng Zhang, Luchen Liu, Jianlong Tan:
Multiphish: Multi-Modal Features Fusion Networks for Phishing Detection. 3520-3524 - Theodoros Tsiligkaridis:
Failure Prediction by Confidence Estimation of Uncertainty-Aware Dirichlet Networks. 3525-3529 - Qingyang Xu, Qingsong Wen, Liang Sun:
Two-Stage Framework for Seasonal Time Series Forecasting. 3530-3534 - Alberto García-Durán, Robert West:
Recursive Input and State Estimation: A General Framework for Learning from Time Series With Missing Data. 3535-3539 - Abolfazl Hashemi, Haris Vikalo, Gustavo de Veciana:
On the Performance-Complexity Tradeoff in Stochastic Greedy Weak Submodular Optimization. 3540-3544 - Haoyi Fan, Fengbin Zhang, Ruidong Wang, Xunhua Huang, Zuoyong Li:
Semi-Supervised Time Series Classification by Temporal Relation Prediction. 3545-3549 - Hui Shi, Yang Zhang, Hao Wu, Shiyu Chang, Kaizhi Qian, Mark Hasegawa-Johnson, Jishen Zhao:
Continuous CNN For Nonuniform Time Series. 3550-3554 - Arijit Ukil, Antonio J. Jara, Leandro Marín:
Blend-Res2net: Blended Representation Space by Transformation of Residual Mapping with Restrained Learning for Time Series Classification. 3555-3559 - Tryambak Gangopadhyay, Sin Yong Tan, Zhanhong Jiang, Rui Meng, Soumik Sarkar:
Spatiotemporal Attention for Multivariate Time Series Prediction and Interpretation. 3560-3564 - Inkit Padhi, Yair Schiff, Igor Melnyk, Mattia Rigotti, Youssef Mroueh, Pierre L. Dognin, Jerret Ross, Ravi Nair, Erik R. Altman:
Tabular Transformers for Modeling Multivariate Time Series. 3565-3569 - Ahmed Abdulaal, Tomer Lancewicki:
Real-Time Synchronization in Neural Networks for Multivariate Time Series Anomaly Detection. 3570-3574 - Hashem Ghanem, Nicolas Keriven, Nicolas Tremblay:
Fast Graph Kernel with Optical Random Features. 3575-3579 - Xu Chen, Lun Du, Mengyuan Chen, Yun Wang, Qingqing Long, Kunqing Xie:
Fast Hierarchy Preserving Graph Embedding via Subspace Constraints. 3580-3584 - Jianming Huang, Hiroyuki Kasai:
Graph Embedding using Multi-Layer Adjacent Point Merging Model. 3585-3589 - Eda Bayram, Alberto García-Durán, Robert West:
Node Attribute Completion in Knowledge Graphs with Multi-Relational Propagation. 3590-3594 - Haiyang Zhang, Ivan Ganchev, Nikola S. Nikolov, Mark Stevenson:
UserReg: A Simple but Strong Model for Rating Prediction. 3595-3599 - Djallel Bouneffouf, Raphaël Féraud, Sohini Upadhyay, Mayank Agarwal, Yasaman Khazaeni, Irina Rish:
Toward Skills Dialog Orchestration with Online Learning. 3600-3604 - Hongyu Chen, Ruifang Liu, Han Fang, Ximing Zhang:
Adaptive Re-Balancing Network with Gate Mechanism for Long-Tailed Visual Question Answering. 3605-3609 - Huiyuan Li, Li Yu, Youfang Leng, Qihan Du:
Co-Capsule Networks Based Knowledge Transfer for Cross-Domain Recommendation. 3610-3614 - Javier Maroto, Clément Vignac, Pascal Frossard:
Modurec: Recommender Systems with Feature and Time Modulation. 3615-3619 - SangYeon Kim, Hyunwoo Lee, Jonghee Han, Joon-Ho Kim:
Sig2Sig: Signal Translation Networks to Take the Remains of the Past. 3620-3624 - Babak Barazandeh, Davoud Ataee Tarzanagh, George Michailidis:
Solving a Class of Non-Convex Min-Max Games Using Adaptive Momentum Methods. 3625-3629 - Thuan Nguyen, Thinh Nguyen:
Minimizing Weighted Concave Impurity Partition Under Constraints. 3630-3634 - Thuan Nguyen, Hoang Le, Thinh Nguyen:
Constant Approximation Algorithm for Minimizing Concave Impurity. 3635-3639 - Xiaobin Li, Lianlei Shan, Weiqiang Wang:
Fusing Multitask Models by Recursive Least Squares. 3640-3644 - Mahdi Shamsi, Soosan Beheshti:
Centrality Based Number of Cluster Estimation in Graph Clustering. 3645-3649 - Xia Dong, Danyang Wu, Feiping Nie, Rong Wang, Xuelong Li:
Dependence-Guided Multi-View Clustering. 3650-3654 - Sarit Khirirat, Xiaoyu Wang, Sindri Magnússon, Mikael Johansson:
Improved Step-Size Schedules for Noisy Gradient Methods. 3655-3659 - Pengzhen Li, Erdem Koyuncu, Hulya Seferoglu:
Respipe: Resilient Model-Distributed DNN Training at Edge Networks. 3660-3664 - Yuejiao Sun, Tianyi Chen, Wotao Yin:
An Optimal Stochastic Compositional Optimization Method with Applications to Meta Learning. 3665-3669 - Yiyue Chen, Abolfazl Hashemi, Haris Vikalo:
Decentralized Optimization on Time-Varying Directed Graphs Under Communication Constraints. 3670-3674 - Aditya Balu, Zhanhong Jiang, Sin Yong Tan, Chinmay Hegde, Young M. Lee, Soumik Sarkar:
Decentralized Deep Learning Using Momentum-Accelerated Consensus. 3675-3679 - Shuai Wang, Richard Cornelius Suwandi, Tsung-Hui Chang:
Demystifying Model Averaging for Communication-Efficient Federated Matrix Factorization. 3680-3684 - Halil Ibrahim Gulluk, Yue Sun, Samet Oymak, Maryam Fazel:
Sample Efficient Subspace-Based Representations for Nonlinear Meta-Learning. 3685-3689 - Xiaoqian Wang, Feiping Nie:
Multi-Task Learning Via Sharing Inexact Low-Rank Subspace. 3690-3694 - Ying Li, Fuwei Li, Lifeng Lai, Jun Wu:
On The Adversarial Robustness of Principal Component Analysis. 3695-3699 - Fen Wang, Gene Cheung, Yongchao Wang, Wai-Tian Tan:
Fast Manifold Landmarking Using Extreme Eigen-Pairs. 3700-3704 - Marc Vilà, Carlos Alejandro López, Jaume Riba:
Affine Projection Subspace Tracking. 3705-3709 - Bolaji Yusuf, Lucas Ondel, Lukás Burget, Jan Cernocký, Murat Saraçlar:
A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery. 3710-3714 - Lucas P. Damasceno, Charles C. Cavalcante, Tülay Adali, Zois Boukouvalas:
Independent Vector Analysis Using Semi-Parametric Density Estimation via Multivariate Entropy Maximization. 3715-3719 - Ben Gabrielson, Mohammad A. B. S. Akhonda, Zois Boukouvalas, Seung-Jun Kim, Tülay Adali:
ICA with Orthogonality Constraint: Identifiability And A New Efficient Algorithm. 3720-3724 - N. Amor, Jaroslav Cmejla, Václav Kautský, Zbynek Koldovský, Tomás Kounovský:
Blind Extraction of Moving Sources via Independent Component and Vector Analysis: Examples. 3725-3729 - Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi:
Single Channel Voice Separation for Unknown Number of Speakers Under Reverberant and Noisy Settings. 3730-3734 - Jing Yang, Tristan Cinquin, Gábor Sörös:
Unsupervised Musical Timbre Transfer for Notification Sounds. 3735-3739 - Yiming Li, Peidong Liu, Yong Jiang, Shu-Tao Xia:
Visual Privacy Protection via Mapping Distortion. 3740-3744 - Zhen Xiang, David J. Miller, George Kesidis:
L-Red: Efficient Post-Training Detection of Imperceptible Backdoor Attacks Without Access to the Training Set. 3745-3749 - Chuanguang Yang, Zhulin An, Yongjun Xu:
Multi-View Contrastive Learning for Online Knowledge Distillation. 3750-3754 - Alexander Sagel, Julian Wörmann, Hao Shen:
Dynamic Texture Recognition via Nuclear Distances on Kernelized Scattering Histogram Spaces. 3755-3759 - Zuogong Yue, Victor Solo:
Clustering A Collection of Networks With Mixtures of L1-Sparse Graphical Models. 3760-3764 - Suncheng Xiang, Yuzhuo Fu, Guanjie You, Ting Liu:
Taking A Closer Look at Synthesis: Fine-Grained Attribute Analysis for Person Re-Identification. 3765-3769 - Eldan Cohen, Hayato Ushijima-Mwesigwa, Avradip Mandal, Arnab Roy:
Unified Clustering and Outlier Detection on Specialized Hardware. 3770-3774 - Liu Yang, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric:
Class-Imbalanced Classifiers Using Ensembles of Gaussian Processes And Gaussian Process Latent Variable Models. 3775-3779 - Antonio Joia Neto, André G. C. Pacheco, Diogo Carbonera Luvizon:
Improving Deep Learning Sound Events Classifiers Using Gram Matrix Feature-Wise Correlations. 3780-3784 - Bhagyashree Puranik, Upamanyu Madhow, Ramtin Pedarsani:
Adversarially Robust Classification Based on GLRT. 3785-3789 - Jiacheng Zhang, Lin Jiang, Yuan Zong, Wenming Zheng, Li Zhao:
Cross-Corpus Speech Emotion Recognition Using Joint Distribution Adaptive Regression. 3790-3794 - Sannidhi P. Kumar, Chandan Gautam, Suresh Sundaram:
Meta-Cognition-Based Simple And Effective Approach To Object Detection. 3795-3799 - Xianchao Zhang, Jie Mu, Han Liu, Xiaotong Zhang:
Graphnet: Graph Clustering with Deep Neural Networks. 3800-3804 - Yuchen Chu, Zunhua Guo:
Attention Enhanced Spatial Temporal Neural Network For HRRP Recognition. 3805-3809 - Mingyuan Jiu, Hichem Sahbi:
DHCN: Deep Hierarchical Context Networks For Image Annotation. 3810-3814 - Cong Ye, Konstantinos Slavakis, Johan Nakuci, Sarah Feldt Muldoon, John D. Medaglia:
Online Classification of Dynamic Multilayer-Network Time Series in Riemannian Manifolds. 3815-3819 - Junghoon Seo, Joon Suk Huh:
On The Power of Deep But Naive Partial Label Learning. 3820-3824 - Nikolaos Dimitriadis, Petros Maragos:
Advances in Morphological Neural Networks: Training, Pruning and Enforcing Shape Constraints. 3825-3829 - Jarrod Hollis, Jinsub Kim, Raviv Raich:
Adversarial Learning via Probabilistic Proximity Analysis. 3830-3834 - Zhikang Xia, Bin Chen, Tao Dai, Shu-Tao Xia:
Class Aware Robust Training. 3835-3839 - Yu-Lin Tsai, Chia-Yi Hsu, Chia-Mu Yu, Pin-Yu Chen:
Non-Singular Adversarial Robustness of Neural Networks. 3840-3844 - Muhammad A. Shah, Raphaël Olivier, Bhiksha Raj:
Towards Adversarial Robustness Via Compact Feature Representations. 3845-3849 - Kejiang Chen, Yuefeng Chen, Hang Zhou, Chuan Qin, Xiaofeng Mao, Weiming Zhang, Nenghai Yu:
Adversarial Examples Detection Beyond Image Space. 3850-3854 - Eitan Borgnia, Valeriia Cherepanova, Liam Fowl, Amin Ghiasi, Jonas Geiping, Micah Goldblum, Tom Goldstein, Arjun Gupta:
Strong Data Augmentation Sanitizes Poisoning and Backdoor Attacks Without an Accuracy Tradeoff. 3855-3859 - Janek Ebbers, Michael Kuhlmann, Tobias Cord-Landwehr, Reinhold Haeb-Umbach:
Contrastive Predictive Coding Supported Factorized Variational Autoencoder For Unsupervised Learning Of Disentangled Speech Representations. 3860-3864 - Jun Wang, Max W. Y. Lam, Dan Su, Dong Yu:
Contrastive Separative Coding for Self-Supervised Representation Learning. 3865-3869 - Alex Xiao, Christian Fuegen, Abdelrahman Mohamed:
Contrastive Semi-Supervised Learning for ASR. 3870-3874 - Aaqib Saeed, David Grangier, Neil Zeghidour:
Contrastive Learning of General-Purpose Audio Representations. 3875-3879 - Yulong Chen, Jianping Zhao, Weiqi Wang, Ming Fang, Haimei Kang, Lu Wang, Tao Wei, Jun Ma, Shaojun Wang, Jing Xiao:
SEQ-CPC: Sequential Contrastive Predictive Coding for Automatic Speech Recognition. 3880-3884 - Lasse Borgholt, Tycho M. S. Tax, Jakob D. Havtorn, Lars Maaløe, Christian Igel:
On Scaling Contrastive Representations for Low-Resource Speech Recognition. 3885-3889 - Vikul Gupta, Burak Bartan, Tolga Ergen, Mert Pilanci:
Convex Neural Autoregressive Models: Towards Tractable, Expressive, and Theoretically-Backed Models for Sequential Forecasting and Generation. 3890-3894 - Linbo Qiao, Tao Sun, Hengyue Pan, Dongsheng Li:
Inertial Proximal Deep Learning Alternating Minimization for Efficient Neutral Network Training. 3895-3899 - Xingyi Yang:
Kalman Optimizer for Consistent Gradient Descent. 3900-3904 - Guy Revach, Nir Shlezinger, Ruud J. G. van Sloun, Yonina C. Eldar:
Kalmannet: Data-Driven Kalman Filtering. 3905-3909 - Ruben Pauwels, Evaggelia Tsiligianni, Nikos Deligiannis:
HCGM-Net: A Deep Unfolding Network for Financial Index Tracking. 3910-3914 - Elizabeth Fons, Paula Dawson, Xiao-Jun Zeng, John A. Keane, Alexandros Iosifidis:
Augmenting Transferred Representations for Stock Classification. 3915-3919 - Hojjat Salehinejad, Shahrokh Valaee:
A Framework for Pruning Deep Neural Networks Using Energy-Based Models. 3920-3924 - Jangho Kim, Simyung Chang, Sungrack Yun, Nojun Kwak:
Prototype-Based Personalized Pruning. 3925-3929 - Matej Ulicny, Vladimir A. Krylov, Rozenn Dahyot:
Tensor Reordering for CNN Compression. 3930-3934 - Hojjat Salehinejad, Shahrokh Valaee:
Pruning of Convolutional Neural Networks using Ising Energy Model. 3935-3939 - Weiwei Chen, Chong Wang, Zhehao Zhang, Zheng Huo, Linlin Gao:
Reweighted Dynamic Group Convolution. 3940-3944 - Shohei Kubota, Hideaki Hayashi, Tomohiro Hayase, Seiichi Uchida:
Layer-Wise Interpretation of Deep Neural Networks using Identity Initialization. 3945-3949 - Bilal Taha, Megan Kirk, Paul Ritvo, Dimitrios Hatzinakos:
Detection of Post-Traumatic Stress Disorder Using Learned Time-Frequency Representations from Pupillometry. 3950-3954 - Soheil Rayatdoost, Yufeng Yin, David Rudrauf, Mohammad Soleymani:
Subject-Invariant EEG Representation Learning For Emotion Recognition. 3955-3959 - Hongchao Jiang, Wei Yang Bryan Lim, Jer Shyuan Ng, Yu Wang, Ying Chi, Chunyan Miao:
Towards Parkinson's Disease Prognosis Using Self-Supervised Learning and Anomaly Detection. 3960-3964 - Vandad Davoodnia, Saeed Ghorbani, Ali Etemad:
In-Bed Pressure-Based Pose Estimation Using Image Space Representation Learning. 3965-3969 - Seyedhooman Sajjadi, Anurag Das, Ricardo Gutierrez-Osuna, Theodora Chaspari, Projna Paromita, Laura E. Ruebush, Nicolaas E. P. Deutz, Bobak J. Mortazavi:
Towards The Development of Subject-Independent Inverse Metabolic Models. 3970-3974 - Diyuan Lu, Nenad Polomac, Iskra Gacheva, Elke Hattingen, Jochen Triesch:
Human-Expert-Level Brain Tumor Detection Using Deep Learning with Data Distillation And Augmentation. 3975-3979 - Andrew Silva, Barry-John Theobald, Nicholas Apostoloff:
Multimodal Punctuation Prediction with Contextual Dropout. 3980-3984 - Masanao Matsumoto, Keisuke Maeda, Naoki Saito, Takahiro Ogawa, Miki Haseyama:
Multi-Modal Label Dequantized Gaussian Process Latent Variable Model for Ordinal Label Estimation. 3985-3989 - Kenneth Tran, Wesam A. Sakla, Hamid Krim:
Generative Information Fusion. 3990-3994 - Shinnosuke Matsuo, Seiichi Uchida, Brian Kenji Iwana:
Self-Augmented Multi-Modal Feature Embedding. 3995-3999 - Ashish Shrivastava, Arnav Kundu, Chandra Dhir, Devang Naik, Oncel Tuzel:
Optimize What Matters: Training DNN-HMM Keyword Spotting Model Using End Metric. 4000-4004 - Björn Bebensee, Byoung-Tak Zhang:
Co-Attentional Transformers for Story-Based Video Understanding. 4005-4009 - Aaron Berk:
Deep Generative Demixing: Error Bounds for Demixing Subgaussian Mixtures of Lipschitz Signals. 4010-4014 - Théo Giraudon, Vincent Gripon, Matthias Löwe, Franck Vermet:
Towards an Intrinsic Definition of Robustness for a Classifier. 4015-4019 - Ganesh Ramachandra Kini, Christos Thrampoulidis:
Phase Transitions for One-Vs-One and One-Vs-All Linear Separability in Multiclass Gaussian Mixtures. 4020-4024 - Brian Whiteaker, Peter Gerstoft:
Leaky Integrator Dynamical Systems and Reachable Sets. 4025-4029 - Ke Wang, Christos Thrampoulidis:
Benign Overfitting in Binary Classification of Gaussian Mixtures. 4030-4034 - Sudeep Salgia, Qing Zhao:
An Order-Optimal Adaptive Test Plan for Noisy Group Testing Under Unknown Noise Models. 4035-4039 - Ting-Yao Hu, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Stefan Braun, Kyuyeon Hwang, Ozlem Kalinli, Oncel Tuzel:
SapAugment: Learning A Sample Adaptive Policy for Data Augmentation. 4040-4044 - Shahrzad Kiani, Tharindu Adikari, Stark C. Draper:
Hierarchical Coded Elastic Computing. 4045-4049 - Sandipan Banerjee, Ajjen Joshi, Ahmed Ghoneim, Survi Kyal, Taniya Mishra:
Synthesize & Learn: Jointly Optimizing Generative and Classifier Networks for Improved Drowsiness Detection. 4050-4054 - Malsha V. Perera, Ashwin De Silva:
A Joint Convolutional and Spatial Quad-Directional LSTM Network for Phase Unwrapping. 4055-4059 - Anand Dubey, Avik Santra, Jonas Fuchs, Maximilian Lübke, Robert Weigel, Fabian Lurz:
Integrated Classification and Localization of Targets Using Bayesian Framework In Automotive Radars. 4060-4064 - Shengyi Chen, Jalal Taghia, Tai Fei, Uwe Kühnau, Nils Pohl, Rainer Martin:
A DNN Autoencoder for Automotive Radar Interference Mitigation. 4065-4069 - Pranav Goyal, Satish Mulleti, Anubha Gupta, Yonina C. Eldar:
DURAS: Deep Unfolded Radar Sensing Using Doppler Focusing. 4070-4074 - Sami Jouaber, Silvère Bonnabel, Santiago Velasco-Forero, Marion Pilté:
NNAKF: A Neural Network Adapted Kalman Filter for Target Tracking. 4075-4079 - Hyeryung Jang, Osvaldo Simeone:
Multi-Sample Online Learning for Spiking Neural Networks Based on Generalized Expectation Maximization. 4080-4084 - Ting Zhong, Zheyang Xu, Fan Zhou:
Probabilistic Graph Neural Networks for Traffic Signal Control. 4085-4089 - Cat P. Le, Mohammadreza Soltani, Robert J. Ravier, Vahid Tarokh:
Task-Aware Neural Architecture Search. 4090-4094 - Jue Wang, Ping Wang, Chao Zhang, Kuifeng Su, Jun Li:
F-Net: Fusion Neural Network for Vehicle Trajectory Prediction in Autonomous Driving. 4095-4099 - Simon Benaïchouche, Clément Le Goff, Yann Guichoux, François Rousseau, Ronan Fablet:
Unsupervised Reconstruction of Sea Surface Currents from AIS Maritime Traffic Data Using Learnable Variational Models. 4100-4104 - Zhao Heng, Kim-Hui Yap, Alex ChiChung Kot:
A Compact Joint Distillation Network for Visual Food Recognition. 4105-4109 - Yiyuan Yang, Yi Li, Haifeng Zhang:
Pipeline Safety Early Warning Method for Distributed Signal using Bilinear CNN and LightGBM. 4110-4114 - Rafail Ismayilov, Renato L. G. Cavalcante, Slawomir Stanczak:
Deep Learning Based Hybrid Precoding in Dual-Band Communication Systems. 4115-4119 - Pourya Behmandpoor, Jeroen Verdyck, Marc Moonen:
Deep Learning-Based Cross-Layer Resource Allocation for Wired Communication Systems. 4120-4124 - Li Liu, Ge Li, Thomas H. Li:
ATVIO: Attention Guided Visual-Inertial Odometry. 4125-4129 - Kyohei Kamikawa, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:
Feature Integration via Semi-Supervised Ordinally Multi-Modal Gaussian Process Latent Variable Model. 4130-4134 - Jia Chen, Haiping Yu, Yimei Kang:
A Multi-Layer Multi-Channel Attentive Network for Gender and Age Recognition. 4135-4139 - Babak Naderi, Gabriel Mittag, Rafael Zequeira Jiménez, Sebastian Möller:
Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms. 4140-4144 - Chung-En Sun, Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang:
Melody Harmonization Using Orderless NADE, Chord Balancing, and Blocked Gibbs Sampling. 4145-4149 - Yun Liang, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:
Cross-Domain Semi-Supervised Deep Metric Learning for Image Sentiment Analysis. 4150-4154 - Karel Mundnich, Alexandra Fenster, Aparna Khare, Shiva Sundaram:
Audiovisual Highlight Detection in Videos. 4155-4159 - Nakamasa Inoue:
Teacher-Assisted Mini-Batch Sampling for Blind Distillation Using Metric Learning. 4160-4164 - Yuanbo Hou, Yi Deng, Bilei Zhu, Zejun Ma, Dick Botteldooren:
Rule-Embedded Network for Audio-Visual Voice Activity Detection in Live Musical Video Streams. 4165-4169 - Xinyu Xiao, Chunxia Zhang, Shiming Xiang, Chunhong Pan:
Reinforcement Stacked Learning with Semantic-Associated Attention for Visual Question Answering. 4170-4174 - Min Zhang, Meng Ma, Ping Wang:
Hierarchical Refined Attention for Scene Text Recognition. 4175-4179 - Vinod K. Kurmi, Vipul Bajaj, Badri N. Patro, K. S. Venkatesh, Vinay P. Namboodiri, Preethi Jyothi:
Collaborative Learning to Generate Audio-Video Jointly. 4180-4184 - Min Li, Zhenjiang Miao, Xiao-Ping Zhang, Wanru Xu:
An Attention-Seq2Seq Model Based on CRNN Encoding for Automatic Labanotation Generation from Motion Capture Data. 4185-4189 - Xinsheng Wang, Siyuan Feng, Jihua Zhu, Mark Hasegawa-Johnson, Odette Scharenborg:
Show and Speak: Directly Synthesize Spoken Description of Images. 4190-4194 - Zhiqiang Zhang, Jinjia Zhou, Wenxin Yu, Ning Jiang:
Drawgan: Text to Image Synthesis with Drawing Generative Adversarial Networks. 4195-4199 - Fanglu Xie, Go Irie, Tatsushi Matsubayashi:
Disentangling Subject-Dependent/-Independent Representations for 2D Motion Retargeting. 4200-4204 - Pierre R. Lebreton, Kazuhisa Yamagishi:
Network and Content-Dependent Bitrate Ladder Estimation for Adaptive Bitrate Video Streaming. 4205-4209 - Goluck Konuko, Giuseppe Valenzise, Stéphane Lathuilière:
Ultra-Low Bitrate Video Conferencing Using Deep Image Animation. 4210-4214 - Yan Huang, Bin Wang, C.-C. Jay Kuo, Hui Yuan, Jingliang Peng:
Hierarchical Bit-Wise Differential Coding (HBDC) of Point Cloud Attributes. 4215-4219 - Dat Thanh Nguyen, Maurice Quach, Giuseppe Valenzise, Pierre Duhamel:
Learning-Based Lossless Compression of 3D Point Cloud Geometry. 4220-4224 - Diogo Lopes, João Ascenso, Catarina Brites, Fernando Pereira:
Image Coding with Neural Network-Based Colorization. 4225-4229 - Xuekai Wei, Mingliang Zhou, Sam Kwong, Hui Yuan, Tao Xiang:
Joint Reinforcement Learning and Game Theory Bitrate Control Method for 360-Degree Dynamic Adaptive Streaming. 4230-4234 - Meng Niu, Kai Chen, Qingcai Chen, Lufeng Yang:
HCAG: A Hierarchical Context-Aware Graph Attention Model for Depression Detection. 4235-4239 - Baojin Huang, Zhongyuan Wang, Guangcheng Wang, Kui Jiang, Kangli Zeng, Zhen Han, Xin Tian, Yuhong Yang:
When Face Recognition Meets Occlusion: A New Benchmark. 4240-4244 - Mingfu Xiong, Zhongyuan Wang, Ruhan He, Xinrong Hu, Ming Cheng, Xiao Qin, Jia Chen:
A Triplet Appearance Parsing Network for Person Re-Identification. 4245-4249 - Xian Zhong, Yiting Liu, Wenxin Huang, Xiao Wang, Bo Ma, Jingling Yuan:
Part-Aligned Network with Background for Misaligned Person Search. 4250-4254 - Ruobing Zheng, Bo Song, Changjiang Ji:
Learning Pose-Adaptive Lip Sync with Cascaded Temporal Convolutional Network. 4255-4259 - Hung-Yi Su, Chung-Hsien Wu, Cheng-Ray Liou, Esther Ching-Lan Lin, Po See Chen:
Assessment of Bipolar Disorder Using Heterogeneous Data of Smartphone-Based Digital Phenotyping. 4260-4264 - Lei Li, Xiangzheng Li, Kangbo Wu, Kui Lin, Suping Wu:
Multi-Granularity Feature Interaction and Relation Reasoning for 3D Dense Alignment and Face Reconstruction. 4265-4269 - Agelos Kratimenos, Georgios Pavlakos, Petros Maragos:
Independent Sign Language Recognition with 3D Body, Hands, and Face Reconstruction. 4270-4274 - Licai Sun, Bin Liu, Jianhua Tao, Zheng Lian:
Multimodal Cross- and Self-Attention Network for Speech Emotion Recognition. 4275-4279 - Xinyuan Qian, Maulik C. Madhavi, Zexu Pan, Jiadong Wang, Haizhou Li:
Multi-Target DoA Estimation with an Audio-Visual Fusion Mechanism. 4280-4284 - Ying Cheng, Mengyu He, Jiashuo Yu, Rui Feng:
Improving Multimodal Speech Enhancement by Incorporating Self-Supervised and Curriculum Learning. 4285-4289 - Zhuoran Li, Rania Hassen, Zhou Wang:
Autoencoder for Vibrotactile Signal Compression. 4290-4294 - Jiabao Zhao, Xin Lin, Yifan Yang, Jing Yang, Liang He:
Cross-Modal Knowledge Distillation For Fine-Grained One-Shot Classification. 4295-4299 - Ye Zhu, Yu Wu, Hugo Latapie, Yi Yang, Yan Yan:
Learning Audio-Visual Correlations From Variational Cross-Modal Generation. 4300-4304 - Xinfang Liu, Xiushan Nie, Junya Teng, Fanchang Hao, Yilong Yin:
ECCL: Explicit Correlation-Based Convolution Boundary Locator for Moment Localization. 4305-4309 - Lin Li, Kaixi Hu, Yunpei Zheng, Jianquan Liu, Kong Aik Lee:
COOPNet: Multi-Modal Cooperative Gender Prediction in Social Media User Profiling. 4310-4314 - Vandana Rajan, Alessio Brutti, Andrea Cavallaro:
Robust Latent Representations Via Cross-Modal Translation and Alignment. 4315-4319 - Wangbin Sun, Fei Ma, Yang Li, Shao-Lun Huang, Shiguang Ni, Lin Zhang:
Semi-Supervised Multimodal Image Translation for Missing Modality Imputation. 4320-4324 - Yu Zhou, Yong Feng, Mingliang Zhou, Baohua Qiang, Leong Hou U, Jiajie Zhu:
Deep Adversarial Quantization Network for Cross-Modal Retrieval. 4325-4329 - Jianyang Qin, Lunke Fei, Jian Zhu, Jie Wen, Chunwei Tian, Shuai Wu:
Scalable Discriminative Discrete Hashing For Large-Scale Cross-Modal Retrieval. 4330-4334 - Zhe Ma, Fenghao Liu, Jianfeng Dong, Xiaoye Qu, Yuan He, Shouling Ji:
Hierarchical Similarity Learning for Language-Based Product Image Retrieval. 4335-4339 - Shuli Cheng, Liejun Wang, Anyu Du, Yongming Li:
Bidirectional Focused Semantic Alignment Attention Network for Cross-Modal Retrieval. 4340-4344 - Joshua Peter Ebenezer, Yongjun Wu, Hai Wei, Sriram Sethuraman, Zongyi Liu:
Detection of Audio-Video Synchronization Errors Via Event Detection. 4345-4349 - Xugong Qin, Yu Zhou, Youhui Guo, Dayan Wu, Weiping Wang:
FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection. 4350-4354 - Georgios Vougioukas, Aggelos Bletsas:
DoA estimation of a hidden RF source exploiting simple backscatter radio tags. 4355-4359 - David Schenck, Xavier Mestre, Marius Pesavento:
Probability of Resolution of G-MUSIC: An Asymptotic Approach. 4360-4364 - Minh Trinh-Hoang, Mohammed Nabil El Korso, Marius Pesavento:
A Partially-Relaxed Robust DOA Estimator Under Non-Gaussian Low-Rank Interference and Noise. 4365-4369 - Zhengyu Wan, Wei Liu:
Non-Coherent DOA Estimation of Off-Grid Signals With Uniform Circular Arrays. 4370-4374 - Majdoddin Esfandiari, Sergiy A. Vorobyov:
Enhanced Standard Esprit For Overcoming Imperfections In DOA Estimation. 4375-4379 - Feng Xu, Sergiy A. Vorobyov:
Constrained Tensor Decomposition for 2d DOA Estimation In Transmit Beamspace Mimo Radar with Subarrays. 4380-4384 - Yongsung Park, Peter Gerstoft:
Alternating Projections Gridless Covariance-Based Estimation For DOA. 4385-4389 - Femke B. Gelderblom, Yi Liu, Johannes Kvam, Tor André Myrvoll:
Synthetic Data For Dnn-Based Doa Estimation of Indoor Speech. 4390-4394 - Tom Tirer, Oded Bialer:
Direction Of Arrival Estimation For Non-Coherent Sub-Arrays Via Joint Sparse And Low-Rank Signal Recovery. 4395-4399 - Hamza Baali, Abdesselam Bouzerdoum, Abdelkrim Khelif:
Sparsity And Nonnegativity Constrained Krylov Approach For Direction Of Arrival Estimation. 4400-4404 - Feng Xi, Nir Shlezinger, Yonina C. Eldar:
Hybrid Analog-Digital MIMO Radar Receivers With Bit-Limited ADCs. 4405-4409 - Syed A. Hamza, Weitong Zhai, Xiangrong Wang, Moeness G. Amin:
Sparse Array Transceiver Design for Enhanced Adaptive Beamforming in MIMO Radar. 4410-4414 - Chao-Yi Wu, Jian Li, Tan F. Wong:
Sparse Parameter Estimation for PMCW MIMO Radar Using Few-Bit ADCs. 4415-4419 - Junpeng Shi, Fangqing Wen, Yongxiang Liu, Qinmu Shen, Zhihui Li, Zhen Liu:
Parameter Identifiability Of Spatial-Smoothing-Based Bistatic Mimo Radar. 4420-4424 - Zhen Wang, Qian He:
Parameter Estimation for Coherent Passive MIMO Radar with Unknown Signals under Direct Path Influence. 4425-4429 - Jie Li, Guisheng Liao, Yan Huang, Arye Nehorai:
Riemannian Geometric Optimization Methods for Joint Design of Transmit Sequence and Receive Filter of MIMO Radar. 4430-4434 - Xiaolu Zeng, Feng Zhang, Beibei Wang, K. J. Ray Liu:
High Accuracy Tracking of Targets Using Massive MIMO. 4435-4439 - Niloofar Mohamadi, Min Dong, Shahram ShahbazPanahi:
Admm-Based Fast Algorithm for Robust Multi-Group Multicast Beamforming. 4440-4444 - Robbe Van Rompaey, Marc Moonen:
Scalable and Distributed MMSE Algorithms for Uplink Receive Combining in Cell-Free Massive MIMO Systems. 4445-4449 - Sara Sharifi, Shahram ShahbazPanahi, Min Dong:
Antenna Selection for Massive MIMO Systems Based on POMDP Framework. 4450-4454 - Alessio Fascista, Angelo Coluccia, Henk Wymeersch, Gonzalo Seco-Granados:
RIS-Aided Joint Localization and Synchronization with a Single-Antenna Mmwave Receiver. 4455-4459 - Bruno Sokal, Paulo R. B. Gomes, André L. F. de Almeida, Martin Haardt:
Joint Channel, Data, and Phase-Noise Estimation in MIMO-OFDM Systems Using a Tensor Modeling Approach. 4460-4464 - Xuehan Wang, Gongping Huang, Israel Cohen, Jacob Benesty, Jingdong Chen:
Robust Steerable Differential Beamformers with Null Constraints for Concentric Circular Microphone Arrays. 4465-4469 - Takuma Okamoto:
Close-Talking Recording with Planarly Distributed Microphones. 4470-4474 - Felix Pfreundtner, Jing Yang, Gábor Sörös:
(W)Earable Microphone Array and Ultrasonic Echo Localization for Coarse Indoor Environment Mapping. 4475-4479 - Patrick W. A. Wijnings, Sander Stuijk, Rick Scholte, Henk Corporaal:
Characterization of Mems Microphone Sensitivity and Phase Distributions with Applications in Array Processing. 4480-4484 - Karn Watcharasupat, Anh H. T. Nguyen, Ching-Hui Ooi, Andy W. H. Khong:
Directional Sparse Filtering Using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures. 4485-4489 - Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid:
Distributed Speech Separation in Spatially Unconstrained Microphone Arrays. 4490-4494 - Mehdi Bekrani, Anh H. T. Nguyen, Andy W. H. Khong:
An Adaptive Non-Linear Process for Under-Determined Virtual Microphone Beamforming. 4495-4499 - Rajib Sharma, Israel Cohen, Baruch Berdugo:
Window Beamformer for Sparse Concentric Circular Array. 4500-4504 - Xiaoyu Ai, Lu Gan:
Single-Point Array Response Control with Minimum Pattern Deviation. 4505-4509 - Peng Chen, Wei Wang, Jingjie Gao:
Focusing-Based Wideband Adaptive Beamforming Using Covariance Matrix Reconstruction. 4510-4514 - Topi Halme, Eyal Nitzan, Visa Koivunen:
Bayesian Multiple Change-Point Detection of Propagating Events. 4515-4519 - Chun-Lin Liu, Zi-Min Lin:
One-Bit Autocorrelation Estimation With Non-Zero Thresholds. 4520-4524 - Rohan R. Pote, Bhaskar D. Rao:
A Novel Bayesian Approach for the Two-Dimensional Harmonic Retrieval Problem. 4525-4529 - Wenzhe Lu, Heng Qiao:
On Overfitting in Discrete Super-Resolution Recovery. 4530-4534 - Matthieu Simeoni, Paul Hurley:
SIML: Sieved Maximum Likelihood for Array Signal Processing. 4535-4539 - Yahya Sattar, Zubair Khalid:
Estimation of Groundwater Storage Variations in Indus River Basin Using Grace Data. 4540-4544 - Mohamed Kashef, Peter G. Vouras, Robert Jones, Richard Candell, Kate A. Remley:
Temporal Exemplar Channels In High-Multipath Environments. 4545-4549 - Geonho Han, Sucheol Kim, Junil Choi:
Multi-Vehicle Velocity Estimation Using IEEE 802.11ad Waveform. 4550-4554 - Elizabeth Ren, Gustavo Cid Ornelas, Hans-Andrea Loeliger:
Real-Time Interaural Time Delay Estimation via Onset Detection. 4555-4559 - Liang Xu, Ruixin Niu:
EKFNet: Learning System Noise Statistics from Measurement Data. 4560-4564 - Po-Chih Chen, P. P. Vaidyanathan:
Sliding-Capon Based Convolutional Beamspace for Linear Arrays. 4565-4569 - Zachariah Sutton, Peter Willett, Stefano Maranò:
Target Detection from Distributed Passive Sensors: Semi-Labeled Data Quantization. 4570-4574 - Gilles Monnoyer de Galland de Carnières, Thomas Feuillen, Luc Vandendorpe, Laurent Jacques:
Sparse Factorization-Based Detection of Off-the-Grid Moving Targets Using FMCW Radars. 4575-4579 - Afief D. Pambudi, Fauzia Ahmad, Abdelhak M. Zoubir:
A Robust Copula Model for Radar-Based Landmine Detection. 4580-4584 - Sudan Han, Pia Addabbo, Danilo Orlando, Giuseppe Ricci:
Radar Clutter Classification Using Expectation-Maximization Method. 4585-4589 - Pei Zhang, Yunpeng Bai, Dong Wang, Bendu Bai, Ying Li:
A Meta-Learning Framework for Few-Shot Classification of Remote Sensing Scene. 4590-4594 - Beichen Zhou, Jingjun Yi, Qi Bi:
Differential Convolution Feature Guided Deep Multi-Scale Multiple Instance Learning for Aerial Scene Classification. 4595-4599 - Junpeng Shi, Yongxiang Liu, Fangqing Wen, Zhen Liu, Panhe Hu, Zhenghui Gong:
Generalized Thinned Coprime Array for DOA Estimation. 4600-4604 - Ahmed M. A. Shaalan, Jun Du, Yanhui Tu:
TCLA Array: A New Sparse Array Design with Less Mutual Coupling. 4605-4609 - Wanlu Shi, Yingsong Li, Sergiy A. Vorobyov:
Low Mutual Coupling Sparse Array Design Using ULA Fitting. 4610-4614 - Huiping Huang, Abdelhak M. Zoubir:
Low-Rank and Sparse Decomposition for Joint DOA Estimation and Contaminated Sensors Detection with Sparsely Contaminated Arrays. 4615-4619 - Sina Shahsavari, Jacob Millhiser, Piya Pal:
Fundamental Trade-Offs in Noisy Super-Resolution with Synthetic Apertures. 4620-4624 - Amir Weiss, Arie Yeredor:
Enhanced Blind Calibration of Uniform Linear Arrays with One-Bit Quantization by Kullback-Leibler Divergence Covariance Fitting. 4625-4629 - Amir Weiss, Arie Yeredor:
Non-Iterative Blind Calibration of Nested Arrays with Asymptotically Optimal Weighting. 4630-4634 - Luca Ferranti, Kalle Åström, Magnus Oskarsson, Jani Boutellier, Juho Kannala:
Sensor Networks TDOA Self-Calibration: 2D Complexity Analysis and Solutions. 4635-4639 - Martin Larsson, Gabrielle Flood, Magnus Oskarsson, Kalle Åström:
Fast and Robust Stratified Self-Calibration Using Time-Difference-Of-Arrival Measurements. 4640-4644 - Ghattas Akkad, Ali Mansour, Bachar El-Hassan, Elie Inaty:
Stability Analysis of the RC-PLMS Adaptive Beamformer Using a Simple Transfer Function Approximation. 4645-4649 - Saeid Sedighi, Bhavani Shankar, Mojtaba Soltanalian, Björn E. Ottersten:
On The Asymptotic Performance of One-Bit Co-Array-Based Music. 4650-4654 - Rui Huang, Le Yang, Jun Tao, Yanbo Xue:
Kld Minimization-Based Constrained Measurement Filtering For Two-Step TDOA Indoor Tracking. 4655-4659 - Mahboobeh Sedighizad, Babak Seyfe, Shahrokh Valaee:
A Correntropy Based Algorithm for Robust Localization in Wireless Networks. 4660-4664 - Hengyan Liu, Wei Dai, Yuan Shen:
MuG: A Multipath-Exploited and Grid-Free Localisation Method. 4665-4669 - Ruchi Pandey, Santosh Nannuru, Aditya Siripuram:
Sparse Bayesian Learning for Acoustic Source Localization. 4670-4674 - You Lu, Yue Tian, Shaobo Han, Eric Cosatto, Sarper Ozharar, Yangmin Ding:
Automatic Fine-Grained Localization of Utility Pole Landmarks on Distributed Acoustic Sensing Traces Based on Bilinear Resnets. 4675-4679 - Yifan Wu, Roshan Sai Ayyalasomayajula, Michael J. Bianco, Dinesh Bharadia, Peter Gerstoft:
SSLIDE: Sound Source Localization for Indoors Based on Deep Learning. 4680-4684 - Yagiz Savas, Abolfazl Hashemi, Abraham P. Vinod, Brian M. Sadler, Ufuk Topcu:
Physical-Layer Security via Distributed Beamforming in the Presence of Adversaries with Unknown Locations. 4685-4689 - Anh-Huy Phan, Petr Tichavský, Konstantin Sobolev, Konstantin Sozykin, Dmitry Ermilov, Andrzej Cichocki:
Canonical Polyadic Tensor Decomposition With Low-Rank Factor Matrices. 4690-4694 - Yijing Chu, Shing-Chow Chan, Cheuk Ming Mak, Ming Wu:
A Diffusion FXLMS Algorithm for Multi-Channel Active Noise Control and Variable Spatial Smoothing. 4695-4699 - Ban-Sok Shin, Dmitriy Shutin:
ADAPT-Then-Combine Full Waveform Inversion for Distributed Subsurface Imaging In Seismic Networks. 4700-4704 - Julio Wissing, Benedikt T. Boenninghoff, Dorothea Kolossa, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Christopher Schymura:
Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain. 4705-4709 - Youngjoon Yu, Hong Joo Lee, Byeong Cheon Kim, Jung Uk Kim, Yong Man Ro:
Towards Robust Training of Multi-Sensor Data Fusion Network Against Adversarial Examples in Semantic Segmentation. 4710-4714 - Navid Reyhanian, Hamid Farmanbar, Zhi-Quan Luo:
Data-Driven Adaptive Network Resource Slicing for Multi-Tenant Networks. 4715-4719 - Zhongyuan Zhao, Gunjan Verma, Chirag Rao, Ananthram Swami, Santiago Segarra:
Distributed Scheduling Using Graph Neural Networks. 4720-4724 - Arindam Chowdhury, Gunjan Verma, Chirag Rao, Ananthram Swami, Santiago Segarra:
Efficient Power Allocation Using Graph Neural Networks and Deep Algorithm Unfolding. 4725-4729 - Marcos M. Vasconcelos, Urbashi Mitra:
A Sample-Efficient Scheme for Channel Resource Allocation in Networked Estimation. 4730-4734 - Weikun Chen, Ya-Feng Liu, Yu-Hong Dai, Zhi-Quan Luo:
An Efficient Linear Programming Rounding-and-Refinement Algorithm for Large-Scale Network Slicing Problem. 4735-4739 - Mohammad Javad-Kalbasi, Shahrokh Valaee:
Efficient Migration to the Next Generation of Networks Based on Digital Annealing. 4740-4744 - Ana I. Pérez-Neira, Miguel Angel Lagunas:
A Technique for OFDM Symbol Slicing. 4745-4749 - Holger Boche, Rafael F. Schaefer, H. Vincent Poor:
Communication Over Block Fading Channels - An Algorithmic Perspective On Optimal Transmission Schemes. 4750-4754 - Silei Wang, Fanxiang Kong, Qiang Li:
Secure UAV Communications Under Uncertain Eavesdroppers Locations. 4755-4759 - Chen Quan, Baocheng Geng, Pramod K. Varshney:
On Strategic Jamming in Distributed Detection Networks. 4760-4764 - Holger Boche, Rafael F. Schaefer, H. Vincent Poor:
Real Number Signal Processing can Detect Denial-of-Service Attacks. 4765-4769 - Jamison R. Ebert, Vamsi K. Amalladinne, Jean-François Chamberland, Krishna R. Narayanan:
A Hybrid Approach to Coded Compressed Sensing Where Coupling Takes Place Via the Outer Code. 4770-4774 - Bho Matthiesen, Yijie Mao, Petar Popovski, Bruno Clerckx:
Globally Optimal Beamforming for Rate Splitting Multiple Access. 4775-4779 - Haiyang Zhang, Nir Shlezinger, Francesco Guidi, Davide Dardari, Mohammadreza F. Imani, Yonina C. Eldar:
Beam Focusing for Multi-User MIMO Communications with Dynamic Metasurface Antennas. 4780-4784 - Kai Li, Ying Li, Lei Cheng, Qingjiang Shi, Zhi-Quan Luo:
Pushing The Limit of Type I Codebook For Fdd Massive Mimo Beamforming: A Channel Covariance Reconstruction Approach. 4785-4789 - Chong Zhang, Min Dong, Ben Liang:
First-Order Fast Algorithm for Structurally Optimal Multi-Group Multicast Beamforming in Large-Scale Systems. 4790-4794 - Aakash Arora, Christos G. Tsinos, R. Bhavani Shankar Mysore, Symeon Chatzinotas, Björn E. Ottersten:
Analog Beamforming With Antenna Selection For Large-Scale Antenna Arrays. 4795-4799 - Chandan Kumar Sheemar, Dirk T. M. Slock:
Beamforming for Bidirectional Mimo Full Duplex Under the Joint Sum Power and Per Antenna Power Constraints. 4800-4804 - Hamza Djelouat, Markus Leinonen, Markku J. Juntti:
Iterative Reweighted Algorithms for Joint User Identification and Channel Estimation in Spatially Correlated Massive MTC. 4805-4809 - R. S. Prasobh Sankar, Sundeep Prabhakar Chepuri:
Millimeter Wave MIMO Channel Estimation with 1-bit Spatial Sigma-Delta Analog-to-Digital Converters. 4810-4814 - Liang Liu, Ya-Feng Liu:
An Efficient Algorithm For Device Detection And Channel Estimation In Asynchronous IOT Systems. 4815-4819 - Chu Li, Jeremy Brauer, Aydin Sezgin, Christian T. Zenger:
Kalman Filter Based MIMO CSI Phase Recovery for COTS Wifi Devices. 4820-4824 - Jianxiu Li, Urbashi Mitra:
Improved Atomic Norm Based Channel Estimation for Time-Varying Narrowband Leaked Channels. 4825-4829 - Shuai Huang, Deqiang Qiu, Trac D. Tran:
Bayesian Massive MIMO Channel Estimation with Parameter Estimation Using Low-Resolution ADCs. 4830-4834 - Khalid A. Almahorg, Ramy H. Gohary:
Optimal Detection in the Presence of Non-Gaussian Jamming. 4835-4839 - Ziyue Wang, Zhilin Chen, Ya-Feng Liu, Foad Sohrabi, Wei Yu:
An Efficient Active Set Algorithm for Covariance Based Joint Data and Activity Detection for Massive Random Access with Massive MIMO. 4840-4844 - Dexin Zhang, Jincheng Dai, Kailin Tan, Kai Niu, Mingzhe Chen, H. Vincent Poor, Shuguang Cui:
Neural Layered Min-Sum Decoding for Protograph LDPC Codes. 4845-4849 - John D. Roth, David Alan Garren, R. Clark Robertson:
Integer Carrier Frequency Offset Estimation in OFDM with Zadoff-Chu Sequences. 4850-4854 - Osman Musa, Peter Jung, Giuseppe Caire:
Plug-And-Play Learned Gaussian-mixture Approximate Message Passing. 4855-4859 - Dongyun Kam, Byeong Yong Kong, Youngjoo Lee:
Low-Latency Polar Decoder Using Overlapped SCL Processing. 4860-4864 - Juan Vidal Alegría, Fredrik Rusek, Jesús Rodríguez Sánchez, Ove Edfors:
Modular Binary Tree Architecture for Distributed Large Intelligent Surface. 4865-4869 - Xin Guan, Xiaotong Zhao, Qingjiang Shi:
Stochastic Successive Weighted Sum-Rate Maximization for Multiuser MIMO Systems with Finite-Alphabet Inputs. 4870-4874 - Barak Avraham, Uri Erez, Elad Domanovitz:
Rate 1 Quasi Orthogonal Universal Transmission and Combining for MIMO Systems Achieving Full Diversity. 4875-4879 - Sumit Gautam, Symeon Chatzinotas, Björn E. Ottersten:
Energy Efficiency Optimization Technique for SWIPT-Enabled Multi-Group Multicasting Systems with Heterogeneous Users. 4880-4884 - André R. Flores, Rodrigo Caiado de Lamare, Bruno Clerckx:
Multi-Branch Tomlinson-Harashima Precoding for Rate Splitting Based Systems with Multiple Antennas. 4885-4889 - Mingjie Shao, Wing-Kin Ma:
Divide and Conquer: One-bit MIMO-OFDM Detection by Inexact Expectation Maximization. 4890-4894 - Priyadarshi Mukherjee, Constantinos Psomas, Ioannis Krikidis:
Differential Chaos Shift Keying-Based Wireless Power Transfer. 4895-4899 - Ting-Kuei Hu, Fernando Gama, Tianlong Chen, Zhangyang Wang, Alejandro Ribeiro, Brian M. Sadler:
VGAI: End-to-End Learning of Vision-Based Decentralized Controllers for Robot Swarms. 4900-4904 - Wen Jiang, Yihui Ren, Ying Liu, Ziao Wang, Xinghua Wang:
Recognition of Dynamic Hand Gesture Based on Mm-Wave Fmcw Radar Micro-Doppler Signatures. 4905-4909 - Paolo Di Lorenzo, Claudio Battiloro, Mattia Merluzzi, Sergio Barbarossa:
Dynamic Resource Optimization for Adaptive Federated Learning at the Wireless Network Edge. 4910-4914 - Lissy Pellaco, Mats Bengtsson, Joakim Jaldén:
Deep Weighted MMSE Downlink Beamforming. 4915-4919 - Sagar Shrestha, Xiao Fu, Mingyi Hong:
Deep Generative Model Learning For Blind Spectrum Cartography with NMF-Based Radio Map Disaggregation. 4920-4924 - Muhammad Shahmeer Omar, Xiaoli Ma:
Mitigating Clipping Distortion in OFDM Using Deep Residual Learning. 4925-4929 - Isayiyas Nigatu Tiba, Quan Zhang, Jing Jiang, Yongchao Wang:
A Low-Complexity Admm-Based Massive Mimo Detectors Via Deep Neural Networks. 4930-4934 - Ziqi Ke, Haris Vikalo:
Real-Time Radio Modulation Classification With An LSTM Auto-Encoder. 4935-4939 - Foad Sohrabi, Zhilin Chen, Wei Yu:
Deep Active Learning Approach to Adaptive Beamforming for mmWave Initial Alignment. 4940-4944 - Haoran Sun, Wenqiang Pu, Minghe Zhu, Xiao Fu, Tsung-Hui Chang, Mingyi Hong:
Learning to Continuously Optimize Wireless Resource in Episodically Dynamic Environment. 4945-4949 - Abhishek Kumar, Gunjan Verma, Chirag Rao, Ananthram Swami, Santiago Segarra:
Adaptive Contention Window Design Using Deep Q-Learning. 4950-4954 - Ezra Tampubolon, Haris Ceribasic, Holger Boche:
On Information Asymmetry in Online Reinforcement Learning. 4955-4959 - Fanxiang Kong, Qiang Li, Huaizong Shao:
Jamming Strategy Generation for Hidden Communication Modes Via Graph Convolution Networks. 4960-4964 - Navid Naderializadeh:
Contrastive Self-Supervised Learning for Wireless Power Control. 4965-4969 - Yair Sorek, Koby Todros:
Measure-Transformed Covariance Test for Robust Spectrum Sensing. 4970-4974 - Tidhar Lambez, Kobi Cohen:
Searching for Anomalies with Multiple Plays under Delay and Switching Costs. 4975-4979 - Fabio Fabozzi, Stéphanie Bidon, Sébastien Roche:
Robust estimation of high-order phase dynamics using Variational Bayes inference. 4980-4984 - Jean P. Chereau, Bruno Scalzo, Danilo P. Mandic:
Robust PCA Through Maximum Correntropy Power Iterations. 4985-4989 - Lang Liu, Joseph Salmon, Zaïd Harchaoui:
Score-Based Change Detection For Gradient-Based Learning Machines. 4990-4994 - Marek W. Rupniewski:
Super-Resolution Of Periodic Signals From Short Sequences Of Samples. 4995-4999 - Vikram Krishnamurthy:
Quickest Change Detection With Time Inconsistent Anticipatory Agents In Cyber-Physical Systems. 5000-5004 - Abhin Shah, Kartik Ahuja, Karthikeyan Shanmugam, Dennis Wei, Kush R. Varshney, Amit Dhurandhar:
Treatment Effect Estimation Using Invariant Risk Minimization. 5005-5009 - Kian Blanchette, Wesley S. Burr, Glen Takahara:
An F-Test for Polynomial Frequency Modulation. 5010-5014 - Taposh Banerjee, Smruti Padhy, Ahmad F. Taha, Eugene John:
Quickest Joint Detection and Classification of Faults in Statistically Periodic Processes. 5015-5019 - Dominik Reinhard, Michael Fauß, Abdelhak M. Zoubir:
An Asymptotically Pointwise Optimal Procedure For Sequential Joint Detection And Estimation. 5020-5024 - Amish Goel, Pierre Moulin:
Locally Optimal Detection of Stochastic Targeted Universal Adversarial Perturbations. 5025-5029 - Muhammad I. Qureshi, Ran Xin, Soummya Kar, Usman A. Khan:
A Decentralized Variance-Reduced Method for Stochastic Optimization Over Directed Graphs. 5030-5034 - Juan Augusto Maya, Leonardo Rey Vega:
On Distributed Composite Tests with Dependent Observations in WSN. 5035-5039 - Zhaoxian Wu, Han Shen, Tianyi Chen, Qing Ling:
Byzantine-Resilient Decentralized TD Learning with Linear Function Approximation. 5040-5044 - Juan Augusto Maya, Leonardo Rey Vega:
On The Effect of Spatial Correlation on Distributed Energy Detection of a Stochastic Process. 5045-5049 - Yiran He, Hoi-To Wai:
Provably Fast Asynchronous And Distributed Algorithms For Pagerank Centrality Computation. 5050-5054 - Rajarshi Saha, Stefano Rini, Milind Rao, Andrea Goldsmith:
Decentralized Optimization Over Noisy, Rate-Constrained Networks: How We Agree By Talking About How We Disagree. 5055-5059 - Andrey Garnaev, Athina P. Petropulu, Wade Trappe:
A Multiple Access Channel Game Using Latency Metric. 5060-5064 - Ralf R. Müller, Bernhard Gäde, Ali Bereyhi:
Linear Computation Coding. 5065-5069 - Eduardo Pavez, Benjamin Girault, Antonio Ortega, Philip A. Chou:
Spectral Folding And Two-Channel Filter-Banks On Arbitrary Graphs. 5070-5074 - Tsubasa Kusano, Kohei Yatabe, Yasuhiro Oikawa:
Sparse Time-Frequency Representation Via Atomic Norm Minimization. 5075-5079 - Alihan Kaplan, Volker Pohl:
Message Transmission Over Rapidly Time-Varying Channels. 5080-5084 - Linxiao Yang, Qingsong Wen, Bo Yang, Liang Sun:
A Robust and Efficient Multi-Scale Seasonal-Trend Decomposition. 5085-5089 - Charilaos A. Zisou, Georgios K. Apostolidis, Leontios J. Hadjileontiadis:
Noise-Assisted Multivariate Variational Mode Decomposition. 5090-5094 - Neophytos Charalambides, Mert Pilanci, Alfred O. Hero III:
Approximate Weighted C R Coded Matrix Multiplication. 5095-5099 - Pranav Kulkarni, P. P. Vaidyanathan:
Periodic Signal Denoising: An Analysis-Synthesis Framework Based on Ramanujan Filter Banks and Dictionaries. 5100-5104 - Jian Vora, Ajit Rajwade:
Compressive Signal Recovery Under Sensing Matrix Errors Combined With Unknown Measurement Gains. 5105-5109 - Hao Sun, Junting Chen:
Grid Optimization for Matrix-Based Source Localization Under Inhomogeneous Sensor Topology. 5110-5114 - Mona Zehni, Zhizhen Zhao:
MSR-GAN: Multi-Segment Reconstruction via Adversarial Learning. 5115-5119 - Guanqiang Zhou, Zhi Tian:
Count Sketch with Zero Checking: Efficient Recovery of Heavy Components. 5120-5124 - Zhichao Wang, Victor Solo:
Numerical Solution of Stochastic Differential Equations in Stiefel Manifolds via Tangent Space Parametrization. 5125-5129 - Hamish McPhee, Lorenzo Ortega, Jordi Vilà-Valls, Eric Chaumette:
On The Accuracy Limit of Joint Time-Delay/Doppler/Acceleration Estimation with a Band-Limited Signal. 5130-5134 - Farah Nassif, Soosan Beheshti:
Automatic Order Selection in Autoregressive Modeling with Application in EEG Sleep-Stage Classification. 5135-5139 - Bastien Berthelot, Éric Grivel, Pierrick Legrand:
New Variants of DFA Based on Loess and Lowess Methods: Generalization of the Detrending Moving Average. 5140-5144 - Rui Zhou, Junyan Liu, Sandeep Kumar, Daniel P. Palomar:
Parameter Estimation for Student's t VAR Model with Missing Data. 5145-5149 - Yifan Ran, Wei Dai:
Fast and Robust ADMM for Blind Super-Resolution. 5150-5154 - Bruno Scalzo, Alvaro Arroyo, Ljubisa Stankovic, Danilo P. Mandic:
Nonstationary Portfolios: Diversification in the Spectral Domain. 5155-5159 - Antoine Collas, Florent Bouchard, Arnaud Breloy, Chenfang Ren, Guillaume Ginolhac, Jean Philippe Ovarlez:
A Tyler-Type Estimator of Location and Scatter Leveraging Riemannian Optimization. 5160-5164 - Felix Schwock, Shima Abadi:
Statistical Properties of a Modified Welch Method That Uses Sample Percentiles. 5165-5169 - Namrata Nadagouda, Mark A. Davenport:
Switched Hawkes Processes. 5170-5174 - Tarig Ballal, Abdelrahman S. Abdelrahman, Ali H. Muqaibel, Tareq Y. Al-Naffouri:
An Adaptive Regularization Approach to Portfolio Optimization. 5175-5179 - Arpan Mukherjee, Ali Tajer, Pin-Yu Chen, Payel Das:
Active Estimation From Multimodal Data. 5180-5184 - Virginia Bordignon, Stefan Vlaski, Vincenzo Matta, Ali H. Sayed:
Network Classifiers Based on Social Learning. 5185-5189 - Anirudh Sridhar, H. Vincent Poor:
Bayes-Optimal Methods for Finding the Source of a Cascade. 5190-5194 - Burak Hasircioglu, Deniz Gündüz:
Private Wireless Federated Learning with Anonymous Over-the-Air Computation. 5195-5199 - Gökhan Gül, Michael Baßler:
Scalable Multilevel Quantization for Distributed Detection. 5200-5204 - Alejandro Parada-Mayorga, Alejandro Ribeiro:
Stability of Algebraic Neural Networks to Small Perturbations. 5205-5209 - Lin Zhou, Alfred Olivier Hero:
Resolution Limits of 20 Questions Search Strategies for Moving Targets. 5210-5214 - Y. Efe Erginbas, Stefan Vlaski, Ali H. Sayed:
Gramian-Based Adaptive Combination Policies for Diffusion Learning Over Networks. 5215-5219 - Konstantinos D. Polyzos, Qin Lu, Georgios B. Giannakis:
Graph-Adaptive Incremental Learning Using an Ensemble of Gaussian Process Experts. 5220-5224 - Siavash Mollaebrahim, Daniel Romero, Baltasar Beferull-Lozano:
Fast Decentralized Linear Functions Via Successive Graph Shift Operators. 5225-5229 - Stefania Sardellitti, Sergio Barbarossa, Paolo Di Lorenzo:
Online Learning of Time-Varying Signals and Graphs. 5230-5234 - Vitor Rosa Meireles Elias, Vinay Chakravarthi Gogineni, Wallace A. Martins, Stefan Werner:
Kernel Regression on Graphs in Random Fourier Features Space. 5235-5239 - Stefan Vlaski, Ali H. Sayed:
Graph-Homomorphic Perturbations for Private Decentralized Learning. 5240-5244 - Zhan Gao, Elvin Isufi, Alejandro Ribeiro:
Variance-Constrained Learning for Stochastic Graph Neural Networks. 5245-5249 - Wenzhong Yan, Di Jin, Zhidi Lin, Feng Yin:
Graph Neural Network for Large-Scale Network Localization. 5250-5254 - Luana Ruiz, Zhiyang Wang, Alejandro Ribeiro:
Graphon and Graph Neural Network Stability. 5255-5259 - Fernando Gama, Ekaterina I. Tolstaya, Alejandro Ribeiro:
Graph Neural Networks for Decentralized Controllers. 5260-5264 - Luana Ruiz, Fernando Gama, Alejandro Ribeiro, Elvin Isufi:
Nonlinear State-Space Generalizations of Graph Convolutional Neural Networks. 5265-5269 - Zhan Gao, Alejandro Ribeiro, Fernando Gama:
Wide and Deep Graph Neural Networks with Distributed Online Learning. 5270-5274 - Junya Hara, Koki Yamada, Shunsuke Ono, Yuichi Tanaka:
Design of Graph Signal Sampling Matrices for Arbitrary Signal Subspaces. 5275-5279 - Masatoshi Nagahama, Koki Yamada, Yuichi Tanaka, Stanley H. Chan, Yonina C. Eldar:
Graph Signal Denoising Using Nested-Structured Deep Algorithm Unrolling. 5280-5284 - Yiran He, Hoi-To Wai:
Identifying First-Order Lowpass Graph Signals Using Perron Frobenius Theorem. 5285-5289 - Siheng Chen, Yonina C. Eldar:
Graph Signal Denoising Via Unrolling Networks. 5290-5294 - Théo Gnassounou, Pierre Humbert, Laurent Oudre:
Adaptive Subsampling of Multidomain Signals with Product Graphs. 5295-5299 - Samuel Rey, Antonio G. Marques:
Robust Graph-Filter Identification with Graph Denoising Regularization. 5300-5304 - Mostafa Rahmani, Ping Li:
Fast and Provable Robust PCA VIA Normalized Coherence Pursuit. 5305-5309 - Ohad Rahamim, Ronen Talmon:
Aligning Sets of Temporal Signals with Riemannian Geometry and Koopman Operator. 5310-5314 - Elie Leroy, Arthur Marmin, Marc Castella, Laurent Duval:
Weight Identification Through Global Optimization in a New Hysteretic Neural Network Model. 5315-5319 - Yacouba Kaloga, Pierre Borgnat, Sundeep Prabhakar Chepuri, Patrice Abry, Amaury Habrard:
Multiview Variational Graph Autoencoders for Canonical Correlation Analysis. 5320-5324 - Baocheng Geng, Quan Chen, Pramod K. Varshney:
Cognitive Memory Constrained Human Decision Making based on Multi-source Information. 5325-5329 - Raphael Keusch, Hampus Malmberg, Hans-Andrea Loeliger:
Binary Control and Digital-to-Analog Conversion Using Composite NUV Priors and Iterative Gaussian Message Passing. 5330-5334 - Konstantinos Slavakis, Masahiro Yukawa:
Outlier-Robust Kernel Hierarchical-Optimization RLS on a Budget with Affine Constraints. 5335-5339 - Mahdi Imani, Seyede Fatemeh Ghoreishi:
Adaptive Real-Time Filter for Partially-Observed Boolean Dynamical Systems. 5340-5344 - Jonathan Kern, Elsa Dupraz, Abdeldjalil Aïssa-El-Bey, François Leduc-Primeau:
Improving the Energy-Efficiency of a Kalman Filter Using Unreliable Memories. 5345-5349 - Fatemeh Yaghoobi, Adrien Corenflos, Sakira Hassan, Simo Särkkä:
Parallel Iterated Extended and Sigma-Point Kalman Smoothers. 5350-5354 - Bastian Seifert, Chris Wendler, Markus Püschel:
Wiener Filter on Meet/Join Lattices. 5355-5359 - Michele Cirillo, Vincenzo Matta, Ali H. Sayed:
Learning Bollobás-Riordan Graphs Under Partial Observability. 5360-5364 - Saghar Bagheri, Gene Cheung, Antonio Ortega, Fen Wang:
Learning Sparse Graph Laplacian with K Eigenvector Prior via Iterative Glasso and Projection. 5365-5369 - Shahana Ibrahim, Xiao Fu:
Learning Mixed Membership from Adjacency Graph Via Systematic Edge Query: Identifiability and Algorithm. 5370-5374 - Mircea Moscu, Ricardo Augusto Borsoi, Cédric Richard:
Convergence Analysis of the Graph-Topology-Inference Kernel LMS Algorithm. 5375-5379 - Xiaolu Wang, Chaorui Yao, Haoyu Lei, Anthony Man-Cho So:
An Efficient Alternating Direction Method for Graph Learning from Smooth Signals. 5380-5384 - Geert Leus, Maosheng Yang, Mario Coutino, Elvin Isufi:
Topological Volterra Filters. 5385-5389 - T. Mitchell Roddenberry, Madeline Navarro, Santiago Segarra:
Network Topology Inference with Graphon Spectral Penalties. 5390-5394 - Chiraag Kaushik, T. Mitchell Roddenberry, Santiago Segarra:
Network Topology Change-Point Detection from Graph Signals with Prior Spectral Signatures. 5395-5399 - Alberto Natali, Mario Coutino, Elvin Isufi, Geert Leus:
Online Time-Varying Topology Identification Via Prediction-Correction Algorithms. 5400-5404 - B. Subbareddy, Aditya Siripuram, Jingxin Zhang:
Graph Learning Under Spectral Sparsity Constraints. 5405-5409 - Tatsuya Koyakumaru, Masahiro Yukawa, Eduardo Pavez, Antonio Ortega:
A Graph Learning Algorithm Based On Gaussian Markov Random Fields And Minimax Concave Penalty. 5410-5414 - Matthias Minder, Zahra Farsijani, Dhruti Shah, Mireille El Gheche, Pascal Frossard:
Figlearn: Filter and Graph Learning Using Optimal Transport. 5415-5419 - Huang Bai, Chuanrong Hong, Xiumei Li:
Construction of Unit-Norm Tight Frame Based Preconditioner for Sparse Coding. 5420-5424 - Jinxin Wang, Zengde Deng, Taoli Zheng, Anthony Man-Cho So:
Sparse High-Order Portfolios Via Proximal Dca And Sca. 5425-5429 - Hiroki Kuroda, Daichi Kitahara, Akira Hirabayashi:
A Convex Penalty for Block-Sparse Signals with Unknown Structures. 5430-5434 - Dorian Florescu, Felix Krahmer, Ayush Bhandari:
Event-Driven Modulo Sampling. 5435-5439 - Pulak Sarangi, Piya Pal:
No Relaxation: Guaranteed Recovery of Finite-Valued Signals from Undersampled Measurements. 5440-5444 - Dilshad Surroop, Pascal Combes, Philippe Martin:
Error Estimates in Second-Order Continuous-Time Sigma-Delta Modulators. 5445-5448 - Samuel Pinilla, Kumar Vijay Mishra, Brian M. Sadler, Henry Arguello:
Banraw: Band-Limited Radar Waveform Design Via Phase Retrieval. 5449-5453 - Satish Mulleti, Kiryung Lee, Yonina C. Eldar:
Sub-NYQUIST Multichannel Blind Deconvolution. 5454-5458 - Arian Eamaz, Farhang Yeganegi, Mojtaba Soltanalian:
Modified Arcsine Law for One-Bit Sampled Stationary Signals with Time-Varying Thresholds. 5459-5463 - Muhammed Tahsin Rahman, Mohammad Javad-Kalbasi, Shahrokh Valaee:
Near-Optimal Resampling in Particle Filters Using the Ising Energy Model. 5464-5468 - Holger Boche, Ullrich J. Mönich:
Time-Domain Concentration and Approximation of Computable Bandlimited Signals. 5469-5473 - Marek Hilton, Roxana Alexandru, Pier Luigi Dragotti:
Guaranteed Reconstruction from Integrate-and-Fire Neurons with Alpha Synaptic Activation. 5474-5478 - Konstantinos Ntemos, Virginia Bordignon, Stefan Vlaski, Ali H. Sayed:
Social Learning Under Inferential Attacks. 5479-5483 - Vikram Krishnamurthy, Rui Luo, Buddhika Nettasinghe:
Segregation in Social Networks: MARKOV Bridge Models and Estimation. 5484-5488 - Kobi Cohen, Amir Leshem:
Controlled Testing and Isolation for Suppressing Covid-19. 5489-5493 - Saurabh Sihag, Ali Tajer, Urbashi Mitra:
Two-Stage Graph-Constrained Group Testing: Theory and Application. 5494-5498 - Vassilis N. Ioannidis, Dimitris Berberidis, Georgios B. Giannakis:
Unveiling Anomalous Nodes Via Random Sampling and Consensus on Graphs. 5499-5503 - Alexandre Reiffers-Masson, Thierry Chonavel, Yezekael Hayel:
Estimating Fiedler Value on Large Networks Based on Random Walk Observations. 5504-5508 - Dion Eustathios Olivier Tzamarias, Eduardo Pavez, Benjamin Girault, Antonio Ortega, Ian Blanes, Joan Serra-Sagristà:
Orthogonality and Zero DC Tradeoffs in Biorthogonal Graph Filterbanks. 5509-5513 - Pei Li, Nir Shlezinger, Haiyang Zhang, Baoyun Wang, Yonina C. Eldar:
Graph Signal Compression via Task-Based Quantization. 5514-5518 - Mehdi Chahine Amrouche, Hervé Carfantan, Jérôme Idier:
A Partially Collapsed Gibbs Sampler for Unsupervised Nonnegative Sparse Signal Restoration. 5519-5523 - Bin She, Yaojun Wang, Guangmin Hu:
A Structure-Guided and Sparse-Representation-Based 3D Seismic Inversion Method. 5524-5528 - Yilang Zhang, Bingcong Li, Georgios B. Giannakis:
Accelerating Frank-Wolfe with Weighted Average Gradients. 5529-5533 - Giovanni Chierchia, Mireille El Gheche:
Yapa: Accelerated Proximal Algorithm for Convex Composite Problems. 5534-5538 - Elyas Sabeti, Peter X. K. Song, Alfred O. Hero III:
Data Discovery Using Lossless Compression-Based Sparse Representation. 5539-5543 - Cássio F. Dantas, Emmanuel Soubies, Cédric Févotte:
Safe Screening for Sparse Regression with the Kullback-Leibler Divergence. 5544-5548 - Tianxiang Gao, Songtao Lu, Jia Liu, Chris Chu:
On the Convergence of Randomized Bregman Coordinate Descent for Non-Lipschitz Composite Problems. 5549-5553 - Keita Kume, Isao Yamada:
A Global Cayley Parametrization of Stiefel Manifold for Direct Utilization of Optimization Mechanisms Over Vector Spaces. 5554-5558 - Songtao Lu, Naweed Khan, Ismail Yunus Akhalwaya, Ryan Riegel, Lior Horesh, Alexander G. Gray:
Training Logical Neural Networks by Primal-Dual Methods for Neuro-Symbolic Reasoning. 5559-5563 - Caio Gomes de Figueredo, Claudio J. Bordin Jr., Marcelo G. S. Bruno:
Cooperative Parameter Tracking on the Unit Sphere Using Distributed Adapt-Then-Combine Particle Filters and Parallel Transport. 5564-5568 - Douglas E. Johnston, Petar M. Djuric:
Bayesian Estimation of a Tail-Index with Marginalized Threshold. 5569-5573 - Rui Min, Christelle Garnier, François Septier, John Klein:
Block Kalman Filter: An Asymptotic Block Particle Filter in the Linear Gaussian Case. 5574-5578 - Yousef El-Laham, Liu Yang, Heather J. Lynch, Petar M. Djuric, Mónica F. Bugallo:
Particle Gibbs Sampling for Regime-Switching State-Space Models. 5579-5583 - Hechuan Wang, Mónica F. Bugallo, Petar M. Djuric:
Adaptive Importance Sampling Via Auto-Regressive Generative Models and Gaussian Processes. 5584-5588 - Chenhao Li, Simon J. Godsill:
Variational Parameter Learning in Sequential State-Space Model Via Particle Filtering. 5589-5593 - Jian Ding, Jianji Wang, Yue Zhang, Yuanjie Li, Nanning Zheng:
Correlation-Based Robust Linear Regression with Iterative Outlier Removal. 5594-5598 - Sebastian Ament, Carla P. Gomes:
On the Optimality of Backward Regression: Sparse Recovery and Subset Selection. 5599-5603 - Aditya Sant, Markus Leinonen, Bhaskar D. Rao:
General Total Variation Regularized Sparse Bayesian Learning for Robust Block-Sparse Signal Recovery. 5604-5608 - Michael Weylandt, George Michailidis:
Automatic Registration and Clustering of Time Series. 5609-5613 - Seyyid Emre Sofuoglu, Selin Aviyente:
Low-Rank on Graphs Plus Temporally Smooth Sparse Decomposition for Anomaly Detection in Spatiotemporal Data. 5614-5618 - Tianyi Liu, Andreas M. Tillmann, Yang Yang, Yonina C. Eldar, Marius Pesavento:
A Parallel Algorithm for Phase Retrieval with Dictionary Learning. 5619-5623 - Yao Tian, Haitao Yao, Meng Cai, Yaming Liu, Zejun Ma:
Improving RNN Transducer Modeling for Small-Footprint Keyword Spotting. 5624-5628 - Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman:
Cascaded Encoders for Unifying Streaming and Non-Streaming ASR. 5629-5633 - Bo Li, Anmol Gulati, Jiahui Yu, Tara N. Sainath, Chung-Cheng Chiu, Arun Narayanan, Shuo-Yiin Chang, Ruoming Pang, Yanzhang He, James Qin, Wei Han, Qiao Liang, Yu Zhang, Trevor Strohman, Yonghui Wu:
A Better and Faster End-to-End Model for Streaming ASR. 5634-5638 - Sankaran Panchapagesan, Daniel S. Park, Chung-Cheng Chiu, Yuan Shangguan, Qiao Liang, Alexander Gruenstein:
Efficient Knowledge Distillation for RNN-Transducer Models. 5639-5643 - Wei Zhou, Simon Berger, Ralf Schlüter, Hermann Ney:
Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition. 5644-5648 - Zuozhen Liu, Ta Li, Pengyuan Zhang:
RNN-T Based Open-Vocabulary Keyword Spotting in Mandarin with Multi-Level Detection. 5649-5653 - George Saon, Zoltán Tüske, Daniel Bolaños, Brian Kingsbury:
Advancing RNN Transducer Technology for Speech Recognition. 5654-5658 - Rohit Prabhavalkar, Yanzhang He, David Rybach, Sean Campbell, Arun Narayanan, Trevor Strohman, Tara N. Sainath:
Less is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging. 5659-5663 - Takafumi Moriya, Takanori Ashihara, Tomohiro Tanaka, Tsubasa Ochiai, Hiroshi Sato, Atsushi Ando, Yusuke Ijima, Ryo Masumura, Yusuke Shinohara:
Simpleflat: A Simple Whole-Network Pre-Training Approach for RNN Transducer-Based End-to-End Speech Recognition. 5664-5668 - Harsh Shrivastava, Ankush Garg, Yuan Cao, Yu Zhang, Tara N. Sainath:
Echo State Speech Recognition. 5669-5673 - Xianrui Zheng, Yulan Liu, Deniz Gunceler, Daniel Willett:
Using Synthetic Audio to Improve the Recognition of Out-of-Vocabulary Words in End-to-End ASR Systems. 5674-5678 - Ron J. Weiss, R. J. Skerry-Ryan, Eric Battenberg, Soroosh Mariooryad, Diederik P. Kingma:
Wave-Tacotron: Spectrogram-Free End-to-End Text-to-Speech Synthesis. 5679-5683 - Shiming Wang, Zhenhua Ling, Ruibo Fu, Jiangyan Yi, Jianhua Tao:
Patnet: A Phoneme-Level Autoregressive Transformer Network for Speech Synthesis. 5684-5688 - Qing He, Zhiping Xiu, Thilo Köhler, Jilong Wu:
Multi-Rate Attention Architecture for Fast Streamable Text-to-Speech Spectrum Modeling. 5689-5693 - Yusuke Yasuda, Xin Wang, Junichi Yamagishi:
End-to-End Text-to-Speech Using Latent Duration Based on VQ-VAE. 5694-5698 - Renqian Luo, Xu Tan, Rui Wang, Tao Qin, Jinzhu Li, Sheng Zhao, Enhong Chen, Tie-Yan Liu:
Lightspeech: Lightweight and Fast Text to Speech with Neural Architecture Search. 5699-5703 - Feng-Long Xie, Xinhui Li, Wen-Chao Su, Li Lu, Frank K. Soong:
A New High Quality Trajectory Tiling Based Hybrid TTS In Real Time. 5704-5708 - Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, Ron J. Weiss, Yonghui Wu:
Parallel Tacotron: Non-Autoregressive and Controllable TTS. 5709-5713 - Disong Wang, Liqun Deng, Yang Zhang, Nianzu Zheng, Yu Ting Yeung, Xiao Chen, Xunying Liu, Helen Meng:
Fcl-Taco2: Towards Fast, Controllable and Lightweight Text-to-Speech Synthesis. 5714-5718 - Alexandra Vioni, Myrsini Christidou, Nikolaos Ellinas, Georgios Vamvoukakis, Panos Kakoulidis, Taehoon Kim, June Sig Sung, Hyoungmin Park, Aimilios Chalamandaris, Pirros Tsiakoulis:
Prosodic Clustering for Phoneme-Level Prosody Control in End-to-End Speech Synthesis. 5719-5723 - Cheng Gong, Longbiao Wang, Zhenhua Ling, Shaotong Guo, Ju Zhang, Jianwu Dang:
Improving Naturalness and Controllability of Sequence-to-Sequence Speech Synthesis by Learning Local Prosody Representations. 5724-5728 - Chunhui Lu, Xue Wen, Ruolan Liu, Xiao Chen:
Multi-Speaker Emotional Speech Synthesis with Fine-Grained Prosody Modeling. 5729-5733 - Xiong Cai, Dongyang Dai, Zhiyong Wu, Xiang Li, Jingbei Li, Helen Meng:
Emotion Controllable Speech Synthesis Using Emotion-Unlabeled Dataset with the Assistance of Cross-Domain Speech Emotion Recognition. 5734-5738 - Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. 5739-5743 - Hassan Taherian, DeLiang Wang:
Time-Domain Loss Modulation Based on Overlap Ratio for Monaural Conversational Speaker Separation. 5744-5748 - Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Jinyu Li, Takuya Yoshioka, Chengyi Wang, Shujie Liu, Ming Zhou:
Continuous Speech Separation with Conformer. 5749-5753 - Martin Strauss, Bernd Edler:
A Flow-Based Neural Network for Time Domain Speech Enhancement. 5754-5758 - Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu:
Sandglasset: A Light Multi-Granularity Self-Attentive Network for Time-Domain Speech Separation. 5759-5763 - Zining Zhang, Bingsheng He, Zhenjie Zhang:
TransMask: A Compact and Fast Speech Separation Model Based on Transformer. 5764-5768 - Yuan-Kuei Wu, Kuan-Po Huang, Yu Tsao, Hung-yi Lee:
One Shot Learning for Speech Separation. 5769-5773 - Matthew Maciejewski, Jing Shi, Shinji Watanabe, Sanjeev Khudanpur:
Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and a Small Step. 5774-5778 - Chenxing Li, Jiaming Xu, Nima Mesgarani, Bo Xu:
Speaker and Direction Inferred Dual-Channel Speech Separation. 5779-5783 - Deepak Baby, Hervé Bourlard:
Speech Dereverberation Using Variational Autoencoders. 5784-5788 - Hyeong-Seok Choi, Sungjin Park, Jie Hwan Lee, Hoon Heo, Dongsuk Jeon, Kyogu Lee:
Real-Time Denoising and Dereverberation with Tiny Recurrent U-Net. 5789-5793 - Jingshu Zhang, Mark D. Plumbley, Wenwu Wang:
Weighted Magnitude-Phase Loss for Speech Dereverberation. 5794-5798 - Anthony Larcher, Ambuj Mehrish, Marie Tahon, Sylvain Meignier, Jean Carrive, David Doukhan, Olivier Galibert, Nicholas W. D. Evans:
Speaker Embeddings for Diarization of Broadcast Data in the ALLIES Challenge. 5799-5803 - David Looney, Nikolay D. Gaubitch:
On the Detection of Pitch-Shifted Voice: Machines and Human Listeners. 5804-5808 - Yoohwan Kwon, Hee-Soo Heo, Bong-Jin Lee, Joon Son Chung:
The Ins and Outs of Speaker Recognition: Lessons from VoxSRC 2020. 5809-5813 - Jenthe Thienpondt, Brecht Desplanques, Kris Demuynck:
The IDLab VoxSRC-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification. 5814-5818 - Federico Landini, Ondrej Glembek, Pavel Matejka, Johan Rohdin, Lukás Burget, Mireia Díez, Anna Silnova:
Analysis of the BUT Diarization System for VoxConverse Challenge. 5819-5823 - Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong:
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020. 5824-5828 - Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang:
Squeezing Value of Cross-Domain Labels: A Decoupled Scoring Approach for Speaker Verification. 5829-5833 - Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. 5834-5838 - Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang, Hui Chen:
Meta-Learning for Cross-Channel Speaker Verification. 5839-5843 - Chenpeng Du, Bing Han, Shuai Wang, Yanmin Qian, Kai Yu:
SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification. 5844-5848 - Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit Selection Synthesis Based Data Augmentation for Fixed Phrase Speaker Verification. 5849-5853 - Xiao Chen, Stephen A. Zahorian:
Improving Speaker Verification in Reverberant Environments. 5854-5858 - Siddharth Dalmia, Yuzong Liu, Srikanth Ronanki, Katrin Kirchhoff:
Transformer-Transducers for Code-Switched Speech Recognition. 5859-5863 - Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Streaming Transformers. 5864-5868 - Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Capturing Multi-Resolution Context by Dilated Self-Attention. 5869-5873 - Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, Yuekai Zhang:
Recent Developments on ESPnet Toolkit Boosted By Conformer. 5874-5878 - Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Hierarchical Transformer-Based Large-Context End-To-End ASR with Large-Context Knowledge Distillation. 5879-5883 - Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Brian John King, Siegfried Kunzmann:
End-to-End Multi-Channel Transformer for Speech Recognition. 5884-5888 - Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao:
CASS-NAT: CTC Alignment-Based Single Step Non-Autoregressive Transformer for Speech Recognition. 5889-5893 - Xingchen Song, Zhiyong Wu, Yiheng Huang, Chao Weng, Dan Su, Helen M. Meng:
Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input. 5894-5898 - Menglong Xu, Shengqiang Li, Xiao-Lei Zhang:
Transformer-Based End-to-End Speech Recognition with Local Dense Synthesizer Attention. 5899-5903 - Xie Chen, Yu Wu, Zhenghao Wang, Shujie Liu, Jinyu Li:
Developing Real-Time Streaming Transformer Transducer for Speech Recognition on Large-Scale Dataset. 5904-5908 - Mohan Li, Catalin Zorila, Rama Doddipatla:
Head-Synchronous Decoding for Transformer-Based Streaming ASR. 5909-5913 - Keqi Deng, Gaofeng Cheng, Haoran Miao, Pengyuan Zhang, Yonghong Yan:
History Utterance Embedding Transformer LM for Speech Recognition. 5914-5918 - Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
MaskCycleGAN-VC: Learning Non-Parallel Voice Conversion with Filling in Frames. 5919-5923 - Xinyuan Yu, Brian Mak:
Non-Parallel Many-To-Many Voice Conversion by Knowledge Transfer from a Text-To-Speech Model. 5924-5928 - Chao Wang, Yibiao Yu:
Non-Parallel Many-To-Many Voice Conversion Using Local Linguistic Tokens. 5929-5933 - Kazuhiro Kobayashi, Wen-Chin Huang, Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda:
Crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder. 5934-5938 - Yist Y. Lin, Chung-Ming Chien, Jheng-Hao Lin, Hung-yi Lee, Lin-Shan Lee:
FragmentVC: Any-To-Any Voice Conversion by End-To-End Extracting and Fusing Fine-Grained Voice Fragments with Attention. 5939-5943 - Wen-Chin Huang, Yi-Chiao Wu, Tomoki Hayashi:
Any-to-One Sequence-to-Sequence Voice Conversion Using Self-Supervised Discrete Speech Representations. 5944-5948 - Mingjie Chen, Yanpei Shi, Thomas Hain:
Towards Low-Resource StarGAN Voice Conversion Using Weight Adaptive Instance Normalization. 5949-5953 - Yen-Hao Chen, Da-Yi Wu, Tsung-Han Wu, Hung-yi Lee:
Again-VC: A One-Shot Voice Conversion Using Activation Guidance and Adaptive Instance Normalization. 5954-5958 - Ying Zhang, Hao Che, Jie Li, Chenxing Li, Xiaorui Wang, Zhongyuan Wang:
One-Shot Voice Conversion Based on Speaker Aware Module. 5959-5963 - Zhiyuan Tan, Jianguo Wei, Junhai Xu, Yuqing He, Wenhuan Lu:
Zero-Shot Voice Conversion with Adjusted Speaker Embeddings and Simple Acoustic Features. 5964-5968 - Shengkui Zhao, Hao Wang, Trung Hieu Nguyen, Bin Ma:
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram. 5969-5973 - Florian Lux, Ngoc Thang Vu:
Meta-Learning for Improving Rare Word Recognition in End-to-End ASR. 5974-5978 - Rudolf A. Braun, Srikanth R. Madikeri, Petr Motlícek:
A Comparison of Methods for OOV-Word Recognition on a New Public Dataset. 5979-5983 - Hainan Xu, Yinghui Huang, Yun Zhu, Kartik Audhkhasi, Bhuvana Ramabhadran:
Convolutional Dropout and Wordpiece Augmentation for End-to-End Speech Recognition. 5984-5988 - Tae Gyoon Kang, Ho-Gyeong Kim, Min-Joong Lee, Jihyun Lee, Hoshik Lee:
Partially Overlapped Inference for Long-Form Speech Recognition. 5989-5993 - Nanxin Chen, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer. 5994-5998 - Jon Macoskey, Grant P. Strimel, Ariya Rastrow:
Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization. 5999-6003 - Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang:
FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization. 6004-6008 - Kai Zhen, Hieu Duy Nguyen, Feng-Ju Chang, Athanasios Mouchtaris, Ariya Rastrow:
Sparsification via Compressed Sensing for Automatic Speech Recognition. 6009-6013 - Zhaofeng Wu, Ding Zhao, Qiao Liang, Jiahui Yu, Anmol Gulati, Ruoming Pang:
Dynamic Sparsity Neural Networks for Automatic Speech Recognition. 6014-6018 - Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
An Asynchronous WFST-Based Decoder for Automatic Speech Recognition. 6019-6023 - Yuekai Zhang, Sining Sun, Long Ma:
Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices. 6024-6028 - Takuma Okamoto, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Noise Level Limited Sub-Modeling for Diffusion Probabilistic Vocoders. 6029-6033 - Ahmed Mustafa, Nicola Pia, Guillaume Fuchs:
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization. 6034-6038 - Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang, Jae-Min Kim:
Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators. 6039-6043 - Yunlong Jiao, Adam Gabrys, Georgi Tinchev, Bartosz Putrycz, Daniel Korzekwa, Viacheslav Klimkov:
Universal Neural Vocoding with Parallel Wavenet. 6044-6048 - Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Periodnet: A Non-Autoregressive Waveform Generation Model with a Structure Separating Periodic and Aperiodic Components. 6049-6053 - Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao:
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation. 6054-6058 - Rui Liu, Berrak Sisman, Haizhou Li:
Graphspeech: Syntax-Aware Graph Attention Network for Neural Speech Synthesis. 6059-6063 - Changhe Song, Jingbei Li, Yixuan Zhou, Zhiyong Wu, Helen M. Meng:
Syntactic Representation Learning For Neural Network Based TTS with Syntactic Parse Tree Traversal. 6064-6068 - Junjie Pan, Lin Wu, Xiang Yin, Pengfei Wu, Chenchang Xu, Zejun Ma:
A Chapter-Wise Understanding System for Text-To-Speech in Chinese Novels. 6069-6073 - Zilong Bai, Beibei Hu:
A Universal BERT-Based Front-End Model for Mandarin Text-To-Speech Synthesis. 6074-6078 - Guanghui Xu, Wei Song, Zhengchen Zhang, Chao Zhang, Xiaodong He, Bowen Zhou:
Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-End Speech Synthesis. 6079-6083 - Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker:
Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism. 6084-6088 - Zhuohuang Zhang, Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Dong Yu:
ADL-MVDR: All Deep Learning MVDR Beamformer for Target Speech Separation. 6089-6093 - Jiangyu Han, Xinyuan Zhou, Yanhua Long, Yijie Li:
Multi-Channel Target Speech Extraction with Channel Decorrelation and Target Speaker Adaptation. 6094-6098 - Marc Delcroix, Katerina Zmolíková, Tsubasa Ochiai, Keisuke Kinoshita, Tomohiro Nakatani:
Speaker Activity Driven Neural Speech Extraction. 6099-6103 - Yunzhe Hao, Jiaming Xu, Peng Zhang, Bo Xu:
Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Environments. 6104-6108 - Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-Stage Speaker Extraction with Utterance and Frame-Level Reference Signals. 6109-6113 - Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Shoko Araki:
Neural Network-Based Virtual Microphone Estimator. 6114-6118 - Poul Hoang, Zheng-Hua Tan, Jan Mark de Haan, Jesper Jensen:
Joint Maximum Likelihood Estimation of Power Spectral Densities and Relative Acoustic Transfer Functions for Acoustic Beamforming. 6119-6123 - Stefan Thaleiser, Gerald Enzner:
Cue-Preserving MMSE Filter with Bayesian SNR Marginalization for Binaural Speech Enhancement. 6124-6128 - Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Hiroshi Sawada, Shoko Araki:
Blind and Neural Network-Guided Convolutional Beamformer for Joint Denoising, Dereverberation, and Source Separation. 6129-6133 - Ke Tan, Xueliang Zhang, DeLiang Wang:
Real-Time Speech Enhancement for Mobile Communication Based on Dual-Channel Complex Spectral Mapping. 6134-6138 - Sanyuan Chen, Yu Wu, Zhuo Chen, Takuya Yoshioka, Shujie Liu, Jin-Yu Li, Xiangzhan Yu:
Don't Shoot Butterfly with Rifles: Multi-Channel Continuous Speech Separation with Early Exit Transformer. 6139-6143 - Miquel India, Pooyan Safari, Javier Hernando:
Double Multi-Head Attention for Speaker Verification. 6144-6148 - Jee-weon Jung, Hee-Soo Heo, Ha-Jin Yu, Joon Son Chung:
Graph Attention Networks for Speaker Verification. 6149-6153 - Victoria Mingote, Antonio Miguel, Alfonso Ortega Giménez, Eduardo Lleida:
Memory Layers with Multi-Head Attention Mechanisms for Text-Dependent Speaker Verification. 6154-6158 - Ali Shahin Shamsabadi, Francisco Sepúlveda Teixeira, Alberto Abad, Bhiksha Raj, Andrea Cavallaro, Isabel Trancoso:
FoolHD: Fooling Speaker Identification by Highly Imperceptible Adversarial Disturbances. 6159-6163 - Monisankha Pal, Arindam Jati, Raghuveer Peri, Chin-Cheng Hsu, Wael AbdAlmageed, Shrikanth Narayanan:
Adversarial Defense for Deep Speaker Recognition Using Hybrid Adversarial Training. 6164-6168 - Mufan Sang, Wei Xia, John H. L. Hansen:
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning. 6169-6173 - Andrew Brown, Jaesung Huh, Arsha Nagrani, Joon Son Chung, Andrew Zisserman:
Playing a Part: Speaker Verification at the Movies. 6174-6178 - Julien Balian, Raffaele Tavarone, Mathieu Poumeyrol, Alice Coucke:
Small Footprint Text-Independent Speaker Verification For Embedded Systems. 6179-6183 - Fuchuan Tong, Miao Zhao, Jianfeng Zhou, Hao Lu, Zheng Li, Lin Li, Qingyang Hong:
ASV-SUBTOOLS: Open Source Toolkit for Automatic Speaker Verification. 6184-6188 - Anurag Chowdhury, Arun Ross, Prabu David:
DEEPTALK: Vocal Style Encoding for Speaker Recognition and Speech Synthesis. 6189-6193 - Leda Sari, Kritika Singh, Jiatong Zhou, Lorenzo Torresani, Nayan Singhal, Yatharth Saraf:
A Multi-View Approach to Audio-Visual Speaker Verification. 6194-6198 - Yixin Chen, Weiyi Lu, Alejandro Mottini, Li Erran Li, Jasha Droppo, Zheng Du, Belinda Zeng:
Top-Down Attention in End-to-End Spoken Language Understanding. 6199-6203 - Md. Akmal Haidar, Mehdi Rezagholizadeh:
Fine-Tuning of Pre-Trained End-to-End Speech Recognition with Generative Adversarial Networks. 6204-6208 - Yun Tang, Juan Miguel Pino, Changhan Wang, Xutai Ma, Dmitriy Genzel:
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks. 6209-6213 - Yosuke Kashiwagi, Emiru Tsunoo, Shinji Watanabe:
Gaussian Kernelized Self-Attention for Long Sequence Data and its Application to CTC-Based Speech Recognition. 6214-6218 - Apoorv Vyas, Srikanth R. Madikeri, Hervé Bourlard:
Lattice-Free MMI Adaptation of Self-Supervised Pretrained Acoustic Models. 6219-6223 - Jaesong Lee, Shinji Watanabe:
Intermediate Loss Regularization for CTC-Based Speech Recognition. 6224-6228 - Guoyu Liu, Lixin Cao:
Code-Switch Speech Rescoring with Monolingual Data. 6229-6233 - Neeraj Gaur, Brian Farris, Parisa Haghani, Isabel Leal, Pedro J. Moreno, Manasa Prasad, Bhuvana Ramabhadran, Yun Zhu:
Mixture of Informed Experts for Multilingual Speech Recognition. 6234-6238 - Burin Naowarat, Thananchai Kongthaworn, Korrawe Karunratanakul, Sheng Hui Wu, Ekapol Chuangsuwanich:
Reducing Spelling Inconsistencies in Code-Switching ASR Using Contextualized CTC Loss. 6239-6243 - Amit Das, Kshitiz Kumar, Jian Wu:
Multi-Dialect Speech Recognition in English Using Attention on Ensemble of Experts. 6244-6248 - Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Zhengqi Wen:
Decoupling Pronunciation and Language for End-to-End Code-Switching Automatic Speech Recognition. 6249-6253 - Houjun Huang, Xu Xiang, Yexin Yang, Rao Ma, Yanmin Qian:
AISpeech-SJTU Accent Identification System for the Accented English Speech Recognition Challenge. 6254-6258 - Suransh Chopra, Puneet Mathur, Ramit Sawhney, Rajiv Ratn Shah:
Meta-Learning for Low-Resource Speech Emotion Recognition. 6259-6263 - Yifei Yin, Yu Gu, Longshan Yao, Ying Zhou, Xuefeng Liang, He Zhang:
Progressive Co-Teaching for Ambiguous Speech Emotion Recognition. 6264-6268 - Wen Wu, Chao Zhang, Philip C. Woodland:
Emotion Recognition by Fusing Time Synchronous and Time Asynchronous Representations. 6269-6273 - Atsushi Ando, Ryo Masumura, Hiroshi Sato, Takafumi Moriya, Takanori Ashihara, Yusuke Ijima, Tomoki Toda:
Speech Emotion Recognition Based on Listener Adaptive Models. 6274-6278 - Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn W. Schuller:
Speech Emotion Recognition Using Semantic Information. 6279-6283 - Amir Shirian, Tanaya Guha:
Compact Graph Architecture for Speech Emotion Recognition. 6284-6288 - Xianfeng Wang, Min Wang, Wenbo Qi, Wanqi Su, Xiangqian Wang, Huan Zhou:
A Novel End-to-End Speech Emotion Recognition Network with Stacked Transformer Layers. 6289-6293 - Srividya Tirunellai Rajamani, Kumar T. Rajamani, Adria Mallol-Ragolta, Shuo Liu, Björn W. Schuller:
A Novel Attention-Based Gated Recurrent Unit and its Efficacy in Speech Emotion Recognition. 6294-6298 - Changzeng Fu, Chaoran Liu, Carlos Toshinori Ishi, Hiroshi Ishiguro:
MAEC: Multi-Instance Learning with an Adversarial Auto-Encoder-Based Classifier for Speech Emotion Recognition. 6299-6303 - Lili Guo, Longbiao Wang, Chenglin Xu, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition. 6304-6308 - Aneesh Muppidi, Martin Radfar:
Speech Emotion Recognition Using Quaternion Convolutional Neural Networks. 6309-6313 - Yuan Gao, Jiaxing Liu, Longbiao Wang, Jianwu Dang:
Domain-Adversarial Autoencoder with Attention Based Feature Level Fusion for Speech Emotion Recognition. 6314-6318 - Mingke Xu, Fan Zhang, Xiaodong Cui, Wei Zhang:
Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation. 6319-6323 - Raghavendra Pappagari, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
CopyPaste: An Augmentation Method for Speech Emotion Recognition. 6324-6328 - Mao Li, Bo Yang, Joshua Levy, Andreas Stolcke, Viktor Rozgic, Spyros Matsoukas, Constantinos Papayiannis, Daniel Bone, Chao Wang:
Contrastive Unsupervised Learning for Speech Emotion Recognition. 6329-6333 - Qi Cao, Mixiao Hou, Bingzhi Chen, Zheng Zhang, Guangming Lu:
Hierarchical Network Based on the Fusion of Static and Dynamic Features for Speech Emotion Recognition. 6334-6338 - Jiaxing Liu, Sen Chen, Longbiao Wang, Zhilei Liu, Yahui Fu, Lili Guo, Jianwu Dang:
Multimodal Emotion Recognition with Capsule Graph Convolutional Based Representation Fusion. 6339-6343 - Raghuveer Peri, Srinivas Parthasarathy, Charles Bradshaw, Shiva Sundaram:
Disentanglement for Audio-Visual Emotion Recognition Using Multitask Setup. 6344-6348 - Rohan Kumar Das, Jichen Yang, Haizhou Li:
Data Augmentation with Signal Companding for Detection of Logical Access Attacks. 6349-6353 - Xu Li, Na Li, Chao Weng, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Replay and Synthetic Speech Detection with Res2Net Architecture. 6354-6358 - Anwei Luo, Enlei Li, Yongliang Liu, Xiangui Kang, Z. Jane Wang:
A Capsule Network Based Approach for Detection of Audio Spoofing Attacks. 6359-6363 - Rajul Acharya, Harsh Kotta, Ankur T. Patil, Hemant A. Patil:
Cross-Teager Energy Cepstral Coefficients for Replay Spoof Detection on Voice Assistants. 6364-6368 - Hemlata Tak, Jose Patino, Massimiliano Todisco, Andreas Nautsch, Nicholas W. D. Evans, Anthony Larcher:
End-to-End Anti-Spoofing with RawNet2. 6369-6373 - Meng Liu, Longbiao Wang, Kong Aik Lee, Xuanda Chen, Jianwu Dang:
Replay-Attack Detection Using Features With Adaptive Spectro-Temporal Resolution. 6374-6378 - Vilayphone Vilaysouk, Amr Nour-Eldin, Dermot Connolly:
Improving Identification of System-Directed Speech Utterances by Deep Learning of ASR-Based Word Embeddings and Confidence Metrics. 6379-6382 - Atsunori Ogawa, Naohiro Tawara, Takatomo Kano, Marc Delcroix:
BLSTM-Based Confidence Estimation for End-to-End Speech Recognition. 6383-6387 - Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman:
Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition. 6388-6392 - David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw:
Learning Word-Level Confidence for Subword End-To-End ASR. 6393-6397 - Ashutosh Gupta, Ankur Kumar, Dhananjaya Gowda, Kwangyoun Kim, Sachin Singh, Shatrughan Singh, Chanwoo Kim:
Neural Utterance Confidence Measure for RNN-Transducers and Two Pass Models. 6398-6402 - Pingchuan Ma, Stavros Petridis, Maja Pantic:
Detecting Adversarial Attacks on Audiovisual Speech Recognition. 6403-6407 - Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gokce Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas:
REDAT: Accent-Invariant Representation for End-To-End ASR by Domain Adversarial Training with Relabeling. 6408-6412 - Tian Tan, Yizhou Lu, Rao Ma, Sen Zhu, Jiaqi Guo, Yanmin Qian:
AISpeech-SJTU ASR System for the Accented English Speech Recognition Challenge. 6413-6417 - Song Li, Beibei Ouyang, Dexin Liao, Shipeng Xia, Lin Li, Qingyang Hong:
End-To-End Multi-Accent Speech Recognition with Unsupervised Accent Modelling. 6418-6422 - Jinchao Li, Jianwei Yu, Zi Ye, Simon Wong, Man-Wai Mak, Brian Mak, Xunying Liu, Helen Meng:
A Comparative Study of Acoustic and Linguistic Features Classification for Alzheimer's Disease Detection. 6423-6427 - John B. Harvill, Dias Issa, Mark Hasegawa-Johnson, Chang Dong Yoo:
Synthesis of New Words for Improved Dysarthric Speech Recognition on an Expanded Vocabulary. 6428-6432 - Zi Ye, Shoukang Hu, Jinchao Li, Xurong Xie, Mengzhe Geng, Jianwei Yu, Junhao Xu, Boyang Xue, Shansong Liu, Xunying Liu, Helen Meng:
Development of the CUHK Elderly Speech Recognition System for Neurocognitive Disorder Detection Using the DementiaBank Corpus. 6433-6437 - Yujie Chi, Kiyoshi Honda, Jianguo Wei:
Portable Photoglottography for Monitoring Vocal Fold Vibrations in Speech Production. 6438-6442 - Ming Feng, Yin Wang, Kele Xu, Huaimin Wang, Bo Ding:
Improving Ultrasound Tongue Contour Extraction Using U-Net and Shape Consistency-Based Regularizer. 6443-6447 - Tilak Purohit, Achuth Rao M. V, Prasanta Kumar Ghosh:
Impact of Speaking Rate on the Source Filter Interaction in Speech: A Study. 6448-6452 - Abdolreza Sabzi Shahrebabaki, Negar Olfati, Ali Shariq Imran, Magne Hallstein Johnsen, Sabato Marco Siniscalchi, Torbjørn Svendsen:
A Two-Stage Deep Modeling Approach to Articulatory Inversion. 6453-6457 - Sarthak Kumar Maharana, Aravind Illa, Renuka Mannem, Yamini Belur, Preetie Shetty, Preethish-Kumar Veeramani, Seena Vengalil, Kiran Polavarapu, Atchayaram Nalini, Prasanta Kumar Ghosh:
Acoustic-to-Articulatory Inversion for Dysarthric Speech by Using Cross-Corpus Acoustic-Articulatory Data. 6458-6462 - Jiahong Yuan, Kenneth Church:
Speaking Rate and Tonal Realization in Mandarin Chinese: What Can We Learn From Large Speech Corpora? 6463-6467 - Yota Ueda, Kazuki Fujii, Yuki Saito, Shinnosuke Takamichi, Yukino Baba, Hiroshi Saruwatari:
HumanACGAN: Conditional Generative Adversarial Network with Human-Based Auxiliary Classifier and its Evaluation in Phoneme Perception. 6468-6472 - Qiang Huang, Thomas Hain:
Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Networks. 6473-6477 - W. Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh:
Generative Speech Coding with Predictive Variance Regularization. 6478-6482 - Dang-Khoa Mac, Van-Huy Nguyen, Dinh-Nghi Nguyen, Kim-Anh Nguyen:
How to Make Text-to-Speech System Pronounce "Voldemort": an Experimental Approach of Foreign Word Phonemization in Vietnamese. 6483-6487 - Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Junichi Yamagishi:
How Similar or Different is Rakugo Speech Synthesizer to Professional Performers? 6488-6492 - Chandan K. A. Reddy, Vishak Gopal, Ross Cutler:
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors. 6493-6497 - Kevin M. Chu, Leslie M. Collins, Boyla Mainsah:
A Causal Deep Learning Framework for Classifying Phonemes in Cochlear Implants. 6498-6502 - Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka:
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR. 6503-6507 - Jaeyun Song, Hajin Shim, Eunho Yang:
Mutually-Constrained Monotonic Multihead Attention for Online ASR. 6508-6512 - Gerardo Roa Dabike, Jon Barker:
The Use of Voice Source Features for Sung Speech Recognition. 6513-6517 - Ke Li, Daniel Povey, Sanjeev Khudanpur:
A Parallelizable Lattice Rescoring Strategy with Neural Language Models. 6518-6522 - Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. 6523-6527 - Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu:
CIF-Based Collaborative Decoding for End-to-End Contextual Speech Recognition. 6528-6532 - Wei-Ning Hsu, Yao-Hung Hubert Tsai, Benjamin Bolte, Ruslan Salakhutdinov, Abdelrahman Mohamed:
HuBERT: How Much Can a Bad Teacher Benefit ASR Pre-Training? 6533-6537 - Dongwei Jiang, Wubo Li, Ruixiong Zhang, Miao Cao, Ne Luo, Yang Han, Wei Zou, Kun Han, Xiangang Li:
A Further Study of Unsupervised Pretraining for Transformer Based Speech Recognition. 6538-6542 - Changfeng Gao, Gaofeng Cheng, Runyan Yang, Han Zhu, Pengyuan Zhang, Yonghong Yan:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Text Data. 6543-6547 - Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Semi-Supervised Speech Recognition Via Graph-Based Temporal Classification. 6548-6552 - Sameer Khurana, Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training. 6553-6557 - Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Chung-Cheng Chiu, Ruoming Pang, Arun Narayanan, Ananya Misra, Yu Zhang, Liangliang Cao:
Improving Streaming Automatic Speech Recognition with Non-Streaming Model Distillation on Unsupervised Data. 6558-6562 - Liping Chen, Yan Deng, Xi Wang, Frank K. Soong, Lei He:
Speech BERT Embedding for Improving Prosody in Neural TTS. 6563-6567 - Ruibo Fu, Jianhua Tao, Zhengqi Wen, Jiangyan Yi, Tao Wang, Chunyu Qiang:
Bi-Level Style and Prosody Decoupling Modeling for Personalized End-to-End Speech Synthesis. 6568-6572 - Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman:
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech. 6573-6577 - Zack Hodari, Alexis Moinet, Sri Karlapati, Jaime Lorenzo-Trueba, Thomas Merritt, Arnaud Joly, Ammar Abbas, Penny Karanasou, Thomas Drugman:
CAMP: A Two-Stage Approach to Modelling Prosody in Context. 6578-6582 - Shuang Liang, Chenfeng Miao, Minchuan Chen, Jun Ma, Shaojun Wang, Jing Xiao:
Unsupervised Learning for Multi-Style Speech Synthesis with Limited Data. 6583-6587 - Adrian Lancucki:
FastPitch: Parallel Text-to-Speech with Pitch Prediction. 6588-6592 - Goeric Huybrechts, Thomas Merritt, Giulia Comini, Bartek Perz, Raahil Shah, Jaime Lorenzo-Trueba:
Low-Resource Expressive Text-To-Speech Using Data Augmentation. 6593-6597 - Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim:
TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis. 6598-6602 - Hanbin Bae, Jae-Sung Bae, Young-Sun Joo, Young-Ik Kim, Hoon-Young Cho:
A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music. 6603-6607 - Detai Xin, Tatsuya Komatsu, Shinnosuke Takamichi, Hiroshi Saruwatari:
Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS. 6608-6612 - Yuzi Yan, Xu Tan, Bohan Li, Tao Qin, Sheng Zhao, Yuan Shen, Tie-Yan Liu:
AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data. 6613-6617 - Yibin Zheng, Xinhui Li, Li Lu:
Investigation of Fast and Efficient Methods for Multi-Speaker Modeling and Speaker Adaptation. 6618-6622 - Chandan K. A. Reddy, Harishchandra Dubey, Vishak Gopal, Ross Cutler, Sebastian Braun, Hannes Gamper, Robert Aichner, Sriram Srinivasan:
ICASSP 2021 Deep Noise Suppression Challenge. 6623-6627 - Andong Li, Wenzhe Liu, Xiaoxue Luo, Chengshi Zheng, Xiaodong Li:
ICASSP 2021 Deep Noise Suppression Challenge: Decoupling Magnitude and Phase Optimization with a Two-Stage Deep Network. 6628-6632 - Xiang Hao, Xiangdong Su, Radu Horaud, Xiaofei Li:
FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement. 6633-6637 - Jingdong Li, Dawei Luo, Yun Liu, Yuanyuan Zhu, Zhaoxia Li, Guohui Cui, Wenqi Tang, Wei Chen:
Densely Connected Multi-Stage Model with Channel Wise Subband Feature for Real-Time Speech Enhancement. 6638-6642 - Tyler Vuong, Yangyang Xia, Richard M. Stern:
A Modulation-Domain Loss for Neural-Network-Based Real-Time Speech Enhancement. 6643-6647 - Shengkui Zhao, Trung Hieu Nguyen, Bin Ma:
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses. 6648-6652 - Giovanni Morrone, Daniel Michelsanti, Zheng-Hua Tan, Jesper Jensen:
Audio-Visual Speech Inpainting with Deep Learning. 6653-6657 - Karthik Ramesh, Chao Xing, Wupeng Wang, Dong Wang, Xiao Chen:
VSET: A Multimodal Transformer for Visual Speech Enhancement. 6658-6662 - Mostafa Sadeghi, Xavier Alameda-Pineda:
Switching Variational Auto-Encoders for Noise-Agnostic Audio-Visual Speech Enhancement. 6663-6667 - Koichiro Ito, Masaaki Yamamoto, Kenji Nagamatsu:
Audio-Visual Speech Enhancement Method Conditioned on the Lip Motion and Speaker-Discriminative Embeddings. 6668-6672 - Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura:
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss. 6673-6677 - Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li:
MuSE: Multi-Modal Target Speaker Extraction with Visual Cues. 6678-6682 - Ying Liu, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai:
An Effective Deep Embedding Learning Method Based on Dense-Residual Networks for Speaker Verification. 6683-6687 - Sangwook Han, Jaeuk Byun, Jong Won Shin:
Time-Domain Speaker Verification Using Temporal Convolutional Networks. 6688-6692 - Chunlei Zhang, Meng Yu, Chao Weng, Dong Yu:
Towards Robust Speaker Verification with Target Speaker Enhancement. 6693-6697 - Naijun Zheng, Na Li, Bo Wu, Meng Yu, Jianwei Yu, Chao Weng, Dan Su, Xunying Liu, Helen Meng:
A Joint Training Framework of Multi-Look Separator and Speaker Embedding Extractor for Overlapped Speech. 6698-6702 - Ya-Qi Yu, Siqi Zheng, Hongbin Suo, Yun Lei, Wu-Jun Li:
CAM: Context-Aware Masking for Robust Speaker Verification. 6703-6707 - Youzhi Tu, Man-Wai Mak:
Short-Time Spectral Aggregation for Speaker Embedding. 6708-6712 - Haoran Zhang, Yuexian Zou, Helin Wang:
Contrastive Self-Supervised Learning for Text-Independent Speaker Verification. 6713-6717 - Haibin Wu, Xu Li, Andy T. Liu, Zhiyong Wu, Helen Meng, Hung-yi Lee:
Adversarial Defense for Automatic Speaker Verification by Cascaded Self-Supervised Learning Models. 6718-6722 - Wei Xia, Chunlei Zhang, Chao Weng, Meng Yu, Dong Yu:
Self-Supervised Text-Independent Speaker Verification Using Prototypical Momentum Contrastive Learning. 6723-6727 - Danwei Cai, Weiqing Wang, Ming Li:
An Iterative Framework for Self-Supervised Deep Speaker Representation Learning. 6728-6732 - Jaejin Cho, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Improving Reconstruction Loss Based Speaker Embedding in Unsupervised and Semi-Supervised Scenarios. 6733-6737 - Erfan Loweimi, Zoran Cvetkovic, Peter Bell, Steve Renals:
Speech Acoustic Modelling from Raw Phase Spectrum. 6738-6742 - Shunfei Chen, Xinhui Hu, Sheng Li, Xinkang Xu:
An Investigation of Using Hybrid Modeling Units for Improving End-to-End Speech Recognition System. 6743-6747 - Xiaodong Cui, Songtao Lu, Brian Kingsbury:
Federated Acoustic Modeling for Automatic Speech Recognition. 6748-6752 - Murali Karthick Baskar, Lukás Burget, Shinji Watanabe, Ramón Fernandez Astudillo, Jan Honza Cernocký:
EAT: Enhanced ASR-TTS for Self-Supervised Speech Recognition. 6753-6757 - Shoukang Hu, Xurong Xie, Shansong Liu, Mingyu Cui, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks. 6758-6762 - Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka:
Hypothesis Stitcher for End-to-End Speaker-Attributed ASR on Long-Form Multi-Talker Recordings. 6763-6767 - Jeremy Heng Meng Wong, Dimitrios Dimitriadis, Ken'ichi Kumatani, Yashesh Gaur, George Polovets, Partha Parthasarathy, Eric Sun, Jinyu Li, Yifan Gong:
Ensemble Combination between Different Time Segmentations. 6768-6772 - Chanwoo Kim, Abhinav Garg, Dhananjaya Gowda, Seongkyu Mun, Changwoo Han:
Streaming End-to-End Speech Recognition with Jointly Trained Neural Feature Enhancement. 6773-6777 - Yongqiang Wang, Yangyang Shi, Frank Zhang, Chunyang Wu, Julian Chan, Ching-Feng Yeh, Alex Xiao:
Transformer in Action: A Comparative Study of Transformer-Based Acoustic Models for Large Scale Speech Recognition Applications. 6778-6782 - Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Mike Seltzer:
Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition. 6783-6787 - Liqiang He, Dan Su, Dong Yu:
Learned Transferable Architectures Can Surpass Hand-Designed Architectures for Large Scale Speech Recognition. 6788-6792 - Jae-Jin Jeon, Eesung Kim:
Multitask Learning and Joint Optimization for Transformer-RNN-Transducer Speech Recognition. 6793-6797 - Colin Lea, Vikramjit Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey P. Bigham:
SEP-28k: A Dataset for Stuttering Event Detection from Podcasts with People Who Stutter. 6798-6802 - Nicholas Wilkinson, Thomas Niesler:
A Hybrid CNN-BiLSTM Voice Activity Detector. 6803-6807 - Yong Rae Jo, Young Ki Moon, Won-Ik Cho, Geun Sik Jo:
Self-Attentive VAD: Context-Aware Detection of Voice from Noise. 6808-6812 - Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Preventing Early Endpointing for Online Automatic Speech Recognition. 6813-6817 - Fei Jia, Somshubra Majumdar, Boris Ginsburg:
MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network for Voice Activity Detection. 6818-6822 - Xu Tan, Xiao-Lei Zhang:
Speech Enhancement Aided End-To-End Multi-Task Learning for Voice Activity Detection. 6823-6827 - Nan Li, Longbiao Wang, Masashi Unoki, Sheng Li, Rui Wang, Meng Ge, Jianwu Dang:
Robust Voice Activity Detection Using a Masked Auditory Encoder Based Convolutional Neural Network. 6828-6832 - Junyao Zhan, Qianhua He, Jianbin Su, Yanxiong Li:
A Stage Match for Query-by-Example Spoken Term Detection Based On Structure Information of Query. 6833-6837 - Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik:
Knowledge Transfer for Efficient On-Device False Trigger Mitigation. 6838-6842 - Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg:
Progressive Voice Trigger Detection: Accuracy vs Latency. 6843-6847 - Takuya Higuchi, Shreyas Saxena, Mehrez Souden, Tien Dung Tran, Masood Delfarah, Chandra Dhir:
Dynamic Curriculum Learning via Data Parameters for Noise Robust Keyword Spotting. 6848-6852 - Tzeviya Sylvia Fuchs, Yael Segal, Joseph Keshet:
CNN-Based Spoken Term Detection and Localization without Dynamic Programming. 6853-6857 - Jinmiao Huang, Waseem Gharbieh, Han Suk Shim, Eugene Kim:
Query-By-Example Keyword Spotting System Using Multi-Head Attention and Soft-Triple Loss. 6858-6862 - Otavio Braga, Olivier Siohan:
A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection. 6863-6867 - Takashi Fukuda, Gakuto Kurata:
Generalized Knowledge Distillation from an Ensemble of Specialized Teachers Leveraging Unsupervised Neural Clustering. 6868-6872 - Kyu Jeong Han, Jing Pan, Venkata Krishna Naveen Tadala, Tao Ma, Dan Povey:
Multistream CNN for Robust Acoustic Modeling. 6873-6877 - Valentin Mendelev, Tina Raissi, Guglielmo Camporese, Manuel Giollo:
Improved Robustness to Disfluencies in RNN-Transducer Based Speech Recognition. 6878-6882 - Purvi Agrawal, Sriram Ganapathy:
Representation Learning for Speech Recognition Using Feedback Based Relevance Weighting. 6883-6887 - Wei Wang, Zhikai Zhou, Yizhou Lu, Hongji Wang, Chenpeng Du, Yanmin Qian:
Towards Data Selection on TTS Data for Children's Speech Recognition. 6888-6892 - Archiki Prasad, Preethi Jyothi, Rajbabu Velmurugan:
An Investigation of End-to-End Models for Robust Speech Recognition. 6893-6897 - Wangyou Zhang, Christoph Böddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. 6898-6902 - Ilya Sklyar, Anna Piunova, Yulan Liu:
Streaming Multi-Speaker ASR with RNN-T. 6903-6907 - Jiatong Shi, Chunlei Zhang, Chao Weng, Shinji Watanabe, Meng Yu, Dong Yu:
Improving RNN Transducer with Target Speaker Extraction and Neural Uncertainty Estimation. 6908-6912 - Zhaoxu Nian, Yan-Hui Tu, Jun Du, Chin-Hui Lee:
A Progressive Learning Approach to Adaptive Noise and Speech Estimation for Speech Enhancement and Noisy Speech Recognition. 6913-6917 - Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie:
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods. 6918-6922 - Purva Barche, Krishna Gurugubelli, Anil Kumar Vuppala:
Comparative Study of Different Epoch Extraction Methods for Speech Associated with Voice Disorders. 6923-6927 - Tianda Li, Jia-Chen Gu, Hui Liu, Quan Liu, Zhen-Hua Ling, Zhiming Su, Xiaodan Zhu:
Have You Made a Decision? Where? A Pilot Study on Interpretability of Polarity Analysis Based on Advising Problem. 6928-6932 - Ruixiong Zhang, Haiwei Wu, Wubo Li, Dongwei Jiang, Wei Zou, Xiangang Li:
Transformer Based Unsupervised Pre-Training for Acoustic Representation Learning. 6933-6937 - Jindrich Matousek, Daniel Tihelka:
A Comparison of Convolutional Neural Networks for Glottal Closure Instant Detection from Raw Speech. 6938-6942 - Hao Huang, Kai Wang, Ying Hu, Sheng Li:
Encoder-Decoder Based Pitch Tracking and Joint Model Training for Mandarin Tone Classification. 6943-6947 - Shintaro Ando, Hiromasa Fujihara:
Construction of a Large-Scale Japanese ASR Corpus on TV Recordings. 6948-6952 - Shareef Babu Kalluri, Deepu Vijayasenan, Sriram Ganapathy, Ragesh Rajan M, Prashant Krishnan V:
NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling. 6953-6957 - Xinjian Li, David R. Mortensen, Florian Metze, Alan W. Black:
Multilingual Phonetic Dataset for Low Resource Speech Recognition. 6958-6962 - Naohiro Tawara, Atsunori Ogawa, Yuki Kitagishi, Hosana Kamiyama:
Age-VOX-Celeb: Multi-Modal Corpus for Facial and Speech Estimation. 6963-6967 - Tingwei Guo, Cheng Wen, Dongwei Jiang, Ne Luo, Ruixiong Zhang, Shuaijiang Zhao, Wubo Li, Cheng Gong, Wei Zou, Kun Han, Xiangang Li:
DiDiSpeech: A Large Scale Mandarin Speech Corpus. 6968-6972 - Maria Joana Correia, Francisco Teixeira, Catarina Botelho, Isabel Trancoso, Bhiksha Raj:
The In-the-Wild Speech Medical Corpus. 6973-6977 - Cong-Thanh Do, Rama Doddipatla, Thomas Hain:
Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition. 6978-6982 - Hemant Kumar Kathania, Avinash Kumar, Mikko Kurimo:
Vowel Non-Vowel Based Spectral Warping and Time Scale Modification for Improvement in Children's ASR. 6983-6987 - Rohan Doshi, Youzheng Chen, Liyang Jiang, Xia Zhang, Fadi Biadsy, Bhuvana Ramabhadran, Fang Chu, Andrew Rosenberg, Pedro J. Moreno:
Extending Parrotron: An End-to-End, Speech Conversion and Speech Recognition Model for Atypical Speech. 6988-6992 - Gary Yeung, Ruchao Fan, Abeer Alwan:
Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition. 6993-6997 - Martin Karafiát, Karel Veselý, Jan Honza Cernocký, Ján Profant, Jirí Nytra, Miroslav Hlavácek, Tomás Pavlícek:
Analysis of X-Vectors for Low-Resource Speech Recognition. 6998-7002 - Liu Chen, Meysam Asgari:
Refining Automatic Speech Recognition System for Older Adults. 7003-7007 - Linghui Meng, Jin Xu, Xu Tan, Jindong Wang, Tao Qin, Bo Xu:
MixSpeech: Data Augmentation for Low-Resource Automatic Speech Recognition. 7008-7012 - Solomon Teferra Abate, Martha Yifiru Tachbelie, Tanja Schultz:
End-to-End Multilingual Automatic Speech Recognition for Less-Resourced Languages: The Case of Four Ethiopian Languages. 7013-7017 - Shannon Wotherspoon, William Hartmann, Matthew Snover, Owen Kimball:
Improved Data Selection for Domain Adaptation in ASR. 7018-7022 - Ruchao Fan, Amber Afshan, Abeer Alwan:
Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-Training and its Application to Children's ASR. 7023-7027 - Wenxin Hou, Yidong Wang, Shengzhou Gao, Takahiro Shinozaki:
Meta-Adapter: Efficient Cross-Lingual Adaptation With Meta-Learning. 7028-7032 - Abhijeet Awasthi, Aman Kansal, Sunita Sarawagi, Preethi Jyothi:
Error-Driven Fixed-Budget ASR Personalization for Accented Speakers. 7033-7037 - Max Morrison, Lucas Rencker, Zeyu Jin, Nicholas J. Bryan, Juan Pablo Cáceres, Bryan Pardo:
Context-Aware Prosody Correction for Text-Based Speech Editing. 7038-7042 - Minsu Kang, Jihyun Lee, Simin Kim, Injung Kim:
Fast DCTTS: Efficient Deep Convolutional Text-to-Speech. 7043-7047 - Ravindra Yadav, Ashish Sardana, Vinay P. Namboodiri, Rajesh M. Hegde:
Speech Prediction in Silent Videos Using Variational Autoencoders. 7048-7052 - Jennifer Williams, Yi Zhao, Erica Cooper, Junichi Yamagishi:
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm. 7053-7057 - Keisuke Matsubara, Takuma Okamoto, Ryoichi Takashima, Tetsuya Takiguchi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC. 7058-7062 - Chen Zhang, Yi Ren, Xu Tan, Jinglin Liu, Kejun Zhang, Tao Qin, Sheng Zhao, Tie-Yan Liu:
DenoiSpeech: Denoising Text to Speech with Frame-Level Noise Modeling. 7063-7067 - Tomoki Hayashi, Wen-Chin Huang, Kazuhiro Kobayashi, Tomoki Toda:
Non-Autoregressive Sequence-To-Sequence Voice Conversion. 7068-7072 - Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Ling Xu, Chen Shen, Zejun Ma:
PPG-Based Singing Voice Conversion with Adversarial Representation Learning. 7073-7077 - Xiaobin Zhuang, Tao Jiang, Szu-Yu Chou, Bin Wu, Peng Hu, Simon Lui:
LiteSing: Towards Fast, Lightweight and Expressive Singing Voice Synthesis. 7078-7082 - Jordi Bonada, Merlijn Blaauw:
Semi-Supervised Learning for Singing Synthesis Timbre. 7083-7087 - Lars Thieling, Daniel Wilhelm, Peter Jax:
Recurrent Phase Reconstruction Using Estimated Phase Derivatives from Deep Neural Networks. 7088-7092 - Slava Shechtman, David Haws, Raul Fernandez:
Stable Checkpoint Selection and Evaluation in Sequence to Sequence Speech Synthesis. 7093-7097 - Kai Wang, Bengbeng He, Wei-Ping Zhu:
TSTNN: Two-Stage Transformer Based Neural Network for Speech Enhancement in the Time Domain. 7098-7102 - Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins:
Self-Attention Generative Adversarial Network for Speech Enhancement. 7103-7107 - Wei Xue, Gang Quan, Chao Zhang, Guohong Ding, Xiaodong He, Bowen Zhou:
Neural Kalman Filtering for Speech Enhancement. 7108-7112 - Zhihui Zhang, Xiaoqi Li, Yaxing Li, Yuanjie Dong, Dan Wang, Shengwu Xiong:
Neural Noise Embedding for End-To-End Speech Enhancement with Conditional Layer Normalization. 7113-7117 - Saurabh Kataria, Jesús Villalba, Najim Dehak:
Perceptual Loss Based Speech Denoising with an Ensemble of Audio Pattern Recognition and Self-Supervised Models. 7118-7122 - Khandokar Md. Nayem, Donald S. Williamson:
Towards An ASR Approach Using Acoustic and Language Models for Speech Enhancement. 7123-7127 - Nathan Howard, Alex Park, Turaj Zakizadeh Shabestary, Alexander Gruenstein, Rohit Prabhavalkar:
A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer and Large Scale Synthetic Data. 7128-7132 - Jean-Marc Valin, Srikanth V. Tenneti, Karim Helwani, Umut Isik, Arvindh Krishnaswamy:
Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based on PercepNet. 7133-7137 - Nils L. Westhausen, Bernd T. Meyer:
Acoustic Echo Cancellation with the Dual-Signal Transformation LSTM Network. 7138-7142 - Adam Polyak, Lior Wolf, Yossi Adi, Ori Kabeli, Yaniv Taigman:
High Fidelity Speech Regeneration with Application to Speech Enhancement. 7143-7147 - Ju Lin, Yun Wang, Kaustubh Kalgaonkar, Gil Keren, Didi Zhang, Christian Fuegen:
A Time-Domain Convolutional Recurrent Network for Packet Loss Concealment. 7148-7152 - Arun Asokan Nair, Kazuhito Koishida:
Cascaded Time + Time-Frequency Unet For Speech Enhancement: Jointly Addressing Clipping, Codec Distortions, And Gaps. 7153-7157 - Jeremy Heng Meng Wong, Xiong Xiao, Yifan Gong:
Hidden Markov Model Diarisation with Speaker Location Information. 7158-7162 - Zeqian Li, Jacob Whitehill:
Compositional Embedding Models for Speaker Identification and Diarization with Simultaneous Speech From 2+ Speakers. 7163-7167 - Guangzhi Sun, D. Liu, Chao Zhang, Philip C. Woodland:
Content-Aware Speaker Embeddings for Speaker Diarisation. 7168-7172 - Tae Jin Park, Manoj Kumar, Shrikanth Narayanan:
Multi-Scale Speaker Diarization with Neural Affinity Score Fusion. 7173-7177 - Junzhe Zhu, Mark Hasegawa-Johnson, Nancy L. McElwain:
A Comparison Study on Infant-Parent Voice Diarization. 7178-7182 - Soumi Maiti, Hakan Erdogan, Kevin W. Wilson, Scott Wisdom, Shinji Watanabe, John R. Hershey:
End-To-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings. 7183-7187 - Shota Horiguchi, Paola García, Yusuke Fujita, Shinji Watanabe, Kenji Nagamatsu:
End-To-End Speaker Diarization as Post-Processing. 7188-7192 - Eunjung Han, Chul Lee, Andreas Stolcke:
BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers. 7193-7197 - Keisuke Kinoshita, Marc Delcroix, Naohiro Tawara:
Integrating End-to-End Neural and Clustering-Based Diarization: Getting the Best of Both Worlds. 7198-7202 - Amirhossein Hajavi, Ali Etemad:
Siamese Capsule Network for End-to-End Speaker Recognition in the Wild. 7203-7207 - Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan:
A Real-Time Speaker Diarization System Based on Spatial Spectrum. 7208-7212 - Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Unsupervised Neural Adaptation Model Based on Optimal Transport for Spoken Language Identification. 7213-7217 - Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann:
Joint ASR and Language Identification Using RNN-T: An Efficient Approach to Dynamic Language Switching. 7218-7222 - Muralikrishna H, Shantanu Kapoor, Dileep Aroor Dinesh, Padmanabhan Rajan:
Spoken Language Identification in Unseen Target Domain Using Within-Sample Similarity Loss. 7223-7227 - Vishwas M. Shetty, Srinivasan Umesh:
Exploring the Use of Common Label Set to Improve Speech Recognition of Low Resource Indian Languages. 7228-7232 - Xinjian Li, Juncheng Li, Jiali Yao, Alan W. Black, Florian Metze:
Phone Distribution Estimation for Low Resource Languages. 7233-7237 - Siyuan Feng, Piotr Zelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
How Phonotactics Affect Multilingual and Zero-Shot ASR Performance. 7238-7242 - Bo Wang, Yue Wu, Nemanja Vaci, Maria Liakata, Terry J. Lyons, Kate E. A. Saunders:
Modelling Paralinguistic Properties in Conversational Speech to Detect Bipolar Disorder and Borderline Personality Disorder. 7243-7247 - Vikram C. Mathad, Nancy Scherer, Kathy Chapman, Julie Liss, Visar Berisha:
An Attention Model for Hypernasality Prediction in Children with Cleft Palate. 7248-7252 - Qiang Gao, Haiwei Wu, Yanqing Sun, Yitao Duan:
An End-to-End Speech Accent Recognition Method Based on Hybrid CTC/Attention Transformer ASR. 7253-7257 - Yilin Pan, Venkata Srikanth Nallanthighal, Daniel Blackburn, Heidi Christensen, Aki Härmä:
Multi-Task Estimation of Age and Cognitive Decline from Speech. 7258-7262 - Wei-Cheng Lin, Kusha Sridhar, Carlos Busso:
DeepEmoCluster: A Semi-Supervised Framework for Latent Cluster Representation of Speech Emotions. 7263-7267 - Andreas Triantafyllopoulos, Björn W. Schuller:
The Role of Task and Acoustic Similarity in Audio Transfer Learning: Insights from the Speech Emotion Recognition Case. 7268-7272 - Amir Harati, Elizabeth Shriberg, Tomasz Rutowski, Piotr Chlebek, Yang Lu, Ricardo Oliveira:
Speech-Based Depression Prediction Using Encoder-Weight-Only Transfer Learning and a Large Corpus. 7273-7277 - Sri Harsha Dumpala, Sheri Rempel, Katerina Dikaios, Mehri Sajjadian, Rudolf Uher, Sageev Oore:
Estimating Severity of Depression From Acoustic Features and Embeddings of Natural Speech. 7278-7282 - Brian Stasak, Zhaocheng Huang, Dale Joachim, Julien Epps:
Automatic Elicitation Compliance for Short-Duration Speech Based Depression Detection. 7283-7287 - José Vicente Egas López, Gábor Gosztolya:
Deep Neural Network Embeddings for the Estimation of the Degree of Sleepiness. 7288-7292 - Jiahong Yuan, Xingyu Cai, Kenneth Church:
Pause-Encoded Language Models for Recognition of Alzheimer's Disease and Emotion. 7293-7297 - Juan Camilo Vásquez-Correa, Tomás Arias-Vergara, Philipp Klumpp, Paula Andrea Pérez-Toro, Juan Rafael Orozco-Arroyave, Elmar Nöth:
End-2-End Modeling of Speech and Gait from Patients with Parkinson's Disease: Comparison Between High Quality Vs. Smartphone Data. 7298-7302 - Lidan Wu, Daoming Zong, Shiliang Sun, Jing Zhao:
A Sequential Contrastive Learning Framework for Robust Dysarthric Speech Recognition. 7303-7307 - Ina Kodrasi, Michaela Pernon, Marina Laganaro, Hervé Bourlard:
Automatic And Perceptual Discrimination Between Dysarthria, Apraxia of Speech, and Neurotypical Speech. 7308-7312 - Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh:
Effect of Noise and Model Complexity on Detection of Amyotrophic Lateral Sclerosis and Parkinson's Disease Using Pitch and MFCC. 7313-7317 - Chaoyue Ding, Shiliang Sun, Jing Zhao:
Multi-Task Transformer with Input Feature Reconstruction for Dysarthric Speech Recognition. 7318-7322 - Zhaoci Liu, Zhiqiang Guo, Zhenhua Ling, Yunxia Li:
Detecting Alzheimer's Disease from Speech Using Neural Networks with Bottleneck Features and Data Augmentation. 7323-7327 - Parvaneh Janbakhshi, Ina Kodrasi, Hervé Bourlard:
Automatic Dysarthric Speech Detection Exploiting Pairwise Distance-Based Convolutional Neural Networks. 7328-7332 - Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. 7333-7337 - Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong:
Internal Language Model Training for Domain-Adaptive End-To-End Speech Recognition. 7338-7342 - Wen-Chin Huang, Chia-Hua Wu, Shang-Bao Luo, Kuan-Yu Chen, Hsin-Min Wang, Tomoki Toda:
Speech Recognition by Simply Fine-Tuning BERT. 7343-7347 - Aditya Gourav, Linda Liu, Ankur Gandhe, Yile Gu, Guitang Lan, Xiangyang Huang, Shashank Kalmane, Gautam Tiwari, Denis Filimonov, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko:
Personalization Strategies for End-to-End Speech Recognition Systems. 7348-7352 - Christopher Li, Pat Rondon, Diamantino Caseiro, Leonid Velikovich, Xavier Velez, Petar S. Aleksic:
Improving Entity Recall in Automatic Speech Recognition with Neural Embeddings. 7353-7357 - Taewoo Lee, Min-Joong Lee, Tae Gyoon Kang, Seokyeoung Jung, Minseok Kwon, Yeona Hong, Jungin Lee, Kyoung-Gu Woo, Ho-Gyeong Kim, Jiseung Jeong, Jihyun Lee, Hosik Lee, Young Sang Choi:
Adaptable Multi-Domain Language Model for Transformer ASR. 7358-7362 - Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Transformer Language Models with LSTM-Based Cross-Utterance Information Representation. 7363-7367 - Jilin Wang, Jiaji Huang, Kenneth Ward Church:
Large Margin Training Improves Language Models for ASR. 7368-7372 - Linda Liu, Yile Gu, Aditya Gourav, Ankur Gandhe, Shashank Kalmane, Denis Filimonov, Ariya Rastrow, Ivan Bulyko:
Domain-Aware Neural Language Models for Speech Recognition. 7373-7377 - Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Transformer Language Models for Speech Recognition. 7378-7382 - Junhao Xu, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng:
Mixed Precision Quantization of Transformer Language Models for Speech Recognition. 7383-7387 - Zhe Liu, Fuchun Peng:
Federated Marginal Personalization for ASR Rescoring. 7388-7392 - Sixing Wu, Dawei Zhang, Ying Li, Zhonghai Wu:
Multi Path Training Framework for Data-Driven Open-Domain Conversation System. 7393-7397 - Svetlana Stoyanchev, Simon Keizer, Rama Doddipatla:
Action State Update Approach to Dialogue Management. 7398-7402 - Yuhan Liu, Jiachen Du, Xiang Li, Ruifeng Xu:
Generating Empathetic Responses by Injecting Anticipated Emotion. 7403-7407 - Amalia Istiqlali Adiba, Takeshi Homma, Toshinori Miyoshi:
Towards Immediate Backchannel Generation Using Attention-Based Early Prediction Model. 7408-7412 - Sashank Gondala, Lyan Verwimp, Ernest Pusateri, Manos Tsagkias, Christophe Van Gysel:
Error-Driven Pruning of Language Models for Virtual Assistants. 7413-7417 - Jun Bai, Wenge Rong, Feiyu Xia, Yanmeng Wang, Yuanxin Ouyang, Zhang Xiong:
Paragraph Level Multi-Perspective Context Modeling for Question Generation. 7418-7422 - Yanmeng Wang, Ye Wang, Xingyu Lou, Wenge Rong, Zhenghong Hao, Shaojun Wang:
Improving Dialogue Response Generation Via Knowledge Graph Filter. 7423-7427 - Shijie Zhou, Wenge Rong, Jianfei Zhang, Yanmeng Wang, Libin Shi, Zhang Xiong:
Topic-Aware Dialogue Generation with Two-Hop Based Graph Attention. 7428-7432 - Yawei Kong, Lu Zhang, Can Ma, Cong Cao:
HSAN: A Hierarchical Self-Attention Network for Multi-Turn Dialogue Generation. 7433-7437 - Lei Shen, Haolan Zhan, Xin Shen, Yang Feng:
Learning to Select Context in a Hierarchical and Global Perspective for Open-Domain Dialogue Generation. 7438-7442 - Yu Cao, Liang Ding, Zhiliang Tian, Meng Fang:
Towards Efficiently Diversifying Dialogue Generation Via Embedding Augmentation. 7443-7447 - Valentin Pelloin, Nathalie Camelin, Antoine Laurent, Renato De Mori, Antoine Caubrière, Yannick Estève, Sylvain Meignier:
End2End Acoustic to Semantic Transduction. 7448-7452 - Akshat Gupta, Xinjian Li, Sai Krishna Rallabandi, Alan W. Black:
Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages. 7453-7457 - Yao Qian, Ximo Bian, Yu Shi, Naoyuki Kanda, Leo Shen, Zhen Xiao, Michael Zeng:
Speech-Language Pre-Training for End-to-End Spoken Language Understanding. 7458-7462 - Seongbin Kim, Gyuwan Kim, Seongjin Shin, Sangmin Lee:
Two-Stage Textual Knowledge Distillation for End-to-End Spoken Language Understanding. 7463-7467 - Cheng-I Lai, Yung-Sung Chuang, Hung-Yi Lee, Shang-Wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. 7468-7472 - Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke:
Do as I Mean, Not as I Say: Sequence Loss Training for Spoken Language Understanding. 7473-7477 - Minjeong Kim, Gyuwan Kim, Sang-Woo Lee, Jung-Woo Ha:
St-Bert: Cross-Modal Language Model Pre-Training for End-to-End Spoken Language Understanding. 7478-7482 - Edmilson da Silva Morais, Hong-Kwang Jeff Kuo, Samuel Thomas, Zoltán Tüske, Brian Kingsbury:
End-to-End Spoken Language Understanding Using Transformer Networks and Self-Supervised Pre-Trained Features. 7483-7487 - Zhiqi Huang, Fenglin Liu, Peilin Zhou, Yuexian Zou:
Sentiment Injected Iteratively Co-Interactive Network for Spoken Language Understanding. 7488-7492 - Samuel Thomas, Hong-Kwang Jeff Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory:
RNN Transducer Models for Spoken Language Understanding. 7493-7497 - Bidisha Sharma, Maulik C. Madhavi, Haizhou Li:
Leveraging Acoustic and Linguistic Embeddings from Pretrained Speech and Language Models for Intent Classification. 7498-7502 - Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe:
ORTHROS: non-autoregressive end-to-end speech translation With dual-decoder. 7503-7507 - Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler:
Cascaded Models with Cyclic Feedback for Direct Speech Translation. 7508-7512 - Hari Krishna Vydana, Martin Karafiát, Katerina Zmolíková, Lukás Burget, Honza Cernocký:
Jointly Trained Transformers Models for Spoken Language Translation. 7513-7517 - Yiting Lu, Yu Wang, Mark J. F. Gales:
Efficient Use of End-to-End Data in Spoken Language Processing. 7518-7522 - Xutai Ma, Yongqiang Wang, Mohammad Javad Dousti, Philipp Koehn, Juan Miguel Pino:
Streaming Simultaneous Speech Translation with Augmented Memory Transformer. 7523-7527 - Ha Nguyen, Yannick Estève, Laurent Besacier:
An Empirical Study of End-To-End Simultaneous Speech Translation Decoding Strategies. 7528-7532 - Wenjie Qin, Xiang Li, Yuhui Sun, Deyi Xiong, Jianwei Cui, Bin Wang:
Modeling Homophone Noise for Robust Neural Machine Translation. 7533-7537 - Surafel Melaku Lakew, Marcello Federico, Yue Wang, Cuong Hoang, Yogesh Virkar, Roberto Barra-Chicote, Robert Enyedi:
Machine Translation Verbosity Control for Automatic Dubbing. 7538-7542 - Yogesh Virkar, Marcello Federico, Robert Enyedi, Roberto Barra-Chicote:
Improvements to Prosodic Alignment for Automatic Dubbing. 7543-7547 - Ping Huang, Shiliang Sun, Hao Yang:
Image-Assisted Transformer in Zero-Resource Multi-Modal Translation. 7548-7552 - Daniel Li, Te I, Naveen Arivazhagan, Colin Cherry, Dirk Padfield:
Sentence Boundary Augmentation for Neural Machine Translation Robustness. 7553-7557 - Siyou Liu:
An Empirical Study on Task-Oriented Dialogue Translation. 7558-7562 - Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
MAPGN: Masked Pointer-Generator Network for Sequence-to-Sequence Pre-Training. 7563-7567 - Wanhui Qian, Fuqing Zhu, Jinzhu Yang, Jizhong Han, Songlin Hu:
Aligning the training and evaluation of unsupervised text style Transfer. 7568-7572 - Monica Sunkara, Chaitanya Shivade, Sravan Bodapati, Katrin Kirchhoff:
Neural Inverse Text Normalization. 7573-7577 - Junwei Liao, Yu Shi, Ming Gong, Linjun Shou, Sefik Emre Eskimez, Liyang Lu, Hong Qu, Michael Zeng:
Generating Human Readable Transcript for Automatic Speech Recognition with Pre-Trained Language Model. 7578-7582 - Weiwei Jiang, Junjie Li, Minchuan Chen, Jun Ma, Shaojun Wang, Jing Xiao:
Improving Neural Text Normalization with Partial Parameter Generator and Pointer-Generator Network. 7583-7587 - Wenhao Zhu, Shuang Liu, Chaoming Liu:
Incorporating Syntactic and Phonetic Information into Multimodal Word Embeddings Using Graph Convolutional Networks. 7588-7592 - Aradhya Neeraj Mathur, Devansh Batra, Yaman Kumar Singla, Rajiv Ratn Shah, Changyou Chen, Roger Zimmermann:
LIFI: Towards Linguistically Informed Frame Interpolation. 7593-7597 - Yucheng Zhou, Wei Tao, Wenqiang Zhang:
Triple Sequence Generative Adversarial Nets for Unsupervised Image Captioning. 7598-7602 - Liming Wang, Xinsheng Wang, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
Align or attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval. 7603-7607 - Pingchuan Ma, Brais Martínez, Stavros Petridis, Maja Pantic:
Towards Practical Lipreading with Distilled and Efficient Models. 7608-7612 - Pingchuan Ma, Stavros Petridis, Maja Pantic:
End-To-End Audio-Visual Speech Recognition with Conformers. 7613-7617 - Xinyue Liu, Mingda Li, Luoxin Chen, Prashan Wanigasekara, Weitong Ruan, Haidar Khan, Wael Hamza, Chengwei Su:
ASR N-Best Fusion Nets. 7618-7622 - Hongzhan Lin, Yuanmeng Yan, Guang Chen:
Boosting Low-Resource Intent Detection with in-Scope Prototypical Networks. 7623-7627 - Hang Liu, Meng Chen, Youzheng Wu, Xiaodong He, Bowen Zhou:
Conversational Query Rewriting with Self-Supervised Learning. 7628-7632 - Vishal Sunder, Eric Fosler-Lussier:
Handling Class Imbalance in Low-Resource Dialogue Systems by Combining Few-Shot Classification and Interpolation. 7633-7637 - Luchen Liu, Xixun Lin, Peng Zhang, Bin Wang:
Improving Cross-Domain Slot Filling with Common Syntactic Structure. 7638-7642 - Yanfei Hui, Jianzong Wang, Ning Cheng, Fengying Yu, Tianbo Wu, Jing Xiao:
Joint Intent Detection and Slot Filling Based on Continual Learning Model. 7643-7647 - Wei Liu, Peijie Huang, Dongzhu Liang, Zihao Zhou:
Knowledge-Based Chat Detection with False Mention Discrimination. 7648-7652 - Daria Soboleva, Ondrej Skopek, Márius Sajgalík, Victor Carbune, Felix Weissenberger, Julia Proskurnia, Bogdan Prisacari, Daniel Valcarce, Justin Lu, Rohit Prabhavalkar, Balint Miklos:
Replacing Human Audio with Synthetic Audio for on-Device Unspoken Punctuation Prediction. 7653-7657 - Zhiyuan Zeng, Hong Xu, Keqing He, Yuanmeng Yan, Sihong Liu, Zijun Liu, Weiran Xu:
Adversarial Generative Distance-Based Classifier for Robust Out-of-Domain Detection. 7658-7662 - Chaojie Liang, Peijie Huang, Wenbin Lai, Ziheng Ruan:
GAN-Based Out-of-Domain Detection Using Both In-Domain and Out-of-Domain Samples. 7663-7667 - Jiahao Wang, Minqian Liu, Xiaojun Quan:
Progressive Dialogue State Tracking for Multi-Domain Dialogue Systems. 7668-7672 - Yu Wang, Yilin Shen, Hongxia Jin:
Multi-Step Spoken Language Understanding System Based on Adversarial Learning. 7673-7677 - Haozhuang Liu, Ziran Li, Dongming Sheng, Hai-Tao Zheng, Ying Shen:
Multi-Entity Collaborative Relation Extraction. 7678-7682 - Hengzhu Tang, Yanan Cao, Zhenyu Zhang, Ruipeng Jia, Fang Fang, Shi Wang:
Multi-Granularity Heterogeneous Graph for Document-Level Relation Extraction. 7683-7687 - Xiangyu Xi, Wei Ye, Tong Zhang, Quanxiu Wang, Shikun Zhang, Huixing Jiang, Wei Wu:
Improving Event Detection by Exploiting Label Hierarchy. 7688-7692 - Jian Xie, Kai Zhang, Lin Sun, Yindu Su, Chenxiang Xu:
Improving NER in Social Media via Entity Type-Compatible Unknown Word Substitution. 7693-7697 - Yutong Wang, Renze Lou, Kai Zhang, Mao Yan Chen, Yujiu Yang:
More: A Metric Learning Based Framework for Open-Domain Relation Extraction. 7698-7702 - Denys Katerenchuk, Rivka Levitan:
"You Should Probably Read This": Hedge Detection in Text. 7703-7707 - Jinfeng Li, Tianyu Du, Xiangyu Liu, Rong Zhang, Hui Xue, Shouling Ji:
Enhancing Model Robustness by Incorporating Adversarial Knowledge into Semantic Representation. 7708-7712 - Keli Xie, Siyuan Lu, Meiqi Wang, Zhongfeng Wang:
Elbert: Fast Albert with Confidence-Window Based Early Exit. 7713-7717 - Jen-Tzung Chien, Wei-Hsiang Chang:
Dualformer: A Unified Bidirectional Sequence-to-Sequence Learning. 7718-7722 - Sathish Reddy Indurthi, Mohd Abbas Zaidi, Nikhil Kumar Lakumarapu, Beomseok Lee, Hyojung Han, Seokchan Ahn, Sangha Kim, Chanwoo Kim, Inchul Hwang:
Task Aware Multi-Task Learning for Speech to Text Tasks. 7723-7727 - Hao Guo, Xiangyang Li, Lei Zhang, Jia Liu, Wei Chen:
Label-Aware Text Representation for Multi-Label Text Classification. 7728-7732 - Yuan Wu, Diana Inkpen, Ahmed El-Roby:
Mixup Regularized Adversarial Networks for Multi-Domain Text Classification. 7733-7737 - Daniel Korzekwa, Jaime Lorenzo-Trueba, Szymon Zaporowski, Shira Calamaro, Thomas Drugman, Bozena Kostek:
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling. 7738-7742 - Binghuai Lin, Liyuan Wang:
Attention-Based Multi-Encoder Automatic Pronunciation Assessment. 7743-7747 - Bin Su, Shaoguang Mao, Frank K. Soong, Yan Xia, Jonathan Tien, Zhiyong Wu:
Improving Pronunciation Assessment Via Ordinal Regression with Anchored Reference Samples. 7748-7752 - Xizi Wei, Mark J. F. Gales, Kate M. Knill:
Analysing Bias in Spoken Language Assessment Using Concept Activation Vectors. 7753-7757 - Richeng Duan, Nancy F. Chen:
Senone-Aware Adversarial Multi-Task Training for Unsupervised Child to Adult Speech Adaptation. 7758-7762 - Yeh-Sheng Lin, Shu-Chuan Tseng:
Classifying Speech Intelligibility Levels of Children in Two Continuous Speech Styles. 7763-7767 - Yasser Hifny:
Recent Advances in Arabic Syntactic Diacritics Restoration. 7768-7772 - Michael Hentschel, Emiru Tsunoo, Takao Okuda:
Making Punctuation Restoration Robust and Fast with Multi-Task Learning and Knowledge Distillation. 7773-7777 - Tien-Ching Luo, Jen-Tzung Chien:
Variational Dialogue Generation with Normalizing Flows. 7778-7782 - Hwa-Yeon Kim, Jong-Hwan Kim, Jae-Min Kim:
NN-KOG2P: A Novel Grapheme-to-Phoneme Model for Korean Language. 7783-7787 - Yonghe Wang, Feilong Bao, Hui Zhang, Guanglai Gao:
Joint Alignment Learning-Attention Based Model for Grapheme-to-Phoneme Conversion. 7788-7792 - Chenyu You, Nuo Chen, Yuexian Zou:
Knowledge Distillation for Improved Accuracy in Spoken Question Answering. 7793-7797 - Luxi Xing, Yue Hu, Jing Yu, Yuqiang Xie, Wei Peng:
Coarse-To-Careful: Seeking Semantic-Related Knowledge for Open-Domain Commonsense Question Answering. 7798-7802 - Mahdi Namazifar, Alexandros Papangelis, Gökhan Tür, Dilek Hakkani-Tür:
Language Model is all You Need: Natural Language Understanding as Question Answering. 7803-7807 - Xu Wang, Shuai Zhao, Bo Cheng, Jiale Han, Yingting Li, Hao Yang, Ivan Sekulic, Guoshun Nan:
Integrating Subgraph-Aware Relation and Direction Reasoning for Question Answering. 7808-7812 - Jui-Heng Hsu, Po-Wei Shen, Hung-Ting Su, Chen-Hsi Chang, Jia-Fong Yeh, Winston H. Hsu:
Role Aware Multi-Party Dialogue Question Answering. 7813-7817 - Wei Peng, Yue Hu, Jing Yu, Luxi Xing, Yuqiang Xie, Zihao Zhu, Yajing Sun:
MCR-NET: A Multi-Step Co-Interactive Relation Network for Unanswerable Questions on Machine Reading Comprehension. 7818-7822 - Yuejie Lei, Yuanmeng Yan, Zhiyuan Zeng, Keqing He, Ximing Zhang, Weiran Xu:
Hierarchical Speaker-Aware Sequence-to-Sequence Model for Dialogue Summarization. 7823-7827 - Kai Chen, Guanyu Fu, Qingcai Chen, Baotian Hu:
A Large-Scale Chinese Long-Text Extractive Summarization Corpus. 7828-7832 - Nuo Chen, Fenglin Liu, Chenyu You, Peilin Zhou, Yuexian Zou:
Adaptive Bi-Directional Attention: Exploring Multi-Granularity Representations for Machine Reading Comprehension. 7833-7837 - Rui Yang, Runze Wang, Zhen-Hua Ling:
Graph Attention and Interaction Network With Multi-Task Learning for Fact Verification. 7838-7842 - Boxin Li, Tingwen Liu, Bin Wang, Lihong Wang:
Enhancing Deep Paraphrase Identification via Leveraging Word Alignment Information. 7843-7847 - Yu Wang, Yilin Shen, Hongxia Jin:
An End-To-End Actor-Critic-Based Neural Coreference Resolution System. 7848-7852 - Xinmiao Zhang, Keshab K. Parhi:
Reduced-Complexity Modular Polynomial Multiplication for R-LWE Cryptosystems. 7853-7857 - Lulu Ge, Keshab K. Parhi:
Seizure Detection Using Power Spectral Density via Hyperdimensional Computing. 7858-7862 - Faraz Bhatti, Thomas Greiner:
FPGA Hardware Design for Plenoptic 3D Image Processing Algorithm Targeting a Mobile Application. 7863-7867 - Ashish Shrivastava, Alan Gatherer, Tong Sun, Sushma Wokhlu, Alex Chandra:
SLAP: a Split Latency Adaptive VLIW Pipeline Architecture Which Enables on-The-Fly Variable SIMD Vector-Length. 7868-7872 - Shreyas Chaudhari, Harideep Nair, José M. F. Moura, John Paul Shen:
Unsupervised Clustering of Time Series Signals Using Neuromorphic Energy-Efficient Temporal Neural Networks. 7873-7877 - Yu-Lin Wei, Romit Roy Choudhury:
Angle-of-Arrival (AoA) Factorization in Multipath Channels. 7878-7882 - Zhenshan Xie, Xinmiao Zhang:
Scaled Fast Nested Key Equation Solver for Generalized Integrated Interleaved BCH Decoders. 7883-7887 - Zhangjie Peng, Cunhua Pan, Zhenkun Zhang, Xianzhe Chen, Li Li, A. Lee Swindlehurst:
Joint Optimization for Full-Duplex Cellular Communications Via Intelligent Reflecting Surface. 7888-7892 - Yi-Lin Lo, Chia-Hsiang Yang:
A Color Doppler Processing Engine with an Adaptive Clutter Filter for Portable Ultrasound Imaging Devices. 7893-7897 - Chieh-Fang Teng, Andrew Kuan-Shiuan Ho, Chen-Hsi Derek Wu, Sin-Sheng Wong, An-Yeu Andy Wu:
Convolutional Neural Network-Aided Bit-Flipping for Belief Propagation Decoding of Polar Codes. 7898-7902 - Florian Lemaitre, Arthur M. Hennequin, Lionel Lacassagne:
Taming Voting Algorithms on Gpus for an Efficient Connected Component Analysis Algorithm. 7903-7907 - Gonçalo Raposo, Pedro Tomás, Nuno Roma:
Positnn: Training Deep Neural Networks with Mixed Low-Precision Posit. 7908-7912 - Zohreh Hajiakhondi-Meybodi, Mohammad Salimibeni, Arash Mohammadi, Konstantinos N. Plataniotis:
Bluetooth Low Energy and CNN-Based Angle of Arrival Localization in Presence of Rayleigh Fading. 7913-7917 - Yuqian Hu, Muhammed Zahid Ozturk, Feng Zhang, Beibei Wang, Kuo J. Ray Liu:
Robust Device-Free Proximity Detection Using Wifi. 7918-7922 - Mohammadamin Atashi, Arash Mohammadi:
Online Dynamic Window (ODW) Assisted 2-Stage LSTM Indoor Localization for Smart Phones. 7923-7927 - Sihao Zhao, Xiao-Ping Zhang, Xiaowei Cui, Mingquan Lu:
Optimal TOA Localization for Moving Sensor in Asymmetric Network. 7928-7932 - Shicheng Hu, Miao Yang, Kai Kang, Hua Qian:
Low Complexity SLM for OFDMA System with Implicit Side Information. 7933-7937 - Chi-Shiang Wang, Pei-Yun Tsai:
Reduced-Complexity Channel Estimation by Hierarchical Interpolation Exploiting Sparsity for Massive MIMO Systems with Uniform Rectangular Array. 7938-7942 - Qing Yang, Ting Zhong, Fan Zhou:
Traffic Speed Forecasting Via Spatio-Temporal Attentive Graph Isomorphism Network. 7943-7947 - Fan Zhou, Xin Jing, Liang Li, Ting Zhong:
Inferring High-Resolutional Urban Flow With Internet Of Mobile Things. 7948-7952 - Liam M. Cronin, Soheil Sadeghi Eshkevari, Debarshi Sen, Shamim N. Pakzad:
Transfer Learning for Input Estimation of Vehicle Systems. 7953-7957 - Yunlu Wang, Cheng Yang, Menghan Hu, Jian Zhang, Qingli Li, Guangtao Zhai, Xiao-Ping Zhang:
Identification of Deep Breath While Moving Forward Based on Multiple Body Regions and Graph Signal Analysis. 7958-7962 - Su Pang, Hayder Radha:
Multi-Object Tracking Using Poisson Multi-Bernoulli Mixture Filtering For Autonomous Vehicles. 7963-7967 - Chengtao Xu, Fengyu He, Bowen Chen, Yushan Jiang, Houbing Song:
Adaptive RF Fingerprint Decomposition in Micro UAV Detection based on Machine Learning. 7968-7972 - Ruizhe Shen, Qi Zhan, Yu Wang, Huimin Ma:
Depression Detection by Analysing Eye Movements on Emotional Images. 7973-7977 - Guixin Huang, Sheng Huang, Luwen Huangfu, Dan Yang:
Weakly Supervised Patch Label Inference Network with Image Pyramid for Pavement Diseases Recognition in the Wild. 7978-7982 - Xiaoyi Shen, Dong-Yuan Shi, Woon-Seng Gan:
A Wireless Reference Active Noise Control Headphone Using Coherence Based Selection Technique. 7983-7987 - Mingfeng Hao, Mutallip Mamut, Nurbiya Yadikar, Alimjan Aysa, Kurban Ubul:
How to Use Time Information Effectively? Combining with Time Shift Module for Lipreading. 7988-7992 - Andrew Werchniak, Roberto Barra-Chicote, Yuriy Mishchenko, Jasha Droppo, Jeff Condal, Peng Liu, Anish Shah:
Exploring the application of synthetic audio in training keyword spotters. 7993-7996 - Siyang Yuan, Saurabh Gupta, Xing Fan, Derek Liu, Yang Liu, Chenlei Guo:
Graph Enhanced Query Rewriting for Spoken Language Understanding System. 7997-8001 - Madhurananda Pahar, Igor D. S. Miranda, Andreas H. Diacon, Thomas Niesler:
Deep Neural Network Based Cough Detection Using Bed-Mounted Accelerometer Measurements. 8002-8006 - Fengyu Wang, Xiaolu Zeng, Chenshu Wu, Beibei Wang, K. J. Ray Liu:
Radio Frequency Based Heart Rate Variability Monitoring. 8007-8011 - Diaa Badawi, Agamyrat Agambayev, Sule Ozev, A. Enis Çetin:
Discrete Cosine Transform Based Causal Convolutional Neural Network for Drift Compensation in Chemical Sensors. 8012-8016 - Sai Deepika Regani, Beibei Wang, K. J. Ray Liu:
Wifi-Based Device-Free Gesture Recognition Through-the-Wall. 8017-8021 - Muhammed Zahid Ozturk, Chenshu Wu, Beibei Wang, K. J. Ray Liu:
Sound Recovery From Radio Signals. 8022-8026 - Takaya Kawakatsu, Kenro Aihara, Atsuhiro Takasu, Jun Adachi, Haoqi Wang, Tomonori Nagayama:
Fully-Neural Approach to Vehicle Weighing and Strain Prediction on Bridges Using Wireless Accelerometers. 8027-8031 - Masahito Togami:
End To End Learning For Convolutive Multi-Channel Wiener Filtering. 8032-8036 - Mohammad Salimibeni, Parvin Malekzadeh, Arash Mohammadi, Petros Spachos, Konstantinos N. Plataniotis:
Makf-Sr: Multi-Agent Adaptive Kalman Filtering-Based Successor Representations. 8037-8041 - Dae Yon Hwang, Bilal Taha, Dimitrios Hatzinakos:
Variation-Stable Fusion for PPG-Based Biometric System. 8042-8046 - Subhankar Chattoraj, Sawon Pratiher, Souvik Pratiher, Hubert Konik:
Improving Stability of Adversarial Li-ion Cell Usage Data Generation using Generative Latent Space Modelling. 8047-8051 - Sungho Shin, Yoonho Boo, Wonyong Sung:
SQWA: Stochastic Quantized Weight Averaging For Improving The Generalization Capability Of Low-Precision Deep Neural Networks. 8052-8056 - Kavya Gupta, Béatrice Pesquet-Popescu, Fateh Kaakai, Jean-Christophe Pesquet:
A Quantitative Analysis Of The Robustness Of Neural Networks For Tabular Data. 8057-8061 - Hongliang Zhang, Lingyang Song, Zhu Han, H. Vincent Poor:
Spatial Equalization Before Reception: Reconfigurable Intelligent Surfaces for Multi-Path Mitigation. 8062-8066 - Jiang Liu, Xuewen Qian, Marco Di Renzo:
Interference Analysis in Reconfigurable Intelligent Surface-Assisted Multiple-Input Multiple-Output Systems. 8067-8071 - Shuai Nie, Ian F. Akyildiz:
Codebook Design for Dual-Polarized Ultra-Massive Mimo Communications at Millimeter Wave and Terahertz Bands. 8072-8076 - John A. Hodge, Kumar Vijay Mishra, Brian M. Sadler, Amir I. Zaghloul:
Performance Analysis of Spatial and Frequency Domain Index-Modulated Reconfigurable Intelligent Metasurfaces. 8077-8081 - Minchae Jung, Walid Saad:
Meta-Learning for 6G Communication Networks with Reconfigurable Intelligent Surfaces. 8082-8086 - Pingfan Song, Herman Verinaz-Jadan, Carmel L. Howe, Peter Quicke, Amanda J. Foust, Pier Luigi Dragotti:
Model-Inspired Deep Learning for Light-Field Microscopy with Application to Neuron Localization. 8087-8091 - Siheng Chen, Yonina C. Eldar:
Time-Varying Graph Signal Inpainting Via Unrolling Networks. 8092-8097 - Wei Chen, David Wipf, Miguel Rodrigues:
Deep Learning for Linear Inverse Problems Using the Plug-and-Play Priors Framework. 8098-8102 - Zhaodong Sun, Fabian Latorre, Thomas Sanchez, Volkan Cevher:
A Plug-and-Play Deep Image Prior. 8103-8107 - Subrata Sarkar, Rizwan Ahmad, Philip Schniter:
MRI Image Recovery using Damped Denoising Vector AMP. 8108-8112 - Marija Vella, João F. C. Mota:
Overcoming Measurement Inconsistency In Deep Learning For Linear Inverse Problems: Applications In Medical Imaging. 8113-8117 - Wei Cui, Wei Yu:
Scalable Reinforcement Learning For Routing In Ad-Hoc Networks Based On Physical-Layer Attributes. 8118-8122 - Mohamed Salah Ibrahim, Nicholas D. Sidiropoulos:
Blind Carbon Copy on Dirty Paper: Seamless Spectrum Underlay via Canonical Correlation Analysis. 8123-8127 - Shiyang Leng, Aylin Yener:
An Actor-Critic Reinforcement Learning Approach to Minimum age of Information Scheduling in Energy Harvesting Networks. 8128-8132 - B. R. Manoj, Guoda Tian, Sara Gunnarsson, Fredrik Tufvesson, Erik G. Larsson:
Moving Object Classification with a Sub-6 GHz Massive MIMO Array Using Real Data. 8133-8137 - Ryan M. Dreifuerst, Samuel Daulton, Yuchen Qian, Paul Parayil Varkey, Maximilian Balandat, Sanjay Kasturia, Anoop Tomar, Ali Yazdan, Vish Ponnampalam, Robert W. Heath Jr.:
Optimizing Coverage and Capacity in Cellular Networks using Machine Learning. 8138-8142 - Zhiyang Wang, Mark Eisen, Alejandro Ribeiro:
Unsupervised Learning for Asynchronous Resource Allocation In Ad-Hoc Wireless Networks. 8143-8147 - Anoosheh Heidarzadeh, Krishna Narayanan:
Two-Stage Adaptive Pooling with RT-QPCR for Covid-19 Screening. 8148-8152 - Daniel Yaron, Daphna Keidar, Elisha Goldstein, Yair Shachar, Ayelet Blass, Oz Frank, Nir Schipper, Nogah Shabshin, Ahuva Grubstein, Dror Suhami, Naama R. Bogot, Chedva S. Weiss, Eyal Sela, Amiel A. Dror, Mordehay Vaturi, Federico Mento, Elena Torri, Riccardo Inchingolo, Andrea Smargiassi, Gino Soldati, Tiziano Perrone, Libertario Demi, Meirav Galun, Shai Bagon, Yishai M. Elyada, Yonina C. Eldar:
Point of Care Image Analysis for COVID-19. 8153-8157 - Pushpendra Singh, Amit Singhal, Binish Fatimah, Anubha Gupta:
An Improved Data Driven Dynamic SIRD Model for Predictive Monitoring of COVID-19. 8158-8162 - Anirudh Sridhar, Osman Yagan, Rashad Eletreby, Simon A. Levin, Joshua B. Plotkin, H. Vincent Poor:
Leveraging A Multiple-Strain Model with Mutations in Analyzing the Spread of Covid-19. 8163-8167 - Ritesh Goenka, Shu-Jie Cao, Chau-Wai Wong, Ajit Rajwade, Dror Baron:
Contact Tracing Enhances the Efficiency of Covid-19 Group Testing. 8168-8172 - Anuj S. Vora, Ankur A. Kulkarni:
Optimal Questionnaires for Screening of Strategic Agents. 8173-8177 - Yanhao Zhang, Jianmin Wu, Xiong Xiong, Dangwei Li, Chenwei Xie, Yun Zheng, Pan Pan, Yinghui Xu:
Exploring Visual-Audio Composition Alignment Network for Quality Fashion Retrieval in Video. 8178-8182 - Liejun Wang, Haitao Yu:
A Secure Searchable Image Retrieval Scheme with Correct Retrieval Identity. 8183-8187 - Dechuan Teng, Libo Qin, Wanxiang Che, Sendong Zhao, Ting Liu:
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding. 8188-8192 - Libo Qin, Tailu Liu, Wanxiang Che, Bingbing Kang, Sendong Zhao, Ting Liu:
A Co-Interactive Transformer for Joint Slot Filling and Intent Detection. 8193-8197 - Yatian Wang, Xiaolin Song, Yezhen Wang, Pengfei Xu, Runbo Hu, Hua Chai:
Dual Metric Discriminator for Open Set Video Domain Adaptation. 8198-8202 - Tian Li, Xiang Chen, Shanghang Zhang, Zhen Dong, Kurt Keutzer:
Cross-Domain Sentiment Classification with Contrastive Learning and Mutual Information Maximization. 8203-8207 - Chenwen Liu, Shengheng Liu, Zihuan Mao, Yongming Huang, Haiming Wang:
Low-Complexity Parameter Learning for OTFS Modulation Based Automotive Radar. 8208-8212 - Ahmet M. Elbir, Sinem Coleri, Kumar Vijay Mishra:
Federated Dropout Learning for Hybrid Beamforming with Spatial Path Index Modulation in Multi-User Mmwave-Mimo Systems. 8213-8217 - Daniel M. Wong, Batu K. Chalise, Justin G. Metcalf, Moeness G. Amin:
Information Decoding and SDR Implementation of DFRC Systems without Training Signals. 8218-8222 - Siyu Zhu, Feng Xi, Shengyao Chen, Arye Nehorai:
A Low-Complexity MIMO Dual Function Radar Communication System via One-Bit Sampling. 8223-8227 - Zhaoyi Xu, Fan Liu, Konstantinos I. Diamantaras, Christos Masouros, Athina P. Petropulu:
Learning to Select for Mimo Radar Based on Hybrid Analog-Digital Beamforming. 8228-8232 - Mohammad Mahbubur Rahman, Emre Kurtoglu, Robiulhossain Mdrafi, Ali Cafer Gürbüz, Evie Malaia, Chris S. Crawford, Darrin J. Griffin, Sevgi Zubeyde Gurbuz:
Word-Level ASL Recognition and Trigger Sign Detection with RF Sensors. 8233-8237 - Ziyang Cheng, Jinyang He, Shengnan Shi, Zishu He, Bin Liao:
Hybrid Beamforming for Wideband OFDM Dual Function Radar Communications. 8238-8242 - Dingyou Ma, Nir Shlezinger, Tianyao Huang, Yimin Liu, Yonina C. Eldar:
Bit Constrained Communication Receivers In Joint Radar Communications Systems. 8243-8247 - Musa Furkan Keskin, Henk Wymeersch, Visa Koivunen:
ICI-Aware Parameter Estimation for Mimo-Ofdm Radar via Apes Spatial Filtering. 8248-8252 - Xiangrong Wang, Jing Xu, Aboulnasr Hassanien, Elias Aboutanios:
Joint Communications with FH-MIMO Radar Systems: An Extended Signaling Strategy. 8253-8257 - Jaakko Marin, Micael Bernhardt, Taneli Riihonen:
Full-Duplex Multifunction Transceiver with Joint Constant Envelope Transmission and Wideband Reception. 8258-8262 - Yongzhe Li, Xinyu Wu, Ran Tao:
Waveform Design for the Joint MIMO Radar and Communications with Low Integrated Sidelobe Levels and Accurate Information Embedding. 8263-8267 - Ken R. Duffy:
Ordered Reliability Bits Guessing Random Additive Noise Decoding. 8268-8272 - Andreas Buchberger, Christian Häger, Henry D. Pfister, Laurent Schmalen, Alexandre Graell i Amat:
Learned Decimation for Neural Belief Propagation Decoders: Invited Paper. 8273-8277 - Kira Kraft, Norbert Wehn:
ADMM-Based ML Decoding: from Theory to Practice. 8278-8282 - Thibaud Tonnellier, Marzieh Hashemipour, Nghia Doan, Warren J. Gross, Alexios Balatsoukas-Stimming:
Towards Practical Near-Maximum-Likelihood Decoding of Error-Correcting Codes: An Overview. 8283-8287 - Syed Mohsin Abbas, Thibaud Tonnellier, Furkan Ercan, Marwan Jalaleddine, Warren J. Gross:
High-Throughput VLSI Architecture for Soft-Decision Decoding with ORBGRAND. 8288-8292 - Marzieh Hashemipour-Nazari, Kees Goossens, Alexios Balatsoukas-Stimming:
Hardware Implementation of Iterative Projection-Aggregation Decoding of Reed-Muller Codes. 8293-8297 - Yuheng Wang, Haipeng Liu, Kening Cui, Anfu Zhou, Wensheng Li, Huadong Ma:
m-Activity: Accurate and Real-Time Human Activity Recognition Via Millimeter Wave Radar. 8298-8302 - Dongheng Zhang, Xiong Li, Yan Chen:
Pushing the Limit of Phase Offset for Contactless Sensing Using Commodity Wifi. 8303-8307 - Kohei Yamamoto, Tomoaki Ohtsuki:
Noncontact Heartbeat Detection by Viterbi Algorithm with Fusion of Beat-Beat Interval and Deep Learning-Driven Branch Metrics. 8308-8312 - Siyao Cheng, Jialiang Yan, Jianzhong Li, Jie Liu:
Typingwristband: A Human Slight Motion Sensing System Based on Vibration Detection. 8313-8317 - Ossi Kaltiokallio, Hüseyin Yigitler:
Movement Detection Using A Reciprocal Received Signal Strength Model. 8318-8322 - Xuyu Wang, Mohini Patil, Chao Yang, Shiwen Mao, Palak Anilkumar Patel:
Deep Convolutional Gaussian Processes for Mmwave Outdoor Localization. 8323-8327 - Jing Han, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Cecilia Mascolo:
Exploring Automatic COVID-19 Diagnosis via Voice and Symptoms from Crowdsourced Data. 8328-8332 - Daniyal Liaqat, Salaar Liaqat, Jun Lin Chen, Tina Sedaghat, Moshe Gabel, Frank Rudzicz, Eyal de Lara:
Coughwatch: Real-World Cough Detection using Smartwatches. 8333-8337 - Paula Andrea Pérez-Toro, Juan Camilo Vásquez-Correa, Tomás Arias-Vergara, Philipp Klumpp, M. Sierra-Castrillón, M. E. Roldán-López, David Aguillón, Liliana Hincapié-Henao, Carlos Andrés Tobón-Quintero, Tobias Bocklet, Maria Schuster, Juan Rafael Orozco-Arroyave, Elmar Nöth:
Acoustic and Linguistic Analyses to Assess Early-Onset and Genetic Alzheimer's Disease. 8338-8342 - Nengheng Zheng, Yupeng Shi, Yuyong Kang, Qinglin Meng:
A Noise-Robust Signal Processing Strategy for Cochlear Implants Using Neural Networks. 8343-8347 - Amr Gaballah, Abhishek Tiwari, Shrikanth Narayanan, Tiago H. Falk:
Context-Aware Speech Stress Detection in Hospital Workers Using Bi-LSTM Classifiers. 8348-8352 - Shengchen Li, Ke Tian, Rui Wang:
Unsupervised Heart Abnormality Detection Based on Phonocardiogram Analysis with Beta Variational Auto-Encoders. 8353-8357 - Ke Tan, DeLiang Wang:
Compressing Deep Neural Networks for Efficient Speech Enhancement. 8358-8362 - Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi:
Improved Mask-CTC for Non-Autoregressive End-to-End ASR. 8363-8367 - Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Michael L. Seltzer, Vikas Chandra:
Memory-Efficient Speech Recognition on Smart Devices. 8368-8372 - Farzaneh S. Fard, Vikrant Singh Tomar:
Expediting discovery in Neural Architecture Search by Combining Learning with Planning. 8373-8377 - Sangeeta Srivastava, Dhrubojyoti Roy, Mark Cartwright, Juan Pablo Bello, Anish Arora:
Specialized Embedding Approximation for Edge Intelligence: A Case Study in Urban Sound Classification. 8378-8382 - Song Li, Beibei Ouyang, Lin Li, Qingyang Hong:
Light-TTS: Lightweight Multi-Speaker Multi-Lingual Text-to-Speech. 8383-8387 - Yutao Chen, Ronghao Lin, Jian Li:
Efficient Long Periodic Binary Sequence Designs for Automotive Radar. 8388-8392 - Fan Liu, Christos Masouros:
Joint Localization and Predictive Beamforming in Vehicular Networks: Power Allocation Beyond Water-Filling. 8393-8397 - Yuwei Cheng, Jingran Su, Hongyu Chen, Yimin Liu:
A New Automotive Radar 4D Point Clouds Detector by Using Deep Learning. 8398-8402 - Sayed Hossein Dokhanchi, R. Bhavani Shankar Mysore, Kumar Vijay Mishra, Björn E. Ottersten:
Enhanced Automotive Target Detection through Radar and Communications Sensor Fusion. 8403-8407 - Gang Yao, Pu Wang, Karl Berntorp, Hassan Mansour, Petros Boufounos, Philip V. Orlik:
Extended Object Tracking With Automotive Radar Using B-Spline Chained Ellipses Model. 8408-8412 - Shunqiao Sun, Yimin D. Zhang:
Four-Dimensional High-Resolution Automotive Radar Imaging Exploiting Joint Sparse-Frequency and Sparse-Array Design. 8413-8417 - Shrishti Saha Shetu, Soumitro Chakrabarty, Emanuël Anco Peter Habets:
An Empirical Study of Visual Features for DNN Based Audio-Visual Speech Enhancement in Multi-Talker Environments. 8418-8422 - Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz:
On The Role of Visual Cues in Audiovisual Speech Enhancement. 8423-8427 - Christoph Böddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Naoyuki Kamo, Yanmin Qian, Reinhold Haeb-Umbach:
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. 8428-8432 - Aswin Shanmugam Subramanian, Chao Weng, Shinji Watanabe, Meng Yu, Yong Xu, Shi-Xiong Zhang, Dong Yu:
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization. 8433-8437 - Jonah Casebeer, Jamshed Kaikaus, Paris Smaragdis:
Communication-Cost Aware Microphone Selection for Neural Speech Enhancement with Ad-Hoc Microphone Arrays. 8438-8442 - Marvin Tammen, Simon Doclo:
Deep Multi-Frame MVDR Filtering for Single-Microphone Speech Enhancement. 8443-8447 - Hongwei Wang, Jilin Wang, Jun Fang, Hongbin Li:
Compressive Wideband Spectrum Sensing and Carrier Frequency Estimation with Unknown Mimo Channels. 8448-8452 - Fangzhou Wang, Hongbin Li, Braham Himed:
Joint Optimization of Spectrally Co-Existing Multi-Carrier Radar and Communication Systems in Cluttered Environments. 8453-8457 - Indu Priya Eedara, Moeness G. Amin, Giuseppe A. Fabrizio:
Target Detection in Frequency Hopping MIMO Dual-Function Radar-Communication Systems. 8458-8462 - Akshay S. Bondre, Christ D. Richmond:
Asymptotic Distribution of Generalized Likelihood Ratio Test Under Model Misspecification With Application to Cooperative Radar-Communications. 8463-8467 - Elias Aboutanios, Hamed Nosrati, Xiangrong Wang:
Online Antenna Selection for Enhanced DOA Estimation. 8468-8472 - Charles A. Mohr, Shannon D. Blunt:
Designing Random FM Radar Waveforms with Compact Spectrum. 8473-8477 - Nir Shlezinger, Erez Farhan, Hai Morgenstern, Yonina C. Eldar:
Collaborative Inference via Ensembles on the Edge. 8478-8482 - Peiyin Xing, Xiaofei Liu, Peixi Peng, Tiejun Huang, Yonghong Tian:
Allocating DNN Layers Computation Between Front-End Devices and The Cloud Server for Video Big Data Processing. 8483-8487 - Jiawei Shao, Haowei Zhang, Yuyi Mao, Jun Zhang:
Branchy-GNN: A Device-Edge Co-Inference Framework for Efficient Point Cloud Processing. 8488-8492 - Ivan V. Bajic, Weisi Lin, Yonghong Tian:
Collaborative Intelligence: Challenges and Opportunities. 8493-8497 - Mateen Ulhaq, Ivan V. Bajic:
Latent Space Motion Analysis for Collaborative Intelligence. 8498-8502 - Shurun Wang, Shiqi Wang, Wenhan Yang, Xinfeng Zhang, Shanshe Wang, Siwei Ma:
Teacher-Student Learning With Multi-Granularity Constraint Towards Compact Facial Feature Representation. 8503-8507 - Samuel Pfrommer, Alejandro Ribeiro, Fernando Gama:
Discriminability of Single-Layer Graph Neural Networks. 8508-8512 - Henry Kenlay, Dorina Thanou, Xiaowen Dong:
On The Stability of Graph Convolutional Neural Networks Under Edge Rewiring. 8513-8517 - Yimeng Min, Frederik Wenkel, Guy Wolf:
Geometric Scattering Attention Networks. 8518-8522 - Dylan Sandfelder, Priyesh Vijayan, William L. Hamilton:
Ego-GNNs: Exploiting Ego Structures in Graph Neural Networks. 8523-8527 - Lei Chen, Zhengdao Chen, Joan Bruna:
Learning the Relevant Substructures for Tasks on Graph Data. 8528-8532 - Ningyuan Teresa Huang, Soledad Villar:
A Short Tutorial on The Weisfeiler-Lehman Test And Its Variants. 8533-8537 - Xinyue Xu, Xiaolu Zheng:
Hybrid Model for Network Anomaly Detection with Gradient Boosting Decision Trees and Tabtransformer. 8538-8542 - Tzu-Hsin Yang, Yu-Tai Lin, Chao-Lun Wu, Chih-Yu Wang:
Voting-Based Ensemble Model for Network Anomaly Detection. 8543-8547 - Fengrui Liu, Xuefei Li, Wei Xiong, Haiyang Jiang, Gaogang Xie:
An Accuracy Network Anomaly Detection Method Based on Ensemble Model. 8548-8552 - Bin Li, Yijie Wang, Mingyu Liu, Kele Xu, Zhongyang Wang, Li Cheng, Yizhou Li:
Fden: Mining Effective Information of Features in Detecting Network Anomalies. 8553-8557 - Pratyush Garg, Rishabh Ranjan, Kamini Upadhyay, Monika Agrawal, Desh Deepak:
Multi-Scale Residual Network for Covid-19 Diagnosis Using CT-Scans. 8558-8562 - Bingyang Li, Qi Zhang, Yinan Song, Zhicheng Zhao, Zhu Meng, Fei Su:
Diagnosing Covid-19 from CT Images Based on an Ensemble Learning Framework. 8563-8567 - Fares Bougourzi, Riccardo Contino, Cosimo Distante, Abdelmalik Taleb-Ahmed:
CNR-IEMN: A Deep Learning Based Approach to Recognise Covid-19 from CT-Scan. 8568-8572 - Shuohan Xue, Charith Abhayaratne:
Covid-19 Diagnostic Using 3D Deep Transfer Learning for Classification of Volumetric Computerised Tomography Chest Scans. 8573-8577 - Zaifeng Yang, Yubo Hou, Zhenghua Chen, Le Zhang, Jie Chen:
A Multi-Stage Progressive Learning Strategy for Covid-19 Diagnosis Using Chest Computed Tomography with Imbalanced Data. 8578-8582 - Shubham Chaudhary, Sadbhawna, Vinit Jakhetiya, Badri N. Subudhi, Ujjwal Baid, Sharath Chandra Guntuku:
Detecting Covid-19 and Community Acquired Pneumonia Using Chest CT Scan Images With Deep Learning. 8583-8587 - Chung-Ming Chien, Jheng-Hao Lin, Chien-yu Huang, Po-Chun Hsu, Hung-yi Lee:
Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech. 8588-8592 - Zengqiang Shang, Haozhe Zhang, Ziyi Chen, Bolin Zhou, Pengyuan Zhang:
The Thinkit System for ICASSP2021 M2voc Challenge. 8593-8597 - Wei Song, Xin Yuan, Zhengchen Zhang, Chao Zhang, Youzheng Wu, Xiaodong He, Bowen Zhou:
Dian: Duration Informed Auto-Regressive Network for Voice Cloning. 8598-8602 - Tao Wang, Ruibo Fu, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Chunyu Qiang, Shiming Wang:
Prosody and Voice Factorization for Few-Shot Speaker Adaptation in the Challenge M2voc 2021. 8603-8607 - Jie Wang, Yuren You, Feng Liu, Deyi Tuo, Shiyin Kang, Zhiyong Wu, Helen Meng:
The Huya Multi-Speaker and Multi-Style Speech Synthesis System for M2voc Challenge 2020. 8608-8612 - Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu:
The Multi-Speaker Multi-Style Voice Cloning Challenge 2021. 8613-8617