26th O-COCOSDA 2023: Delhi, India
- 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2023, Delhi, India, December 4-6, 2023. IEEE 2023, ISBN 979-8-3503-4402-8
- Shiva Sagar Sapkota, Aman Shakya, Basanta Joshi:
Spoken Language Identification Using Convolutional Neural Network In Nepalese Context. 1-6
- Yi Zhu, Wenhuan Lu, Yangzom, Mengfei Hu, Kuntharrgyal Khysru, Jianguo Wei:
Construction and analysis of Tibetan Khampa dialect corpus for speech synthesis. 1-6
- A. K. Punnoose:
Analysis of Non-Matching Reference Approach to Predict Speech Intelligibility. 1-4
- YingWei Tan, XueFeng Ding:
Convolutional Recurrent Neural Network with Attention Mechanism and Feature Aggregation for Voice Activity Detection. 1-5
- Amruth Ashok Gadag, Rajib Sharma, Deepak K. T.:
Beamforming using Different Window Techniques for Near-Field Speech in Anechoic and Reverberant Environment. 1-5
- Kei Furukawa, Satoshi Nakamura:
Investigation of Validity of Paradigmatic Diagnosis for Downstep in Japanese. 1-6
- Jingwen Huang, Aijun Li:
An Experimental Study on Declarative and Interrogative Sentences in Shanghai Chinese*. 1-7
- Tonmoy Rajkhowa, Amartya Chowdhury, Prashant Bannulmath, Deepak K. T., S. R. Mahadeva Prasanna:
Optimizing Direct Speech-to-Text Translation for un-orthographic low-resource tribal languages using source transliterations. 1-6
- Sakshi Mittal, Aiman Shaikh, Pooja Gambhir, Amita Dev, Poonam Bansal:
An Ensemble Approach for Speaker Recognition using Deep Neural Networks. 1-7
- YingWei Tan, XueFeng Ding:
Heterogeneous Network Framework with Attention Mechanism of Speech Enhancement for Car Intelligent Cockpit Speech Recognition. 1-5
- Pooja Gambhir, Amita Dev, Poonam Bansal:
Investigating Activation Functions to Enhance Speaker Identification with LSTM Networks. 1-7
- Shalini V. Sathe, Ratnadeep R. Deshmukh, Santosh K. Maher, Swapnil Waghmare:
An Isolated Words Balanced Corpus for Native and Non-Native Urdu Speakers in Automatic Speech Recognition. 1-5
- Moumita Pakrashi, Brigitte Bigi, Shakuntala Mahanta:
Automatic Syllabification of Bengali in SPPAS. 1-6
- Thiyam Susma Devi, Pradip K. Das:
Speech Dataset Development for a Low-Resource Tibeto-Burman Tonal Language. 1-6
- Aijun Li, Yuan Ye, Ziyu Xiong:
INTO_CASS_HEFEI: A Speech Corpus for Intonation and Prosody Study of Hefei Chinese. 1-6
- Geetika Gupta, Karuna Kadian, Raksha Jain, Vimal Dwivedi, Arun Sharma:
Real-time Hate Speech Detection in Live Streaming Platforms using Quantum Machine Learning. 1-6
- Xiaoyan Zhang, Aijun Li, Zhiqiang Li:
Duration Properties and Contrast Preservation in Taifeng Tone Sandhi. 1-6
- Zhenghai You, Mewlude Nijat, Ying Shi, Chen Chen, Wenqiang Du, Askar Hamdulla, Dong Wang:
Zero-shot Mispronunciation Detection by Knowledge-based Data Augmentation. 1-6
- Bharati D. Borade, Ratnadeep R. Deshmukh, Santosh K. Maher, Swapnil Waghmare:
Designing and Developing a Marathi Speech Database for Native and Non-Native Emotional Speech in the Marathi Language. 1-6
- Hunny Gaur, Devendra K. Tayal, Amita Jain:
ViQG: Web Tool for Automatic Question Generation from Code for Viva Preparation. 1-6
- Bhavika Sachdeva, Harshita Rathee, Pooja Gambhir, Poonam Bansal:
Empirical Analysis of Machine Learning Models on Parkinson's Speech Dataset. 1-5
- Phondanai Khanti, Pannathorn Sathirasattayanon, Patthranit Kaewcharuay, Nanthayod Termkoh, Ekachai Phaisangittisagul, Kasorn Galajit, Jessada Karnjana:
Speech Watermarking for Tampering Detection Using Singular Spectrum Analysis with a Psychoacoustic Model. 1-7
- Ramesh K. Bhukya, Anjali Chaturvedi, Hardik Bajaj, Udgam Shah, Sumit Singh, Uma Shanker Tiwary:
Efficiently Transferring Pre-trained Language Model RoBERTa Base English to Hindi Using WECHSEL. 1-6
- Mukund Kumar Roy, Karunesh Kumar Arora, Joyanta Basu, Saikat Basu, Sunita Arora, Shyam S. Agarwal:
A Novel Approach for Bootstrapping and Automatic Transcription of Low Resourced Language Speech Corpus. 1-5
- Nayan Anand, Meenakshi Sirigiraju, Chiranjeevi Yarra:
IIITH MM2 Speech-Text: A preliminary data for automatic spoken data validation with matched and mismatched speech-text content. 1-6
- Aomin, Dahu Baiyila, Aijun Li:
Perception Of Long And Short Vowel Contrast In Mongolian. 1-4
- Shalini Tomar, Pragya Gupta, Shashidhar G. Koolagudi:
NITK-KLESC: Kannada Language Emotional Speech Corpus for Speaker Recognition. 1-6
- Rajesha N., Rejitha K. S., Narayan Kumar Choudhary:
Evaluation of Assamese Speech Data Transcriptions by Levenshtein Distance. 1-4
- Sreeja Manghat, Sreeram Manghat, Tanja Schultz:
Few-shot meta multilabel classifier for low resource accented code-switched speech. 1-6
- Vartika Tyagi, Amita Dev, Poonam Bansal:
Analysis and Classification of Dysarthric Speech. 1-6
- Lalaram Arya, Sai Naga Venu Gopal Bhamidi, Shashi Prabha, S. R. Mahadeva Prasanna:
Comparative Analysis of Direct Speech-to-Speech Translation and Voice Conversion Using Bi-LSTM. 1-6
- Xintong Zuo, Yuan Jia, Hui Feng:
Acoustic Features and Patterns of Chinese sibilants and English Fricatives by Native Uyghur Speakers. 1-5
- Joyshree Chakraborty, Rohit Sinha, Priyankoo Sarmah:
ASHI: A Database of Assamese Accented Hindi. 1-6
- Yishan Huang:
Yangru Tone in Southern Min: Variation across Contexts. 1-6
- Shakuntala Mahanta, Bibungshri Boro, Priti Raychoudhury:
Focus and Intonation in Dimasa. 1-6
- Rejitha K. S., Rajesha N., Narayan Kumar Choudhary:
Type-Token Analysis on LDC-IL Text Corpus. 1-6
- Shivani Goel, Rashmi Ashtt, Monali Wankar, Prasoon Gupta:
"The Potential of Speech Technology to Enhance the Quality of Life in Historic Cities". 1-7
- Ashwini S. Ganakwar, Santosh K. Maher, Ratnadeep R. Deshmukh:
Enhancing Sanskrit Isolated Word Recognition: A Comparative Analysis of MFCC and SVM Feature Integration. 1-6
- Kanwar Dimple Singh, Rashmi Ashtt:
Leveraging Speech Recognition for Smart Urban Last Mile Connectivity Enhancement. 1-26
- Ashwini S. Shinde, Vaishali V. Patil, Albaab Shaikh, Pratik More, Kajal Salve:
Design and Validation of HindiSER: Speech Emotion Recognition Dataset for Hindi Language. 1-5
- Parul Mann, Anmol Jha, Ritu Rani, Garima Jaiswal, Arun Sharma, Amita Dev:
Automated Diagnosis of Parkinson's Disease using Speech Signals with Machine Learning. 1-6
- Surbhi Khurana, Amita Dev, Poonam Bansal:
Feature Comparison for Speech Emotion Recognition on Hindi Language. 1-6
- Binbin Sun, Hui Feng, Tianqi Geng:
Prosodic Encoding of Focus and Interrogative mood in Tianjin Dialect. 1-6
- Mani Gupta, Rashmi Ashtt, Monali Wankar, Ajay Monga:
Speech Recognition Applications in Enhancing Safety for Women in Built Environment. 1-15
- Myat Aye Aye Aung, Win Pa Pa, Hay Mar Soe Naing:
M-Diarization: A Myanmar Speaker Diarization using Multi-scale dynamic weights. 1-5
- Abhayjeet Singh, Charu Shah, Rajashri Varadaraj, Sonakshi Chauhan, Prasanta Kumar Ghosh:
SPIRE-SIES: A Spontaneous Indian English Speech Corpus. 1-6
- Florance Yune, Khin Mar Soe:
Advancing Transfer Learning Paradigms for Myanmar (Burmese) to Wa (Austroasiatic Language Family) Language Translation. 1-6
- Keisuke Toyama, Katsuhito Sudoh, Satoshi Nakamura:
E2E Refined Dataset. 1-5
- Sreeja Manghat, Sreeram Manghat, Tanja Schultz:
Data augmentation strategies for low resource conversational code-switching. 1-7
- Soma Khan, Joyanta Basu, Milton Samirakshma Bepari, Madhab Pal, Rajib Roy:
Designing an IVR-based Speech Data Collection Framework for building Realistic Speech Corpus on Forensic Automatic Speaker Recognition. 1-6
- Aryaman Sharma, Harshit Gupta, Tabishi Singh, Gaurav Singal, Riti Kushwaha:
NayanCom - A Smart Patient Communication System. 1-8
- Parabattina Bhagath, Vanga Lasya, Pulapaka Dhyeya, Pradip K. Das:
Telugu Vakyalu: Spoken Telugu Sentences for IoT Applications. 1-5
- Sumonmas Thatphithakkul, Kwanchiva Thangthai, Sahatsawat Sriphol, Vataya Chunwijitra:
The Development of a Thai Telephone Conversational Speech Corpus. 1-6
- Shweta Sharma, Rashmi Ashtt, Monali Wankar:
"Enhancing Efficiency and Conservation via Speech Processing in Lutyens's Delhi Residential Revitalization". 1-7
- Linjiao Pan, Yuan Jia:
A Research on Uygur Primary Teachers' Production Characteristics and Hierarchy of Difficulty in Acquiring Vowels of Standard Chinese. 1-5
- Rashmi Ashtt, Mayank Mathur:
Transforming Shahjahanabad into a Smart Heritage City Integrating Good Governance, Speech, and IoT Technologies for Sustainable Urban Development. 1-6
- Suhani, Amita Dev, Poonam Bansal:
CTC-Based End-to-End Speech Recognition for Low Resource Language Sanskrit. 1-5
- Aashi Gupta, Priya Sharma, Kiran Malik, Ritika Kumari, Poonam Bansal:
ChatterBot - An AI Conversational Entity. 1-6
- Krisangi Saikia, Shakuntala Mahanta:
Exploration of Speech Rhythm in Deori L1 and L2. 1-8
- Yuan-Fu Liao, Shaw-Hwa Hwang, You-Shuo Chen, Han-Chun Lai, Yao-Hsing Chung, Li-Te Shen, Yen-Chun Huang, Chi-Jung Huang, Hsu Wen Han, Li-Wei Chen, Pei-Chung Su, Chao-Shih Huang:
Taiwanese Hakka Across Taiwan Corpus and Formosa Speech Recognition Challenge 2023 - Hakka ASR. 1-6
- Shambhu Sharan, Shweta Bansal, Poonam Bansal, Amita Dev, Shyam S. Agrawal:
Empirical Analysis of Phonological and Prosodic Features of Native and Non-Native Hindi Speakers. 1-7
- Surbhi Bharti, Prerna Jha, Medha Arora, Ashwni Kumar:
Speech Enhancement And Noise Reduction In Forensic Applications. 1-5
- Jue Yu, Kexin Zhang:
Acoustic Development of Vowel Production by Prelingually Deaf Chinese Mandarin-speaking Children with Cochlear Implants. 1-6
- Devendra Kayande, Indra Ballav Sonowal, Ramesh K. Bhukya:
Fine-tuning the Wav2Vec2 Model for Automatic Speech Emotion Recognition System. 1-6
- Yizhou Lan, Tongtong Xie, Jingbai Sun, Yuenan Zhu, Albert Lee:
Second Language Accent Perception and Language Attitude by Mandarin and Cantonese Speakers in Mainland China. 1-5
- Yasuyuki Usuda:
Prosody in Everyday Japanese Conversation at the Clause Final. 1-5
- Parth Sanjay Khadse, Sabyasachi Chandra, Sankar Mukherjee, Puja Bharati, Debolina Pramanik, Aniket Aitawade, Shyamal Kumar Das Mandal:
End-to-End Cross-Lingual Voice Conversion using CycleGAN for Low Resource Indian Languages. 1-6
- Hang Xi, Sakriani Sakti:
Exploring Difficulties Encountered by Professional Interpreters in Japanese-to-English and English-to-Japanese Simultaneous Translation. 1-6
- Sougata Mukherjee, Prashant Bannulmath, Deepak K. T., S. R. Mahadeva Prasanna:
Leveraging Cross Lingual Speech Representations To Build ASR For Under-resourced Languages. 1-6
- Jia-Jyu Su, Pang-Chen Liao, Yen-Ting Lin, Wu-Hao Li, Guan-Ting Liou, Cheng-Che Kao, Wei-Cheng Chen, Jen-Chieh Chiang, Wen-Yang Chang, Pin-Han Lin, Chen-Yu Chiang:
VoiceBank-2023: A Multi-Speaker Mandarin Speech Corpus for Constructing Personalized TTS Systems for the Speech Impaired. 1-6
- Yifan Mou, Lei Zhu:
The Effects of Aging on Electroglottographic and Acoustic Parameters of Voices and the Detection of Change Points in Vocal Aging. 1-6
- Kana Miyamoto, Hiroki Tanaka, Kazuhiro Shidara, Satoshi Nakamura:
Emotion Prediction Using Multi-source Biosignals During Cognitive Behavior Therapy with Conversational Virtual Agents. 1-6
- Bella Septina Ika Hartanti, Dipta Tanaya, Kurniawati Azizah, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti:
Generating Speech with Prosodic Prominence based on SSL-Visually Grounded Models. 1-6