24th NoDaLiDa 2023: Tórshavn, Faroe Islands
- Tanel Alumäe, Mark Fishel:
Proceedings of the 24th Nordic Conference on Computational Linguistics, NoDaLiDa 2023, Tórshavn, Faroe Islands, May 22-24, 2023. University of Tartu Library 2023, ISBN 978-99-1621-999-7 - Frontmatter.
- Ghazaal Sheikhi, Samia Touileb, Sohail Ahmed Khan:
Automated Claim Detection for Fact-checking: A Case Study using Norwegian Pre-trained Language Models. 1-9
- Cedric Lothritz, Bertrand Lebichot, Kevin Allix, Saad Ezzini, Tegawendé F. Bissyandé, Jacques Klein, Andrey Boytsov, Clément Lefebvre, Anne Goujon:
Evaluating the Impact of Text De-Identification on Downstream NLP Tasks. 10-16
- Thór Sverrisson, Hafsteinn Einarsson:
Abstractive Text Summarization for Icelandic. 17-31
- Carlos Daniel Hernandez Mena, Annika Simonsen, Jón Guðnason:
ASR Language Resources for Faroese. 32-41
- Yuri Bizzoni, Pascale Moreira, Nicole Dwenger, Ida Marie Schytt Lassen, Mads Rosendahl Thomsen, Kristoffer L. Nielbo:
Good Reads and Easy Novels: Readability and Literary Quality in a Corpus of US-published Fiction. 42-51
- Maciej Janicki, Antti Kanner, Eetu Mäkelä:
Detection and attribution of quotes in Finnish news media: BERT vs. rule-based approach. 52-59
- Marina Björnsdóttir, Nora Hollenstein, Maria Barrett:
Dyslexia Prediction from Natural Reading of Danish Texts. 60-70
- Örvar Kárason, Hrafn Loftsson:
Is Part-of-Speech Tagging a Solved Problem for Icelandic? 71-79
- Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot, Barbara Plank:
Multi-CrossRE: A Multi-Lingual Multi-Domain Dataset for Relation Extraction. 80-85
- Tiina Lindh-Knuutila, Hrafn Loftsson, Pedro Alonso Doval, Sebastian Andersson, Bjarni Barkarson, Héctor Cerezo-Costas, Jón Guðnason, Jökull Gylfason, Jarmo Hemminki, Heiki-Jaan Kaalep:
Microservices at Your Service: Bridging the Gap between NLP Research and Industry. 86-91
- Xiulin Yang, Jingyan Chen, Arjan van Eerden, Ahnaf Mozib Samin, Arianna Bisazza:
Slaapte or Sliep? Extending Neural-Network Simulations of English Past Tense Learning to Dutch and German. 92-102
- Denitsa Saynova, Bastiaan Bruinsma, Moa Johansson, Richard Johansson:
Class Explanations: the Role of Domain-Specific Content and Stop Words. 103-112
- Daniel Holmer, Evelina Rennes:
Constructing Pseudo-parallel Swedish Sentence Corpora for Automatic Text Simplification. 113-123
- Daniel Holmer, Lars Ahrenberg, Julius Monsen, Arne Jönsson, Mikael Apel, Marianna Grimaldi:
Who said what? Speaker Identification from Anonymous Minutes of Meetings. 124-134
- Luise Dürlich, Evangelia Gogoulou, Joakim Nivre:
On the Concept of Resource-Efficiency in NLP. 135-145
- Jeremy Barnes, Samia Touileb, Petter Mæhlum, Pierre Lison:
Identifying Token-Level Dialectal Features in Social Media. 146-158
- Sardana Ivanova, Fredrik Aas Andreassen, Matias Jentoft, Sondre Wold, Lilja Øvrelid:
NorQuAD: Norwegian Question Answering Dataset. 159-168
- Carl Börstell:
Extracting Sign Language Articulation from Videos with MediaPipe. 169-178
- Kadri Muischnek, Kaili Müürisep:
Named Entity layer in Estonian UD treebanks. 179-184
- Dan Saattrup Nielsen:
ScandEval: A Benchmark for Scandinavian Natural Language Processing. 185-201
- Lucas Georges Gabriel Charpentier, Sondre Wold, David Samuel, Egil Rønningstad:
BRENT: Bidirectional Retrieval Enhanced Norwegian Transformer. 202-214
- Anastassia Shaitarova, Anne Göhring, Martin Volk:
Machine vs. Human: Exploring Syntax and Lexicon in German Translations, with a Spotlight on Anglicisms. 215-227
- Bernt Ivar Utstøl Nødland:
Training and Evaluating Norwegian Sentence Embedding Models. 228-237
- Michele Boggia, Stig-Arne Grönroos, Niki A. Loppi, Timothee Mickus, Alessandro Raganato, Jörg Tiedemann, Raúl Vázquez:
Dozens of Translation Directions or Millions of Shared Parameters? Comparing Two Types of Multilinguality in Modular Machine Translation. 238-247
- Sara Kolding, Katrine Nymann, Ida Hansen, Kenneth C. Enevoldsen, Ross Deans Kristensen-McLachlan:
DanSumT5: Automatic Abstractive Summarization for Danish. 248-264
- Nhan Phan, Tamás Grósz, Mikko Kurimo:
CaptainA - A mobile app for practising Finnish pronunciation. 265-270
- Kia Kirstein Hansen, Maria Barrett, Max Müller-Eberstein, Cathrine Damgaard, Trine Eriksen, Rob van der Goot:
DanTok: Domain Beats Language for Danish Social Media POS Tagging. 271-279
- Aleksei Dorkin, Kairit Sirts:
Comparison of Current Approaches to Lemmatization: A Case Study in Estonian. 280-285
- Atli Jasonarson, Steinþór Steingrímsson, Einar Freyr Sigurðsson, Árni Magnússon, Finnur Ingimundarson:
Generating Errors: OCR Post-Processing for Icelandic. 286-291
- Annika Willoch Olstad, Anthi Papadopoulou, Pierre Lison:
Generation of Replacement Options in Text Sanitization. 292-300
- Jannik S. Pedersen, Martin S. Laursen, Pernille J. Vinholt, Thiusius Rajeeth Savarimuthu:
MeDa-BERT: A medical Danish pretrained transformer model. 301-307
- Sandra Saxov Lamhauge, Iben Nyholm Debess, Carlos Daniel Hernandez Mena, Annika Simonsen, Jón Guðnason:
Standardising Pronunciation for a Grapheme-to-Phoneme Converter for Faroese. 308-317
- Thomas Vakili, Hercules Dalianis:
Using Membership Inference Attacks to Evaluate Privacy-Preserving Language Modeling Fails for Pseudonymizing Data. 318-323
- Ali Allaith, Kirstine Degn, Alexander Conroy, Bolette S. Pedersen, Jens Bjerring-Hansen, Daniel Hershcovich:
Sentiment Classification of Historical Danish and Norwegian Literary Texts. 324-334
- Sara Stymne, Carin Östman, David Håkansson:
Parser Evaluation for Analyzing Swedish 19th-20th Century Literature. 335-346
- Mehrdad Farahani, Richard Johansson:
An Empirical Study of Multitask Learning to Improve Open Domain Dialogue Systems. 347-357
- Aarne Talman, Hande Çelikkanat, Sami Virpioja, Markus Heinonen, Jörg Tiedemann:
Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging. 358-365
- Finn Nielsen:
Alignment of Wikidata lexemes and Det Centrale Ordregister. 366-370
- Ekaterina Artemova, Barbara Plank:
Low-resource Bilingual Dialect Lexicon Induction with Large Language Models. 371-385
- Anders Mølmen Høst, Pierre Lison, Leon Moonen:
Constructing a Knowledge Graph from Textual Descriptions of Software Vulnerabilities in the National Vulnerability Database. 386-391
- Verena Blaschke, Hinrich Schütze, Barbara Plank:
A Survey of Corpora for Germanic Low-Resource Languages and Dialects. 392-414
- Aleksandrs Berdicevskis, Viktor Erbro:
You say tomato, I say the same: A large-scale study of linguistic accommodation in online communities. 415-424
- Dag Trygve Truslew Haug, Ahmet Yildirim, Kristin Hagen, Anders Nøklestad:
Integrating rules and neural nets for morphological tagging of Norwegian - Results and challenges. 425-435
- Laurent Prévot, Julie Hunter, Philippe Muller:
Comparing Methods for Segmenting Elementary Discourse Units in a French Conversational Corpus. 436-446
- José Carlos Rosales Núñez, Djamé Seddah, Guillaume Wisniewski:
Multi-way Variational NMT for UGC: Improving Robustness in Zero-shot Scenarios via Mixture Density Networks. 447-459
- Rafal Cerniavski, Sara Stymne:
Multilingual Automatic Speech Recognition for Scandinavian Languages. 460-466
- Phoebe Parsons, Knut Kvale, Torbjørn Svendsen, Giampiero Salvi:
A character-based analysis of impacts of dialects on end-to-end Norwegian ASR. 467-476
- Dmytro Kalpakchi, Johan Boye:
Quasi: a synthetic Question-Answering dataset in Swedish using GPT-3 and zero-shot learning. 477-491
- Tanel Alumäe, Joonas Kalda, Külliki Bode, Martin Kaitsa:
Automatic Closed Captioning for Estonian Live Broadcasts. 492-499
- Steinunn Rut Friðriksdóttir, Hafsteinn Einarsson:
The Effect of Data Encoding on Relation Triplet Identification. 500-507
- Per Erik Solberg, Pablo Ortiz, Phoebe Parsons, Torbjørn Svendsen, Giampiero Salvi:
Improving Generalization of Norwegian ASR with Limited Linguistic Resources. 508-517
- Wei Zhou, Nina Tahmasebi, Haim Dubossarsky:
The Finer They Get: Combining Fine-Tuned Models For Better Semantic Change Detection. 518-528
- Ilmari Kylliäinen, Roman Yangarber:
Question Answering and Question Generation for Finnish. 529-540
- David Kletz, Marie Candito, Pascal Amsili:
Probing structural constraints of negation in Pretrained Language Models. 541-554
- Javier de la Rosa, Rolv-Arild Braaten, Per Egil Kummervold, Freddy Wetjen:
Boosting Norwegian Automatic Speech Recognition. 555-564
- Niklas Zechner:
Length Dependence of Vocabulary Richness. 565-573
- Arianna Masciolini:
A query engine for L1-L2 parallel dependency treebanks. 574-587
- Steinþór Steingrímsson, Hrafn Loftsson, Andy Way:
Filtering Matters: Experiments in Filtering Training Sets for Machine Translation. 588-600
- Luke O'Brien, Finnur Ingimundarson, Jón Guðnason, Steinþór Steingrímsson:
Gamli - Icelandic Oral History Corpus: Design, Collection and Evaluation. 601-609
- Matias Jentoft, David Samuel:
NoCoLA: The Norwegian Corpus of Linguistic Acceptability. 610-617
- David Samuel, Andrey Kutuzov, Samia Touileb, Erik Velldal, Lilja Øvrelid, Egil Rønningstad, Elina Sigdel, Anna Palatkina:
NorBench - A Benchmark for Norwegian Language Models. 618-633
- Oskar Holmström, Ehsan Doostmohammadi:
Making Instruction Finetuning Accessible to Non-English Languages: A Case Study on Swedish Models. 634-642
- Flammie A. Pirinen, Sjur N. Moshagen, Katri Hiovain-Asikainen:
GiellaLT - a stable infrastructure for Nordic minority languages and beyond. 643-649
- Kristjan Runarsson, Kristín Bjarnadóttir:
Adapting an Icelandic morphological database to Faroese. 650-654
- Martin S. Laursen, Jannik S. Pedersen, Rasmus Søgaard Hansen, Thiusius Rajeeth Savarimuthu, Pernille J. Vinholt:
Danish Clinical Named Entity Recognition and Relation Extraction. 655-666
- Dimitrios Kokkinakis, Ricardo Muñoz Sánchez, Mia-Marie Hammarlin:
Scaling-up the Resources for a Freely Available Swedish VADER (svVADER). 667-672
- Yiyi Chen, Russa Biswas, Johannes Bjerva:
Colex2Lang: Language Embeddings from Semantic Typology. 673-684
- Anni Eskelinen, Laura Silvala, Filip Ginter, Sampo Pyysalo, Veronika Laippala:
Toxicity Detection in Finnish Using Machine Translation. 685-697
- Thórunn Arnardóttir, Hinrik Hafsteinsson, Atli Jasonarson, Anton Ingason, Steinþór Steingrímsson:
Evaluating a Universal Dependencies Conversion Pipeline for Icelandic. 698-704
- Agnes Luhtaru, Rauno Jaaska, Karl Kruusamäe, Mark Fishel:
Automatic Transcription for Estonian Children's Speech. 705-709
- Hele-Andra Kuulmets, Mark Fishel:
Translated Benchmarks Can Be Misleading: the Case of Estonian Question Answering. 710-716
- Peter Vajdecka, Elena Callegari, Desara Xhura, Atli Ásmundsson:
Predicting the presence of inline citations in academic text using binary classification. 717-722
- Liisa Rätsep, Mark Fishel:
Neural Text-to-Speech Synthesis for Võro. 723-727
- Vésteinn Snæbjarnarson, Annika Simonsen, Goran Glavas, Ivan Vulic:
Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese. 728-737
- Anssi Moisio, Mathias Creutz, Mikko Kurimo:
Evaluating Morphological Generalisation in Machine Translation by Distribution-Based Compositionality Assessment. 738-751
- Kairit Sirts:
Estonian Named Entity Recognition: New Datasets and Models. 752-761
- Lisa Yankovskaya, Maali Tars, Andre Tättar, Mark Fishel:
Machine Translation for Low-resource Finno-Ugric Languages. 762-771
- Elizaveta Korotkova, Mark Fishel:
Distilling Estonian Text Domains for Production-Oriented Machine Translation. 772-781
- Kais Allkivi-Metsoja, Jaagup Kippar:
Spelling Correction for Estonian Learner Language. 782-788