May 5, 2019

1669 words 8 mins read

Paper Group NANR 61

Paper Group NANR 61

DALILA: The Dialectal Arabic Linguistic Learning Assistant. Reducing lexical complexity as a tool to increase text accessibility for children with dyslexia. YZU-NLP Team at SemEval-2016 Task 4: Ordinal Sentiment Classification Using a Recurrent Convolutional Network. Korean TimeML and Korean TimeBank. Parameter estimation of Japanese predicate argu …

DALILA: The Dialectal Arabic Linguistic Learning Assistant

Title DALILA: The Dialectal Arabic Linguistic Learning Assistant
Authors Salam Khalifa, Houda Bouamor, Nizar Habash
Abstract Dialectal Arabic (DA) poses serious challenges for Natural Language Processing (NLP). The number and sophistication of tools and datasets in DA are very limited in comparison to Modern Standard Arabic (MSA) and other languages. MSA tools do not effectively model DA which makes the direct use of MSA NLP tools for handling dialects impractical. This is particularly a challenge for the creation of tools to support learning Arabic as a living language on the web, where authentic material can be found in both MSA and DA. In this paper, we present the Dialectal Arabic Linguistic Learning Assistant (DALILA), a Chrome extension that utilizes cutting-edge Arabic dialect NLP research to assist learners and non-native speakers in understanding text written in either MSA or DA. DALILA provides dialectal word analysis and English gloss corresponding to each word.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1175/
PDF https://www.aclweb.org/anthology/L16-1175
PWC https://paperswithcode.com/paper/dalila-the-dialectal-arabic-linguistic
Repo
Framework

Reducing lexical complexity as a tool to increase text accessibility for children with dyslexia

Title Reducing lexical complexity as a tool to increase text accessibility for children with dyslexia
Authors N{'u}ria Gala, Johannes Ziegler
Abstract Lexical complexity plays a central role in readability, particularly for dyslexic children and poor readers because of their slow and laborious decoding and word recognition skills. Although some features to aid readability may be common to most languages (e.g., the majority of {`}easy{'} words are of low frequency), we believe that lexical complexity is mainly language-specific. In this paper, we define lexical complexity for French and we present a pilot study on the effects of text simplification in dyslexic children. The participants were asked to read out loud original and manually simplified versions of a standardized French text corpus and to answer comprehension questions after reading each text. The analysis of the results shows that the simplifications performed were beneficial in terms of reading speed and they reduced the number of reading errors (mainly lexical ones) without a loss in comprehension. Although the number of participants in this study was rather small (N=10), the results are promising and contribute to the development of applications in computational linguistics. |
Tasks Reading Comprehension, Text Simplification
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4107/
PDF https://www.aclweb.org/anthology/W16-4107
PWC https://paperswithcode.com/paper/reducing-lexical-complexity-as-a-tool-to
Repo
Framework

YZU-NLP Team at SemEval-2016 Task 4: Ordinal Sentiment Classification Using a Recurrent Convolutional Network

Title YZU-NLP Team at SemEval-2016 Task 4: Ordinal Sentiment Classification Using a Recurrent Convolutional Network
Authors Yunchao He, Liang-Chih Yu, Chin-Sheng Yang, K. Robert Lai, Weiyi Liu
Abstract
Tasks Sentiment Analysis, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1039/
PDF https://www.aclweb.org/anthology/S16-1039
PWC https://paperswithcode.com/paper/yzu-nlp-team-at-semeval-2016-task-4-ordinal
Repo
Framework

Korean TimeML and Korean TimeBank

Title Korean TimeML and Korean TimeBank
Authors Young-Seob Jeong, Won-Tae Joo, Hyun-Woo Do, Chae-Gyun Lim, Key-Sun Choi, Ho-Jin Choi
Abstract Many emerging documents usually contain temporal information. Because the temporal information is useful for various applications, it became important to develop a system of extracting the temporal information from the documents. Before developing the system, it first necessary to define or design the structure of temporal information. In other words, it is necessary to design a language which defines how to annotate the temporal information. There have been some studies about the annotation languages, but most of them was applicable to only a specific target language (e.g., English). Thus, it is necessary to design an individual annotation language for each language. In this paper, we propose a revised version of Koreain Time Mark-up Language (K-TimeML), and also introduce a dataset, named Korean TimeBank, that is constructed basd on the K-TimeML. We believe that the new K-TimeML and Korean TimeBank will be used in many further researches about extraction of temporal information.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1055/
PDF https://www.aclweb.org/anthology/L16-1055
PWC https://paperswithcode.com/paper/korean-timeml-and-korean-timebank
Repo
Framework

Parameter estimation of Japanese predicate argument structure analysis model using eye gaze information

Title Parameter estimation of Japanese predicate argument structure analysis model using eye gaze information
Authors Ryosuke Maki, Hitoshi Nishikawa, Takenobu Tokunaga
Abstract In this paper, we propose utilising eye gaze information for estimating parameters of a Japanese predicate argument structure (PAS) analysis model. We employ not only linguistic information in the text, but also the information of annotator eye gaze during their annotation process. We hypothesise that annotator{'}s frequent looks at certain candidates imply their plausibility of being the argument of the predicate. Based on this hypothesis, we consider annotator eye gaze for estimating the model parameters of the PAS analysis. The evaluation experiment showed that introducing eye gaze information increased the accuracy of the PAS analysis by 0.05 compared with the conventional methods.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1269/
PDF https://www.aclweb.org/anthology/C16-1269
PWC https://paperswithcode.com/paper/parameter-estimation-of-japanese-predicate
Repo
Framework

Enhancing Access to Online Education: Quality Machine Translation of MOOC Content

Title Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
Authors Valia Kordoni, Antal van den Bosch, Katia Lida Kermanidis, Vilelmini Sosoni, Kostadin Cholakov, Iris Hendrickx, Matthias Huck, Andy Way
Abstract The present work is an overview of the TraMOOC (Translation for Massive Open Online Courses) research and innovation project, a machine translation approach for online educational content. More specifically, videolectures, assignments, and MOOC forum text is automatically translated from English into eleven European and BRIC languages. Unlike previous approaches to machine translation, the output quality in TraMOOC relies on a multimodal evaluation schema that involves crowdsourcing, error type markup, an error taxonomy for translation model comparison, and implicit evaluation via text mining, i.e. entity recognition and its performance comparison between the source and the translated text, and sentiment analysis on the students{'} forum posts. Finally, the evaluation output will result in more and better quality in-domain parallel data that will be fed back to the translation engine for higher quality output. The translation service will be incorporated into the Iversity MOOC platform and into the VideoLectures.net digital library portal.
Tasks Machine Translation, Sentiment Analysis
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1003/
PDF https://www.aclweb.org/anthology/L16-1003
PWC https://paperswithcode.com/paper/enhancing-access-to-online-education-quality
Repo
Framework

Topicality-Based Indices for Essay Scoring

Title Topicality-Based Indices for Essay Scoring
Authors Beata Beigman Klebanov, Michael Flor, Binod Gyawali
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0507/
PDF https://www.aclweb.org/anthology/W16-0507
PWC https://paperswithcode.com/paper/topicality-based-indices-for-essay-scoring
Repo
Framework

AmritaCEN at SemEval-2016 Task 11: Complex Word Identification using Word Embedding

Title AmritaCEN at SemEval-2016 Task 11: Complex Word Identification using Word Embedding
Authors Sanjay S.P, An Kumar M, , Soman K P
Abstract
Tasks Complex Word Identification, Lexical Simplification, Text Simplification
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1159/
PDF https://www.aclweb.org/anthology/S16-1159
PWC https://paperswithcode.com/paper/amritacen-at-semeval-2016-task-11-complex
Repo
Framework

Improving the Annotation of Sentence Specificity

Title Improving the Annotation of Sentence Specificity
Authors Junyi Jessy Li, Bridget O{'}Daniel, Yi Wu, Wenli Zhao, Ani Nenkova
Abstract We introduce improved guidelines for annotation of sentence specificity, addressing the issues encountered in prior work. Our annotation provides judgements of sentences in context. Rather than binary judgements, we introduce a specificity scale which accommodates nuanced judgements. Our augmented annotation procedure also allows us to define where in the discourse context the lack of specificity can be resolved. In addition, the cause of the underspecification is annotated in the form of free text questions. We present results from a pilot annotation with this new scheme and demonstrate good inter-annotator agreement. We found that the lack of specificity distributes evenly among immediate prior context, long distance prior context and no prior context. We find that missing details that are not resolved in the the prior context are more likely to trigger questions about the reason behind events, {}why{''} and {}how{''}. Our data is accessible at http://www.cis.upenn.edu/{\textasciitilde}nlp/corpora/lrec16spec.html
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1620/
PDF https://www.aclweb.org/anthology/L16-1620
PWC https://paperswithcode.com/paper/improving-the-annotation-of-sentence
Repo
Framework

Domain-Specific Corpus Expansion with Focused Webcrawling

Title Domain-Specific Corpus Expansion with Focused Webcrawling
Authors Steffen Remus, Chris Biemann
Abstract This work presents a straightforward method for extending or creating in-domain web corpora by focused webcrawling. The focused webcrawler uses statistical N-gram language models to estimate the relatedness of documents and weblinks and needs as input only N-grams or plain texts of a predefined domain and seed URLs as starting points. Two experiments demonstrate that our focused crawler is able to stay focused in domain and language. The first experiment shows that the crawler stays in a focused domain, the second experiment demonstrates that language models trained on focused crawls obtain better perplexity scores on in-domain corpora. We distribute the focused crawler as open source software.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1572/
PDF https://www.aclweb.org/anthology/L16-1572
PWC https://paperswithcode.com/paper/domain-specific-corpus-expansion-with-focused
Repo
Framework

LIMSI-COT at SemEval-2016 Task 12: Temporal relation identification using a pipeline of classifiers

Title LIMSI-COT at SemEval-2016 Task 12: Temporal relation identification using a pipeline of classifiers
Authors Julien Tourille, Olivier Ferret, Aur{'e}lie N{'e}v{'e}ol, Xavier Tannier
Abstract
Tasks Entity Extraction, Relation Extraction, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1175/
PDF https://www.aclweb.org/anthology/S16-1175
PWC https://paperswithcode.com/paper/limsi-cot-at-semeval-2016-task-12-temporal
Repo
Framework

UofR at SemEval-2016 Task 8: Learning Synchronous Hyperedge Replacement Grammar for AMR Parsing

Title UofR at SemEval-2016 Task 8: Learning Synchronous Hyperedge Replacement Grammar for AMR Parsing
Authors Xiaochang Peng, Daniel Gildea
Abstract
Tasks Amr Parsing, Machine Translation, Question Answering
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1183/
PDF https://www.aclweb.org/anthology/S16-1183
PWC https://paperswithcode.com/paper/uofr-at-semeval-2016-task-8-learning
Repo
Framework

The Meaning Factory at SemEval-2016 Task 8: Producing AMRs with Boxer

Title The Meaning Factory at SemEval-2016 Task 8: Producing AMRs with Boxer
Authors Johannes Bjerva, Johan Bos, Hessel Haagsma
Abstract
Tasks Semantic Parsing
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1182/
PDF https://www.aclweb.org/anthology/S16-1182
PWC https://paperswithcode.com/paper/the-meaning-factory-at-semeval-2016-task-8
Repo
Framework

GUIR at SemEval-2016 task 12: Temporal Information Processing for Clinical Narratives

Title GUIR at SemEval-2016 task 12: Temporal Information Processing for Clinical Narratives
Authors Arman Cohan, Kevin Meurer, Nazli Goharian
Abstract
Tasks Information Retrieval
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1194/
PDF https://www.aclweb.org/anthology/S16-1194
PWC https://paperswithcode.com/paper/guir-at-semeval-2016-task-12-temporal
Repo
Framework

KULeuven-LIIR at SemEval 2016 Task 12: Detecting Narrative Containment in Clinical Records

Title KULeuven-LIIR at SemEval 2016 Task 12: Detecting Narrative Containment in Clinical Records
Authors Artuur Leeuwenberg, Marie-Francine Moens
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1199/
PDF https://www.aclweb.org/anthology/S16-1199
PWC https://paperswithcode.com/paper/kuleuven-liir-at-semeval-2016-task-12
Repo
Framework
comments powered by Disqus