May 5, 2019

1591 words 8 mins read

Paper Group NANR 17

Context-enhanced Adaptive Entity Linking. Langforia: Language Pipelines for Annotating Large Collections of Documents. New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian. UtahBMI at SemEval-2016 Task 12: Extracting Temporal Information from Clinical Text. Morphological Complexity Influence …

Context-enhanced Adaptive Entity Linking


Title	Context-enhanced Adaptive Entity Linking
Authors	Filip Ilievski, Giuseppe Rizzo, Marieke van Erp, Julien Plu, Rapha{"e}l Troncy
Abstract	More and more knowledge bases are publicly available as linked data. Since these knowledge bases contain structured descriptions of real-world entities, they can be exploited by entity linking systems that anchor entity mentions from text to the most relevant resources describing those entities. In this paper, we investigate adaptation of the entity linking task using contextual knowledge. The key intuition is that entity linking can be customized depending on the textual content, as well as on the application that would make use of the extracted information. We present an adaptive approach that relies on contextual knowledge from text to enhance the performance of ADEL, a hybrid linguistic and graph-based entity linking system. We evaluate our approach on a domain-specific corpus consisting of annotated WikiNews articles.
Tasks	Entity Linking
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1086/
PDF	https://www.aclweb.org/anthology/L16-1086
PWC	https://paperswithcode.com/paper/context-enhanced-adaptive-entity-linking
Repo
Framework

Langforia: Language Pipelines for Annotating Large Collections of Documents


Title	Langforia: Language Pipelines for Annotating Large Collections of Documents
Authors	Marcus Klang, Pierre Nugues
Abstract	In this paper, we describe \textbf{Langforia}, a multilingual processing pipeline to annotate texts with multiple layers: formatting, parts of speech, named entities, dependencies, semantic roles, and entity links. Langforia works as a web service, where the server hosts the language processing components and the client, the input and result visualization. To annotate a text or a Wikipedia page, the user chooses an NLP pipeline and enters the text in the interface or selects the page URL. Once processed, the results are returned to the client, where the user can select the annotation layers s/he wants to visualize. We designed Langforia with a specific focus for Wikipedia, although it can process any type of text. Wikipedia has become an essential encyclopedic corpus used in many NLP projects. However, processing articles and visualizing the annotations are nontrivial tasks that require dealing with multiple markup variants, encodings issues, and tool incompatibilities across the language versions. This motivated the development of a new architecture. A demonstration of Langforia is available for six languages: English, French, German, Spanish, Russian, and Swedish at \url{http://vilde.cs.lth.se:9000/} as well as a web API: \url{http://vilde.cs.lth.se:9000/api}. Langforia is also provided as a standalone library and is compatible with cluster computing.
Tasks	Dependency Parsing, Entity Linking, Question Answering, Semantic Role Labeling, Tokenization
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2016/
PDF	https://www.aclweb.org/anthology/C16-2016
PWC	https://paperswithcode.com/paper/langforia-language-pipelines-for-annotating
Repo
Framework

New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian


Title	New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian
Authors	Nikola Ljube{\v{s}}i{'c}, Filip Klubi{\v{c}}ka, {\v{Z}}eljko Agi{'c}, Ivo-Pavao Jazbec
Abstract	In this paper we present newly developed inflectional lexcions and manually annotated corpora of Croatian and Serbian. We introduce hrLex and srLex - two freely available inflectional lexicons of Croatian and Serbian - and describe the process of building these lexicons, supported by supervised machine learning techniques for lemma and paradigm prediction. Furthermore, we introduce hr500k, a manually annotated corpus of Croatian, 500 thousand tokens in size. We showcase the three newly developed resources on the task of morphosyntactic annotation of both languages by using a recently developed CRF tagger. We achieve best results yet reported on the task for both languages, beating the HunPos baseline trained on the same datasets by a wide margin.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1676/
PDF	https://www.aclweb.org/anthology/L16-1676
PWC	https://paperswithcode.com/paper/new-inflectional-lexicons-and-training
Repo
Framework

UtahBMI at SemEval-2016 Task 12: Extracting Temporal Information from Clinical Text


Title	UtahBMI at SemEval-2016 Task 12: Extracting Temporal Information from Clinical Text
Authors	Abdulrahman Khalifa, Sumithra Velupillai, Stephane Meystre
Abstract
Tasks	Decision Making, Temporal Information Extraction
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1195/
PDF	https://www.aclweb.org/anthology/S16-1195
PWC	https://paperswithcode.com/paper/utahbmi-at-semeval-2016-task-12-extracting
Repo
Framework

Morphological Complexity Influences Verb-Object Order in Swedish Sign Language


Title	Morphological Complexity Influences Verb-Object Order in Swedish Sign Language
Authors	Johannes Bjerva, Carl B{"o}rstell
Abstract	Computational linguistic approaches to sign languages could benefit from investigating how complexity influences structure. We investigate whether morphological complexity has an effect on the order of Verb (V) and Object (O) in Swedish Sign Language (SSL), on the basis of elicited data from five Deaf signers. We find a significant difference in the distribution of the orderings OV vs. VO, based on an analysis of morphological weight. While morphologically heavy verbs exhibit a general preference for OV, humanness seems to affect the ordering in the opposite direction, with [+human] Objects pushing towards a preference for VO.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4116/
PDF	https://www.aclweb.org/anthology/W16-4116
PWC	https://paperswithcode.com/paper/morphological-complexity-influences-verb
Repo
Framework

CENTAL at SemEval-2016 Task 12: a linguistically fed CRF model for medical and temporal information extraction


Title	CENTAL at SemEval-2016 Task 12: a linguistically fed CRF model for medical and temporal information extraction
Authors	Charlotte Hansart, Damien De Meyere, Patrick Watrin, Andr{'e} Bittar, C{'e}drick Fairon
Abstract
Tasks	Part-Of-Speech Tagging, Temporal Information Extraction
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1200/
PDF	https://www.aclweb.org/anthology/S16-1200
PWC	https://paperswithcode.com/paper/cental-at-semeval-2016-task-12-a
Repo
Framework

IRIS: English-Irish Machine Translation System


Title	IRIS: English-Irish Machine Translation System
Authors	Mihael Arcan, Caoilfhionn Lane, Eoin {'O} Droighne{'a}in, Paul Buitelaar
Abstract	We describe IRIS, a statistical machine translation (SMT) system for translating from English into Irish and vice versa. Since Irish is considered an under-resourced language with a limited amount of machine-readable text, building a machine translation system that produces reasonable translations is rather challenging. As translation is a difficult task, current research in SMT focuses on obtaining statistics either from a large amount of parallel, monolingual or other multilingual resources. Nevertheless, we collected available English-Irish data and developed an SMT system aimed at supporting human translators and enabling cross-lingual language technology tasks.
Tasks	Machine Translation
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1090/
PDF	https://www.aclweb.org/anthology/L16-1090
PWC	https://paperswithcode.com/paper/iris-english-irish-machine-translation-system
Repo
Framework

A Computational Analysis of Mahabharata


Title	A Computational Analysis of Mahabharata
Authors	Debarati Das, Bhaskarjyoti Das, Kavi Mahesh
Abstract
Tasks	Emotion Recognition
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-6328/
PDF	https://www.aclweb.org/anthology/W16-6328
PWC	https://paperswithcode.com/paper/a-computational-analysis-of-mahabharata
Repo
Framework

It Takes Three to Tango: Triangulation Approach to Answer Ranking in Community Question Answering


Title	It Takes Three to Tango: Triangulation Approach to Answer Ranking in Community Question Answering
Authors	Preslav Nakov, Llu{'\i}s M{`a}rquez, Francisco Guzm{'a}n
Abstract
Tasks	Community Question Answering, Question Answering
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1165/
PDF	https://www.aclweb.org/anthology/D16-1165
PWC	https://paperswithcode.com/paper/it-takes-three-to-tango-triangulation
Repo
Framework

Cancer Hallmark Text Classification Using Convolutional Neural Networks


Title	Cancer Hallmark Text Classification Using Convolutional Neural Networks
Authors	Simon Baker, Anna Korhonen, Sampo Pyysalo
Abstract	Methods based on deep learning approaches have recently achieved state-of-the-art performance in a range of machine learning tasks and are increasingly applied to natural language processing (NLP). Despite strong results in various established NLP tasks involving general domain texts, there is only limited work applying these models to biomedical NLP. In this paper, we consider a Convolutional Neural Network (CNN) approach to biomedical text classification. Evaluation using a recently introduced cancer domain dataset involving the categorization of documents according to the well-established hallmarks of cancer shows that a basic CNN model can achieve a level of performance competitive with a Support Vector Machine (SVM) trained using complex manually engineered features optimized to the task. We further show that simple modifications to the CNN hyperparameters, initialization, and training process allow the model to notably outperform the SVM, establishing a new state of the art result at this task. We make all of the resources and tools introduced in this study available under open licenses from \url{https://cambridgeltl.github.io/cancer-hallmark-cnn/}.
Tasks	Text Classification
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-5101/
PDF	https://www.aclweb.org/anthology/W16-5101
PWC	https://paperswithcode.com/paper/cancer-hallmark-text-classification-using
Repo
Framework

Global Inference to Chinese Temporal Relation Extraction


Title	Global Inference to Chinese Temporal Relation Extraction
Authors	Peifeng Li, Qiaoming Zhu, Guodong Zhou, Hongling Wang
Abstract	Previous studies on temporal relation extraction focus on mining sentence-level information or enforcing coherence on different temporal relation types among various event mentions in the same sentence or neighboring sentences, largely ignoring those discourse-level temporal relations in nonadjacent sentences. In this paper, we propose a discourse-level global inference model to mine those temporal relations between event mentions in document-level, especially in nonadjacent sentences. Moreover, we provide various kinds of discourse-level constraints, which derived from event semantics, to further improve our global inference model. Evaluation on a Chinese corpus justifies the effectiveness of our discourse-level global inference model over two strong baselines.
Tasks	Question Answering, Relation Extraction, Text Generation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1137/
PDF	https://www.aclweb.org/anthology/C16-1137
PWC	https://paperswithcode.com/paper/global-inference-to-chinese-temporal-relation
Repo
Framework

NUIG-UNLP at SemEval-2016 Task 13: A Simple Word Embedding-based Approach for Taxonomy Extraction


Title	NUIG-UNLP at SemEval-2016 Task 13: A Simple Word Embedding-based Approach for Taxonomy Extraction
Authors	Joel Pocostales
Abstract
Tasks	Natural Language Inference, Question Answering, Text Summarization
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1202/
PDF	https://www.aclweb.org/anthology/S16-1202
PWC	https://paperswithcode.com/paper/nuig-unlp-at-semeval-2016-task-13-a-simple
Repo
Framework

On Why Coarse Class Classification is Bottleneck in Noun Compound Interpretation


Title	On Why Coarse Class Classification is Bottleneck in Noun Compound Interpretation
Authors	Girishkumar Ponkiya, Pushpak Bhattacharyya, Girish K. Palshikar
Abstract
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-6336/
PDF	https://www.aclweb.org/anthology/W16-6336
PWC	https://paperswithcode.com/paper/on-why-coarse-class-classification-is
Repo
Framework

Verbframator:Semi-Automatic Verb Frame Annotator Tool with Special Reference to Marathi


Title	Verbframator:Semi-Automatic Verb Frame Annotator Tool with Special Reference to Marathi
Authors	Hanumant Redkar, S Singh, hya, N Ghag, ini, Jai Paranjape, Nilesh Joshi, Malhar Kulkarni, Pushpak Bhattacharyya
Abstract
Tasks	Machine Translation, Text Generation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-6337/
PDF	https://www.aclweb.org/anthology/W16-6337
PWC	https://paperswithcode.com/paper/verbframatorsemi-automatic-verb-frame
Repo
Framework

UMNDuluth at SemEval-2016 Task 14: WordNet’s Missing Lemmas


Title	UMNDuluth at SemEval-2016 Task 14: WordNet’s Missing Lemmas
Authors	Jon Rusert, Ted Pedersen
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1211/
PDF	https://www.aclweb.org/anthology/S16-1211
PWC	https://paperswithcode.com/paper/umnduluth-at-semeval-2016-task-14-wordnets
Repo
Framework