May 5, 2019

1389 words 7 mins read

Paper Group NANR 123

Paper Group NANR 123

Evaluating Context Selection Strategies to Build Emotive Vector Space Models. Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms. Universal Dependencies v1: A Multilingual Treebank Collection. Design of an Input Method for Taiwanese Hokkien using Unsupervized Word Segmentation for Language Modeling. Evaluating embeddings …

Evaluating Context Selection Strategies to Build Emotive Vector Space Models

Title Evaluating Context Selection Strategies to Build Emotive Vector Space Models
Authors Lucia C. Passaro, Aless Lenci, ro
Abstract In this paper we compare different context selection approaches to improve the creation of Emotive Vector Space Models (VSMs). The system is based on the results of an existing approach that showed the possibility to create and update VSMs by exploiting crowdsourcing and human annotation. Here, we introduce a method to manipulate the contexts of the VSMs under the assumption that the emotive connotation of a target word is a function of both its syntagmatic and paradigmatic association with the various emotions. To study the differences among the proposed spaces and to confirm the reliability of the system, we report on two experiments: in the first one we validated the best candidates extracted from each model, and in the second one we compared the models{'} performance on a random sample of target words. Both experiments have been implemented as crowdsourcing tasks.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1347/
PDF https://www.aclweb.org/anthology/L16-1347
PWC https://paperswithcode.com/paper/evaluating-context-selection-strategies-to
Repo
Framework

Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms

Title Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms
Authors Guillaume Jacquet, Maud Ehrmann, Ralf Steinberger, Jaakko V{"a}yrynen
Abstract This paper reports on an approach and experiments to automatically build a cross-lingual multi-word entity resource. Starting from a collection of millions of acronym/expansion pairs for 22 languages where expansion variants were grouped into monolingual clusters, we experiment with several aggregation strategies to link these clusters across languages. Aggregation strategies make use of string similarity distances and translation probabilities and they are based on vector space and graph representations. The accuracy of the approach is evaluated against Wikipedia{'}s redirection and cross-lingual linking tables. The resulting multi-word entity resource contains 64,000 multi-word entities with unique identifiers and their 600,000 multilingual lexical variants. We intend to make this new resource publicly available.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1084/
PDF https://www.aclweb.org/anthology/L16-1084
PWC https://paperswithcode.com/paper/cross-lingual-linking-of-multi-word-entities
Repo
Framework

Universal Dependencies v1: A Multilingual Treebank Collection

Title Universal Dependencies v1: A Multilingual Treebank Collection
Authors Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Haji{\v{c}}, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman
Abstract Cross-linguistically consistent annotation is necessary for sound comparative evaluation and cross-lingual learning experiments. It is also useful for multilingual system development and comparative linguistic studies. Universal Dependencies is an open community effort to create cross-linguistically consistent treebank annotation for many languages within a dependency-based lexicalist framework. In this paper, we describe v1 of the universal guidelines, the underlying design principles, and the currently available treebanks for 33 languages.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1262/
PDF https://www.aclweb.org/anthology/L16-1262
PWC https://paperswithcode.com/paper/universal-dependencies-v1-a-multilingual
Repo
Framework

Design of an Input Method for Taiwanese Hokkien using Unsupervized Word Segmentation for Language Modeling

Title Design of an Input Method for Taiwanese Hokkien using Unsupervized Word Segmentation for Language Modeling
Authors Pierre Magistry
Abstract
Tasks Language Modelling
Published 2016-10-01
URL https://www.aclweb.org/anthology/O16-1026/
PDF https://www.aclweb.org/anthology/O16-1026
PWC https://paperswithcode.com/paper/design-of-an-input-method-for-taiwanese
Repo
Framework

Evaluating embeddings on dictionary-based similarity

Title Evaluating embeddings on dictionary-based similarity
Authors Judit {'A}cs, Andr{'a}s Kornai
Abstract
Tasks Information Retrieval, Word Sense Disambiguation
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2514/
PDF https://www.aclweb.org/anthology/W16-2514
PWC https://paperswithcode.com/paper/evaluating-embeddings-on-dictionary-based
Repo
Framework

Learning Orthographic Features in Bi-directional LSTM for Biomedical Named Entity Recognition

Title Learning Orthographic Features in Bi-directional LSTM for Biomedical Named Entity Recognition
Authors Nut Limsopatham, Nigel Collier
Abstract End-to-end neural network models for named entity recognition (NER) have shown to achieve effective performances on general domain datasets (e.g. newswire), without requiring additional hand-crafted features. However, in biomedical domain, recent studies have shown that hand-engineered features (e.g. orthographic features) should be used to attain effective performance, due to the complexity of biomedical terminology (e.g. the use of acronyms and complex gene names). In this work, we propose a novel approach that allows a neural network model based on a long short-term memory (LSTM) to automatically learn orthographic features and incorporate them into a model for biomedical NER. Importantly, our bi-directional LSTM model learns and leverages orthographic features on an end-to-end basis. We evaluate our approach by comparing against existing neural network models for NER using three well-established biomedical datasets. Our experimental results show that the proposed approach consistently outperforms these strong baselines across all of the three datasets.
Tasks Feature Engineering, Named Entity Recognition, Word Embeddings
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-5102/
PDF https://www.aclweb.org/anthology/W16-5102
PWC https://paperswithcode.com/paper/learning-orthographic-features-in-bi
Repo
Framework

When Annotation Schemes Change Rules Help: A Configurable Approach to Coreference Resolution beyond OntoNotes

Title When Annotation Schemes Change Rules Help: A Configurable Approach to Coreference Resolution beyond OntoNotes
Authors Amir Zeldes, Shuo Zhang
Abstract
Tasks Coreference Resolution, Domain Adaptation
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0713/
PDF https://www.aclweb.org/anthology/W16-0713
PWC https://paperswithcode.com/paper/when-annotation-schemes-change-rules-help-a
Repo
Framework

Crowdsourced Corpus with Entity Salience Annotations

Title Crowdsourced Corpus with Entity Salience Annotations
Authors Milan Dojchinovski, Dinesh Reddy, Tom{'a}{\v{s}} Kliegr, Tom{'a}{\v{s}} Vitvar, Harald Sack
Abstract In this paper, we present a crowdsourced dataset which adds entity salience (importance) annotations to the Reuters-128 dataset, which is subset of Reuters-21578. The dataset is distributed under a free license and publish in the NLP Interchange Format, which fosters interoperability and re-use. We show the potential of the dataset on the task of learning an entity salience classifier and report on the results from several experiments.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1527/
PDF https://www.aclweb.org/anthology/L16-1527
PWC https://paperswithcode.com/paper/crowdsourced-corpus-with-entity-salience
Repo
Framework

Improving Temporal Relation Extraction with Training Instance Augmentation

Title Improving Temporal Relation Extraction with Training Instance Augmentation
Authors Chen Lin, Timothy Miller, Dmitriy Dligach, Steven Bethard, Guergana Savova
Abstract
Tasks Data Augmentation, Question Answering, Relation Extraction, Temporal Information Extraction
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2914/
PDF https://www.aclweb.org/anthology/W16-2914
PWC https://paperswithcode.com/paper/improving-temporal-relation-extraction-with
Repo
Framework
Title Predict Anchor Links across Social Networks via an Embedding Approach
Authors Tong Man, Huawei Shen, Shenghua Liu, Xiaolong Jin, and Xueqi Cheng
Abstract Predicting anchor links across social networks has important implications to an array of applications, including cross-network information diffusion and cross-domain recommendation. One challenging problem is: whether and to what extent we can address the anchor link prediction problem, if only structural information of networks is available. Most existing methods, unsupervised or supervised, directly work on networks themselves rather than on their intrinsic structural regularities, and thus their effectiveness is sensitive to the high dimension and sparsity of networks. To offer a robust method, we propose a novel supervised model, called PALE, which employs network embedding with awareness of observed anchor links as supervised information to capture the major and specific structural regularities and further learns a stable cross-network mapping for predicting anchor links. Through extensive experiments on two realistic datasets, we demonstrate that PALE significantly outperforms the state-of-the-art methods
Tasks Link Prediction, Network Embedding
Published 2016-06-25
URL http://www.bigdatalab.ac.cn/~shenhuawei/publications/2016/ijcai-man.pdf
PDF http://www.bigdatalab.ac.cn/~shenhuawei/publications/2016/ijcai-man.pdf
PWC https://paperswithcode.com/paper/predict-anchor-links-across-social-networks
Repo
Framework

Small Talk Improves User Impressions of Interview Dialogue Systems

Title Small Talk Improves User Impressions of Interview Dialogue Systems
Authors Takahiro Kobori, Mikio Nakano, Tomoaki Nakamura
Abstract
Tasks Dialogue Management
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3646/
PDF https://www.aclweb.org/anthology/W16-3646
PWC https://paperswithcode.com/paper/small-talk-improves-user-impressions-of
Repo
Framework

Creating a Large Benchmark for Open Information Extraction

Title Creating a Large Benchmark for Open Information Extraction
Authors Gabriel Stanovsky, Ido Dagan
Abstract
Tasks Open Information Extraction
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1252/
PDF https://www.aclweb.org/anthology/D16-1252
PWC https://paperswithcode.com/paper/creating-a-large-benchmark-for-open
Repo
Framework

Understanding Negation in Positive Terms Using Syntactic Dependencies

Title Understanding Negation in Positive Terms Using Syntactic Dependencies
Authors Zahra Sarabi, Eduardo Blanco
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1119/
PDF https://www.aclweb.org/anthology/D16-1119
PWC https://paperswithcode.com/paper/understanding-negation-in-positive-terms
Repo
Framework

The Challenges and Joys of Analysing Ongoing Language Change in Web-based Corpora: a Case Study

Title The Challenges and Joys of Analysing Ongoing Language Change in Web-based Corpora: a Case Study
Authors Anne Krause
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2604/
PDF https://www.aclweb.org/anthology/W16-2604
PWC https://paperswithcode.com/paper/the-challenges-and-joys-of-analysing-ongoing
Repo
Framework

An Effect of Background Population Sample Size on the Performance of a Likelihood Ratio-based Forensic Text Comparison System: A Monte Carlo Simulation with Gaussian Mixture Model

Title An Effect of Background Population Sample Size on the Performance of a Likelihood Ratio-based Forensic Text Comparison System: A Monte Carlo Simulation with Gaussian Mixture Model
Authors Shunichi Ishihara
Abstract
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/U16-1012/
PDF https://www.aclweb.org/anthology/U16-1012
PWC https://paperswithcode.com/paper/an-effect-of-background-population-sample
Repo
Framework
comments powered by Disqus