July 26, 2019

1903 words 9 mins read

Paper Group NANR 144

The Making of the Royal Society Corpus. Investigating Opinion Mining through Language Varieties: a Case Study of Brazilian and European Portuguese tweets. 反義詞「多」和「少」在數量名結構中的不對稱現象－－以語料庫為本的分析 (The Asymmetric Occurences of \textitDou1 and \textitShao3 in the [Numeral + Measure Word/Classifier + Noun] Construction: A Corpus-based Analysis) [In Chinese] …

The Making of the Royal Society Corpus


Title	The Making of the Royal Society Corpus
Authors	J{"o}rg Knappen, Stefan Fischer, Hannah Kermes, Elke Teich, Peter Fankhauser
Abstract
Tasks	Optical Character Recognition, Part-Of-Speech Tagging, Word Embeddings
Published	2017-05-01
URL	https://www.aclweb.org/anthology/W17-0503/
PDF	https://www.aclweb.org/anthology/W17-0503
PWC	https://paperswithcode.com/paper/the-making-of-the-royal-society-corpus
Repo
Framework

Investigating Opinion Mining through Language Varieties: a Case Study of Brazilian and European Portuguese tweets


Title	Investigating Opinion Mining through Language Varieties: a Case Study of Brazilian and European Portuguese tweets
Authors	Douglas Vit{'o}rio, Ellen Polliana Ramos Souza, Ingryd Pereira, Adriano Oliveira
Abstract
Tasks	Decision Making, Opinion Mining, Sentiment Analysis
Published	2017-10-01
URL	https://www.aclweb.org/anthology/W17-6607/
PDF	https://www.aclweb.org/anthology/W17-6607
PWC	https://paperswithcode.com/paper/investigating-opinion-mining-through-language
Repo
Framework

反義詞「多」和「少」在數量名結構中的不對稱現象－－以語料庫為本的分析 (The Asymmetric Occurences of \textitDou1 and \textitShao3 in the [Numeral + Measure Word/Classifier + Noun] Construction: A Corpus-based Analysis) [In Chinese]


Title	反義詞「多」和「少」在數量名結構中的不對稱現象－－以語料庫為本的分析 (The Asymmetric Occurences of \textitDou1 and \textitShao3 in the [Numeral + Measure Word/Classifier + Noun] Construction: A Corpus-based Analysis) [In Chinese]
Authors	Wei-Yu Chen, Siaw-Fong Chung
Abstract
Tasks
Published	2017-06-01
URL	https://www.aclweb.org/anthology/O17-2002/
PDF	https://www.aclweb.org/anthology/O17-2002
PWC	https://paperswithcode.com/paper/ac34eaaaaaaaa-eaca-ca-ac-c34ei14i14aeaaococa
Repo
Framework

Approximation Algorithms for \ell_0-Low Rank Approximation


Title	Approximation Algorithms for \ell_0-Low Rank Approximation
Authors	Karl Bringmann, Pavel Kolev, David Woodruff
Abstract	We study the $\ell_0$-Low Rank Approximation Problem, where the goal is, given an $m \times n$ matrix $A$, to output a rank-$k$ matrix $A'$ for which $\A’-A_0$ is minimized. Here, for a matrix $B$, $\B_0$ denotes the number of its non-zero entries. This NP-hard variant of low rank approximation is natural for problems with no underlying metric, and its goal is to minimize the number of disagreeing data positions. We provide approximation algorithms which significantly improve the running time and approximation factor of previous work. For $k > 1$, we show how to find, in poly$(mn)$ time for every $k$, a rank $O(k \log(n/k))$ matrix $A'$ for which $\A’-A_0 \leq O(k^2 \log(n/k)) \OPT$. To the best of our knowledge, this is the first algorithm with provable guarantees for the $\ell_0$-Low Rank Approximation Problem for $k > 1$, even for bicriteria algorithms. For the well-studied case when $k = 1$, we give a $(2+\epsilon)$-approximation in {\it sublinear time}, which is impossible for other variants of low rank approximation such as for the Frobenius norm. We strengthen this for the well-studied case of binary matrices to obtain a $(1+O(\psi))$-approximation in sublinear time, where $\psi = \OPT/\nnz{A}$. For small $\psi$, our approximation factor is $1+o(1)$.
Tasks
Published	2017-12-01
URL	http://papers.nips.cc/paper/7242-approximation-algorithms-for-ell_0-low-rank-approximation
PDF	http://papers.nips.cc/paper/7242-approximation-algorithms-for-ell_0-low-rank-approximation.pdf
PWC	https://paperswithcode.com/paper/approximation-algorithms-for-ell_0-low-rank-1
Repo
Framework

Learning What’s Easy: Fully Differentiable Neural Easy-First Taggers


Title	Learning What’s Easy: Fully Differentiable Neural Easy-First Taggers
Authors	Andr{'e} F. T. Martins, Julia Kreutzer
Abstract	We introduce a novel neural easy-first decoder that learns to solve sequence tagging tasks in a flexible order. In contrast to previous easy-first decoders, our models are end-to-end differentiable. The decoder iteratively updates a {``}sketch{''} of the predictions over the sequence. At its core is an attention mechanism that controls which parts of the input are strategically the best to process next. We present a new constrained softmax transformation that ensures the same cumulative attention to every word, and show how to efficiently evaluate and backpropagate over it. Our models compare favourably to BILSTM taggers on three sequence tagging tasks. \|
Tasks	Imitation Learning, Named Entity Recognition, Part-Of-Speech Tagging
Published	2017-09-01
URL	https://www.aclweb.org/anthology/D17-1036/
PDF	https://www.aclweb.org/anthology/D17-1036
PWC	https://paperswithcode.com/paper/learning-whats-easy-fully-differentiable
Repo
Framework

A Computational Model of Human Preferences for Pronoun Resolution


Title	A Computational Model of Human Preferences for Pronoun Resolution
Authors	Olga Seminck, Pascal Amsili
Abstract	We present a cognitive computational model of pronoun resolution that reproduces the human interpretation preferences of the Subject Assignment Strategy and the Parallel Function Strategy. Our model relies on a probabilistic pronoun resolution system trained on corpus data. Factors influencing pronoun resolution are represented as features weighted by their relative importance. The importance the model gives to the preferences is in line with psycholinguistic studies. We demonstrate the cognitive plausibility of the model by running it on experimental items and simulating antecedent choice and reading times of human participants. Our model can be used as a new means to study pronoun resolution, because it captures the interaction of preferences.
Tasks	Coreference Resolution
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-4006/
PDF	https://www.aclweb.org/anthology/E17-4006
PWC	https://paperswithcode.com/paper/a-computational-model-of-human-preferences
Repo
Framework

Aligning Entity Names with Online Aliases on Twitter


Title	Aligning Entity Names with Online Aliases on Twitter
Authors	Kevin McKelvey, Peter Goutzounis, Stephen da Cruz, Nathanael Chambers
Abstract	This paper presents new models that automatically align online aliases with their real entity names. Many research applications rely on identifying entity names in text, but people often refer to entities with unexpected nicknames and aliases. For example, The King and King James are aliases for Lebron James, a professional basketball player. Recent work on entity linking attempts to resolve mentions to knowledge base entries, like a wikipedia page, but linking is unfortunately limited to well-known entities with pre-built pages. This paper asks a more basic question: can aliases be aligned without background knowledge of the entity? Further, can the semantics surrounding alias mentions be used to inform alignments? We describe statistical models that make decisions based on the lexicographic properties of the aliases with their semantic context in a large corpus of tweets. We experiment on a database of Twitter users and their usernames, and present the first human evaluation for this task. Alignment accuracy approaches human performance at 81{%}, and we show that while lexicographic features are most important, the semantic context of an alias further improves classification accuracy.
Tasks	Coreference Resolution, Entity Linking, Sentiment Analysis
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1104/
PDF	https://www.aclweb.org/anthology/W17-1104
PWC	https://paperswithcode.com/paper/aligning-entity-names-with-online-aliases-on
Repo
Framework

LCT-MALTA’s Submission to RepEval 2017 Shared Task


Title	LCT-MALTA’s Submission to RepEval 2017 Shared Task
Authors	Hoa Trong Vu, Thuong-Hai Pham, Xiaoyu Bai, Marc Tanti, Lonneke van der Plas, Albert Gatt
Abstract	System using BiLSTM and max pooling. Embedding is enhanced by POS, character and dependency info.
Tasks	Natural Language Inference, Sentence Embeddings, Word Embeddings
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-5311/
PDF	https://www.aclweb.org/anthology/W17-5311
PWC	https://paperswithcode.com/paper/lct-maltas-submission-to-repeval-2017-shared
Repo
Framework

Enriching Basque Coreference Resolution System using Semantic Knowledge sources


Title	Enriching Basque Coreference Resolution System using Semantic Knowledge sources
Authors	Ander Soraluze, Olatz Arregi, Xabier Arregi, Arantza D{'\i}az de Ilarraza
Abstract	In this paper we present a Basque coreference resolution system enriched with semantic knowledge. An error analysis carried out revealed the deficiencies that the system had in resolving coreference cases in which semantic or world knowledge is needed. We attempt to improve the deficiencies using two semantic knowledge sources, specifically Wikipedia and WordNet.
Tasks	Coreference Resolution
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1502/
PDF	https://www.aclweb.org/anthology/W17-1502
PWC	https://paperswithcode.com/paper/enriching-basque-coreference-resolution
Repo
Framework

Multi-source annotation projection of coreference chains: assessing strategies and testing opportunities


Title	Multi-source annotation projection of coreference chains: assessing strategies and testing opportunities
Authors	Yulia Grishina, Manfred Stede
Abstract	In this paper, we examine the possibility of using annotation projection from multiple sources for automatically obtaining coreference annotations in the target language. We implement a multi-source annotation projection algorithm and apply it on an English-German-Russian parallel corpus in order to transfer coreference chains from two sources to the target side. Operating in two settings {–} a low-resource and a more linguistically-informed one {–} we show that automatic coreference transfer could benefit from combining information from multiple languages, and assess the quality of both the extraction and the linking of target coreference mentions.
Tasks	Coreference Resolution, Named Entity Recognition
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1506/
PDF	https://www.aclweb.org/anthology/W17-1506
PWC	https://paperswithcode.com/paper/multi-source-annotation-projection-of
Repo
Framework


Title	Multivariate Linear Regression of Symptoms-related Tweets for Infectious Gastroenteritis Scale Estimation
Authors	Ryo Takeuchi, Hayate Iso, Kaoru Ito, Shoko Wakamiya, Eiji Aramaki
Abstract	To date, various Twitter-based event detection systems have been proposed. Most of their targets, however, share common characteristics. They are seasonal or global events such as earthquakes and flu pandemics. In contrast, this study targets unseasonal and local disease events. Our system investigates the frequencies of disease-related words such as {`}nausea{''},{`}chill{''},and {``}diarrhea{''} and estimates the number of patients using regression of these word frequencies. Experiments conducted using Japanese 47 areas from January 2017 to April 2017 revealed that the detection of small and unseasonal event is extremely difficult (overall performance: 0.13). However, we found that the event scale and the detection performance show high correlation in the specified cases (in the phase of patient increasing or decreasing). The results also suggest that when 150 and more patients appear in a high population area, we can expect that our social sensors detect this outbreak. Based on these results, we can infer that social sensors can reliably detect unseasonal and local disease events under certain conditions, just as they can for seasonal or global events. \|
Tasks
Published	2017-11-01
URL	https://www.aclweb.org/anthology/W17-5803/
PDF	https://www.aclweb.org/anthology/W17-5803
PWC	https://paperswithcode.com/paper/multivariate-linear-regression-of-symptoms
Repo
Framework

CORBON 2017 Shared Task: Projection-Based Coreference Resolution


Title	CORBON 2017 Shared Task: Projection-Based Coreference Resolution
Authors	Yulia Grishina
Abstract	The CORBON 2017 Shared Task, organised as part of the Coreference Resolution Beyond OntoNotes workshop at EACL 2017, presented a new challenge for multilingual coreference resolution: we offer a projection-based setting in which one is supposed to build a coreference resolver for a new language exploiting little or even no knowledge of it, with our languages of interest being German and Russian. We additionally offer a more traditional setting, targeting the development of a multilingual coreference resolver without any restrictions on the resources and methods used. In this paper, we describe the task setting and provide the results of one participant who successfully completed the task, comparing their results to the closely related previous research. Analysing the task setting and the results, we discuss the major challenges and make suggestions on the future directions of coreference evaluation.
Tasks	Coreference Resolution, Word Alignment
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1507/
PDF	https://www.aclweb.org/anthology/W17-1507
PWC	https://paperswithcode.com/paper/corbon-2017-shared-task-projection-based
Repo
Framework

A Two-stage Sieve Approach for Quote Attribution


Title	A Two-stage Sieve Approach for Quote Attribution
Authors	Grace Muzny, Michael Fang, Angel Chang, Dan Jurafsky
Abstract	We present a deterministic sieve-based system for attributing quotations in literary text and a new dataset: QuoteLi3. Quote attribution, determining who said what in a given text, is important for tasks like creating dialogue systems, and in newer areas like computational literary studies, where it creates opportunities to analyze novels at scale rather than only a few at a time. We release QuoteLi3, which contains more than 6,000 annotations linking quotes to speaker mentions and quotes to speaker entities, and introduce a new algorithm for quote attribution. Our two-stage algorithm first links quotes to mentions, then mentions to entities. Using two stages encapsulates difficult sub-problems and improves system performance. The modular design allows us to tune for overall performance or higher precision, which is useful for many real-world use cases. Our system achieves an average F-score of 87.5 across three novels, outperforming previous systems, and can be tuned for precision of 90.4 at a recall of 65.1.
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-1044/
PDF	https://www.aclweb.org/anthology/E17-1044
PWC	https://paperswithcode.com/paper/a-two-stage-sieve-approach-for-quote
Repo
Framework

DAG Automata for Meaning Representation


Title	DAG Automata for Meaning Representation
Authors	Frank Drewes
Abstract
Tasks
Published	2017-07-01
URL	https://www.aclweb.org/anthology/W17-3409/
PDF	https://www.aclweb.org/anthology/W17-3409
PWC	https://paperswithcode.com/paper/dag-automata-for-meaning-representation
Repo
Framework

Introducing Structure into Neural Network-Based Semantic Models


Title	Introducing Structure into Neural Network-Based Semantic Models
Authors	Stephen Clark
Abstract
Tasks
Published	2017-07-01
URL	https://www.aclweb.org/anthology/W17-3412/
PDF	https://www.aclweb.org/anthology/W17-3412
PWC	https://paperswithcode.com/paper/introducing-structure-into-neural-network
Repo
Framework