May 4, 2019

1851 words 9 mins read

Paper Group NANR 205

The ELRA License Wizard. How Factuality Determines Sentiment Inferences. Problematic Cases in the Annotation of Negation in Spanish. SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity. Evaluation of distributional semantic models: a holistic a …

The ELRA License Wizard


Title	The ELRA License Wizard
Authors	Val{'e}rie Mapelli, Vladimir Popescu, Lin Liu, Meritxell Fern{'a}ndez Barrera, Khalid Choukri
Abstract	To allow an easy understanding of the various licenses that exist for the use of Language Resources (ELRA{'}s, META-SHARE{'}s, Creative Commons{'}, etc.), ELRA has developed a License Wizardto help the right-holders share/distribute their resources under the appropriate license. It also aims to be exploited by users to better understand the legal obligations that apply in various licensing situations. The present paper elaborates on the License Wizard functionalities of this web configurator, which enables to select a number of legal features and obtain the user license adapted to the users selection, to define which user licenses they would like to select in order to distribute their Language Resources, to integrate the user license terms into a Distribution Agreement that could be proposed to ELRA or META-SHARE for further distribution through the ELRA Catalogue of Language Resources. Thanks to a flexible back office, the structure of the legal feature selection can easily be reviewed to include other features that may be relevant for other licenses. Integrating contributions from other initiatives thus aim to be one of the obvious next steps, with a special focus on CLARIN and Linked Data experiences.
Tasks	Feature Selection
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1718/
PDF	https://www.aclweb.org/anthology/L16-1718
PWC	https://paperswithcode.com/paper/the-elra-license-wizard
Repo
Framework

How Factuality Determines Sentiment Inferences


Title	How Factuality Determines Sentiment Inferences
Authors	Manfred Klenner, Simon Clematide
Abstract
Tasks	Common Sense Reasoning
Published	2016-08-01
URL	https://www.aclweb.org/anthology/S16-2008/
PDF	https://www.aclweb.org/anthology/S16-2008
PWC	https://paperswithcode.com/paper/how-factuality-determines-sentiment
Repo
Framework

Problematic Cases in the Annotation of Negation in Spanish


Title	Problematic Cases in the Annotation of Negation in Spanish
Authors	Salud Mar{'\i}a Jim{'e}nez-Zafra, Maite Martin, L. Alfonso Ure{~n}a-L{'o}pez, Toni Mart{'\i}, Mariona Taul{'e}
Abstract	This paper presents the main sources of disagreement found during the annotation of the Spanish SFU Review Corpus with negation (SFU ReviewSP -NEG). Negation detection is a challenge in most of the task related to NLP, so the availability of corpora annotated with this phenomenon is essential in order to advance in tasks related to this area. A thorough analysis of the problems found during the annotation could help in the study of this phenomenon.
Tasks	Negation Detection, Opinion Mining, Question Answering, Sentiment Analysis
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-5006/
PDF	https://www.aclweb.org/anthology/W16-5006
PWC	https://paperswithcode.com/paper/problematic-cases-in-the-annotation-of
Repo
Framework

SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity


Title	SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity
Authors	Sergio Jimenez
Abstract
Tasks	Negation Detection, Semantic Textual Similarity
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1116/
PDF	https://www.aclweb.org/anthology/S16-1116
PWC	https://paperswithcode.com/paper/sergiojimenez-at-semeval-2016-task-1
Repo
Framework

Evaluation of distributional semantic models: a holistic approach


Title	Evaluation of distributional semantic models: a holistic approach
Authors	Gabriel Bernier-Colborne, Patrick Drouin
Abstract	We investigate how both model-related factors and application-related factors affect the accuracy of distributional semantic models (DSMs) in the context of specialized lexicography, and how these factors interact. This holistic approach to the evaluation of DSMs provides valuable guidelines for the use of these models and insight into the kind of semantic information they capture.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4707/
PDF	https://www.aclweb.org/anthology/W16-4707
PWC	https://paperswithcode.com/paper/evaluation-of-distributional-semantic-models
Repo
Framework

DISCO Nets : DISsimilarity COefficients Networks


Title	DISCO Nets : DISsimilarity COefficients Networks
Authors	Diane Bouchacourt, Pawan K. Mudigonda, Sebastian Nowozin
Abstract	We present a new type of probabilistic model which we call DISsimilarity COefficient Networks (DISCO Nets). DISCO Nets allow us to efficiently sample from a posterior distribution parametrised by a neural network. During training, DISCO Nets are learned by minimising the dissimilarity coefficient between the true distribution and the estimated distribution. This allows us to tailor the training to the loss related to the task at hand. We empirically show that (i) by modeling uncertainty on the output value, DISCO Nets outperform equivalent non-probabilistic predictive networks and (ii) DISCO Nets accurately model the uncertainty of the output, outperforming existing probabilistic models based on deep neural networks.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6143-disco-nets-dissimilarity-coefficients-networks
PDF	http://papers.nips.cc/paper/6143-disco-nets-dissimilarity-coefficients-networks.pdf
PWC	https://paperswithcode.com/paper/disco-nets-dissimilarity-coefficients
Repo
Framework

Human-like Natural Language Generation Using Monte Carlo Tree Search


Title	Human-like Natural Language Generation Using Monte Carlo Tree Search
Authors	Kaori Kumagai, Ichiro Kobayashi, Daichi Mochihashi, Hideki Asoh, Tomoaki Nakamura, Takayuki Nagai
Abstract
Tasks	Language Modelling, Text Generation
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-5502/
PDF	https://www.aclweb.org/anthology/W16-5502
PWC	https://paperswithcode.com/paper/human-like-natural-language-generation-using
Repo
Framework

SemEval-2016 Task 2: Interpretable Semantic Textual Similarity


Title	SemEval-2016 Task 2: Interpretable Semantic Textual Similarity
Authors	Eneko Agirre, Aitor Gonzalez-Agirre, I{~n}igo Lopez-Gazpio, Montse Maritxalar, German Rigau, Larraitz Uria
Abstract
Tasks	Image Captioning, Semantic Textual Similarity
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1082/
PDF	https://www.aclweb.org/anthology/S16-1082
PWC	https://paperswithcode.com/paper/semeval-2016-task-2-interpretable-semantic
Repo
Framework

Microsyntactic Phenomena as a Computational Linguistics Issue


Title	Microsyntactic Phenomena as a Computational Linguistics Issue
Authors	Leonid Iomdin
Abstract	Microsyntactic linguistic units, such as syntactic idioms and non-standard syntactic constructions, are poorly represented in linguistic resources, mostly because the former are elements occupying an intermediate position between the lexicon and the grammar and the latter are too specific to be routinely tackled by general grammars. Consequently, many such units produce substantial gaps in systems intended to solve sophisticated computational linguistics tasks, such as parsing, deep semantic analysis, question answering, machine translation, or text generation. They also present obstacles for applying advanced techniques to these tasks, such as machine learning. The paper discusses an approach aimed at bridging such gaps, focusing on the development of monolingual and multilingual corpora where microsyntactic units are to be tagged.
Tasks	Machine Translation, Question Answering, Text Generation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3803/
PDF	https://www.aclweb.org/anthology/W16-3803
PWC	https://paperswithcode.com/paper/microsyntactic-phenomena-as-a-computational
Repo
Framework

The Development of Multimodal Lexical Resources


Title	The Development of Multimodal Lexical Resources
Authors	James Pustejovsky, Tuan Do, Gitit Kehat, Nikhil Krishnaswamy
Abstract	Human communication is a multimodal activity, involving not only speech and written expressions, but intonation, images, gestures, visual clues, and the interpretation of actions through perception. In this paper, we describe the design of a multimodal lexicon that is able to accommodate the diverse modalities that present themselves in NLP applications. We have been developing a multimodal semantic representation, VoxML, that integrates the encoding of semantic, visual, gestural, and action-based features associated with linguistic expressions.
Tasks	Question Answering, Visual Question Answering
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3807/
PDF	https://www.aclweb.org/anthology/W16-3807
PWC	https://paperswithcode.com/paper/the-development-of-multimodal-lexical
Repo
Framework

Representing Support Verbs in FrameNet


Title	Representing Support Verbs in FrameNet
Authors	Miriam R. L. Petruck, Michael Ellsworth
Abstract
Tasks	Word Embeddings
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-1811/
PDF	https://www.aclweb.org/anthology/W16-1811
PWC	https://paperswithcode.com/paper/representing-support-verbs-in-framenet
Repo
Framework

The Grammar of English Deverbal Compounds and their Meaning


Title	The Grammar of English Deverbal Compounds and their Meaning
Authors	Gianina Iord{\u{a}}chioaia, Lonneke van der Plas, Glorianna Jagfeld
Abstract	We present an interdisciplinary study on the interaction between the interpretation of noun-noun deverbal compounds (DCs; e.g., task assignment) and the morphosyntactic properties of their deverbal heads in English. Underlying hypotheses from theoretical linguistics are tested with tools and resources from computational linguistics. We start with Grimshaw{'}s (1990) insight that deverbal nouns are ambiguous between argument-supporting nominal (ASN) readings, which inherit verbal arguments (e.g., the assignment of the tasks), and the less verbal and more lexicalized Result Nominal and Simple Event readings (e.g., a two-page assignment). Following Grimshaw, our hypothesis is that the former will realize object arguments in DCs, while the latter will receive a wider range of interpretations like root compounds headed by non-derived nouns (e.g., chocolate box). Evidence from a large corpus assisted by machine learning techniques confirms this hypothesis, by showing that, besides other features, the realization of internal arguments by deverbal heads outside compounds (i.e., the most distinctive ASN-property in Grimshaw 1990) is a good predictor for an object interpretation of non-heads in DCs.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3811/
PDF	https://www.aclweb.org/anthology/W16-3811
PWC	https://paperswithcode.com/paper/the-grammar-of-english-deverbal-compounds-and
Repo
Framework

Identification of Flexible Multiword Expressions with the Help of Dependency Structure Annotation


Title	Identification of Flexible Multiword Expressions with the Help of Dependency Structure Annotation
Authors	Ayaka Morimoto, Akifumi Yoshimoto, Akihiko Kato, Hiroyuki Shindo, Yuji Matsumoto
Abstract	This paper presents our ongoing work on compilation of English multi-word expression (MWE) lexicon. We are especially interested in collecting flexible MWEs, in which some other components can intervene the expression such as {`}a number of{''} vs {`}a large number of{''} where a modifier of {``}number{''} can be placed in the expression and inherit the original meaning. We fiest collect possible candidates of flexible English MWEs from the web, and annotate all of their occurrences in the Wall Street Journal portion of Ontonotes corpus. We make use of word dependency strcuture information of the sentences converted from the phrase structure annotation. This process enables semi-automatic annotation of MWEs in the corpus and simultanaously produces the internal and external dependency representation of flexible MWEs. \|
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3813/
PDF	https://www.aclweb.org/anthology/W16-3813
PWC	https://paperswithcode.com/paper/identification-of-flexible-multiword
Repo
Framework

A new look at possessive reflexivization: A comparative study between Czech and Russian


Title	A new look at possessive reflexivization: A comparative study between Czech and Russian
Authors	Anna Nedoluzhko
Abstract	The paper presents a contrastive description of reflexive possessive pronouns {`}sv{\r{u}}j{''} in Czech and {`}svoj{''} in Russian. The research concerns syntactic, semantic and pragmatic aspects. With our analysis, we shed a new light on the already investigated issue, which comes from a detailed comparison of the phenomenon of possessive reflexivization in two typologically and genetically similar languages. We show that whereas in Czech, the possessive reflexivization is mostly limited to syntactic functions and does not go beyond the grammar, in Russian it gets additional semantic meanings and moves substan-tially towards the lexicon. The obtained knowledge allows us to explain heretofore unclear marginal uses of reflexives in each language.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3814/
PDF	https://www.aclweb.org/anthology/W16-3814
PWC	https://paperswithcode.com/paper/a-new-look-at-possessive-reflexivization-a
Repo
Framework

Processing non-canonical or noisy text: fortuitous data to the rescue


Title	Processing non-canonical or noisy text: fortuitous data to the rescue
Authors	Barbara Plank
Abstract	Real world data differs radically from the benchmark corpora we use in NLP, resulting in large performance drops. The reason for this problem is obvious: NLP models are trained on limited samples from canonical varieties considered standard. However, there are many dimensions, e.g., sociodemographic, language, genre, sentence type, etc. on which texts can differ from the standard. The solution is not obvious: we cannot control for all factors, and it is not clear how to best go beyond the current practice of training on homogeneous data from a single domain and language. In this talk, I review the notion of canonicity, and how it shapes our community{'}s approach to language. I argue for the use of fortuitous data. Fortuitous data is data out there that just waits to be harvested. It includes data which is in plain sight, but is often neglected, and more distant sources like behavioral data, which first need to be refined. They provide additional contexts and a myriad of opportunities to build more adaptive language technology, some of which I will explore in this talk.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3901/
PDF	https://www.aclweb.org/anthology/W16-3901
PWC	https://paperswithcode.com/paper/processing-non-canonical-or-noisy-text
Repo
Framework