May 4, 2019

1851 words 9 mins read

Paper Group NANR 205

Paper Group NANR 205

The ELRA License Wizard. How Factuality Determines Sentiment Inferences. Problematic Cases in the Annotation of Negation in Spanish. SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity. Evaluation of distributional semantic models: a holistic a …

The ELRA License Wizard

Title The ELRA License Wizard
Authors Val{'e}rie Mapelli, Vladimir Popescu, Lin Liu, Meritxell Fern{'a}ndez Barrera, Khalid Choukri
Abstract To allow an easy understanding of the various licenses that exist for the use of Language Resources (ELRA{'}s, META-SHARE{'}s, Creative Commons{'}, etc.), ELRA has developed a License Wizardto help the right-holders share/distribute their resources under the appropriate license. It also aims to be exploited by users to better understand the legal obligations that apply in various licensing situations. The present paper elaborates on the License Wizard functionalities of this web configurator, which enables to select a number of legal features and obtain the user license adapted to the users selection, to define which user licenses they would like to select in order to distribute their Language Resources, to integrate the user license terms into a Distribution Agreement that could be proposed to ELRA or META-SHARE for further distribution through the ELRA Catalogue of Language Resources. Thanks to a flexible back office, the structure of the legal feature selection can easily be reviewed to include other features that may be relevant for other licenses. Integrating contributions from other initiatives thus aim to be one of the obvious next steps, with a special focus on CLARIN and Linked Data experiences.
Tasks Feature Selection
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1718/
PDF https://www.aclweb.org/anthology/L16-1718
PWC https://paperswithcode.com/paper/the-elra-license-wizard
Repo
Framework

How Factuality Determines Sentiment Inferences

Title How Factuality Determines Sentiment Inferences
Authors Manfred Klenner, Simon Clematide
Abstract
Tasks Common Sense Reasoning
Published 2016-08-01
URL https://www.aclweb.org/anthology/S16-2008/
PDF https://www.aclweb.org/anthology/S16-2008
PWC https://paperswithcode.com/paper/how-factuality-determines-sentiment
Repo
Framework

Problematic Cases in the Annotation of Negation in Spanish

Title Problematic Cases in the Annotation of Negation in Spanish
Authors Salud Mar{'\i}a Jim{'e}nez-Zafra, Maite Martin, L. Alfonso Ure{~n}a-L{'o}pez, Toni Mart{'\i}, Mariona Taul{'e}
Abstract This paper presents the main sources of disagreement found during the annotation of the Spanish SFU Review Corpus with negation (SFU ReviewSP -NEG). Negation detection is a challenge in most of the task related to NLP, so the availability of corpora annotated with this phenomenon is essential in order to advance in tasks related to this area. A thorough analysis of the problems found during the annotation could help in the study of this phenomenon.
Tasks Negation Detection, Opinion Mining, Question Answering, Sentiment Analysis
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-5006/
PDF https://www.aclweb.org/anthology/W16-5006
PWC https://paperswithcode.com/paper/problematic-cases-in-the-annotation-of
Repo
Framework

SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity

Title SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity
Authors Sergio Jimenez
Abstract
Tasks Negation Detection, Semantic Textual Similarity
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1116/
PDF https://www.aclweb.org/anthology/S16-1116
PWC https://paperswithcode.com/paper/sergiojimenez-at-semeval-2016-task-1
Repo
Framework

Evaluation of distributional semantic models: a holistic approach

Title Evaluation of distributional semantic models: a holistic approach
Authors Gabriel Bernier-Colborne, Patrick Drouin
Abstract We investigate how both model-related factors and application-related factors affect the accuracy of distributional semantic models (DSMs) in the context of specialized lexicography, and how these factors interact. This holistic approach to the evaluation of DSMs provides valuable guidelines for the use of these models and insight into the kind of semantic information they capture.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4707/
PDF https://www.aclweb.org/anthology/W16-4707
PWC https://paperswithcode.com/paper/evaluation-of-distributional-semantic-models
Repo
Framework

DISCO Nets : DISsimilarity COefficients Networks

Title DISCO Nets : DISsimilarity COefficients Networks
Authors Diane Bouchacourt, Pawan K. Mudigonda, Sebastian Nowozin
Abstract We present a new type of probabilistic model which we call DISsimilarity COefficient Networks (DISCO Nets). DISCO Nets allow us to efficiently sample from a posterior distribution parametrised by a neural network. During training, DISCO Nets are learned by minimising the dissimilarity coefficient between the true distribution and the estimated distribution. This allows us to tailor the training to the loss related to the task at hand. We empirically show that (i) by modeling uncertainty on the output value, DISCO Nets outperform equivalent non-probabilistic predictive networks and (ii) DISCO Nets accurately model the uncertainty of the output, outperforming existing probabilistic models based on deep neural networks.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6143-disco-nets-dissimilarity-coefficients-networks
PDF http://papers.nips.cc/paper/6143-disco-nets-dissimilarity-coefficients-networks.pdf
PWC https://paperswithcode.com/paper/disco-nets-dissimilarity-coefficients
Repo
Framework
Title Human-like Natural Language Generation Using Monte Carlo Tree Search
Authors Kaori Kumagai, Ichiro Kobayashi, Daichi Mochihashi, Hideki Asoh, Tomoaki Nakamura, Takayuki Nagai
Abstract
Tasks Language Modelling, Text Generation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-5502/
PDF https://www.aclweb.org/anthology/W16-5502
PWC https://paperswithcode.com/paper/human-like-natural-language-generation-using
Repo
Framework

SemEval-2016 Task 2: Interpretable Semantic Textual Similarity

Title SemEval-2016 Task 2: Interpretable Semantic Textual Similarity
Authors Eneko Agirre, Aitor Gonzalez-Agirre, I{~n}igo Lopez-Gazpio, Montse Maritxalar, German Rigau, Larraitz Uria
Abstract
Tasks Image Captioning, Semantic Textual Similarity
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1082/
PDF https://www.aclweb.org/anthology/S16-1082
PWC https://paperswithcode.com/paper/semeval-2016-task-2-interpretable-semantic
Repo
Framework

Microsyntactic Phenomena as a Computational Linguistics Issue

Title Microsyntactic Phenomena as a Computational Linguistics Issue
Authors Leonid Iomdin
Abstract Microsyntactic linguistic units, such as syntactic idioms and non-standard syntactic constructions, are poorly represented in linguistic resources, mostly because the former are elements occupying an intermediate position between the lexicon and the grammar and the latter are too specific to be routinely tackled by general grammars. Consequently, many such units produce substantial gaps in systems intended to solve sophisticated computational linguistics tasks, such as parsing, deep semantic analysis, question answering, machine translation, or text generation. They also present obstacles for applying advanced techniques to these tasks, such as machine learning. The paper discusses an approach aimed at bridging such gaps, focusing on the development of monolingual and multilingual corpora where microsyntactic units are to be tagged.
Tasks Machine Translation, Question Answering, Text Generation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3803/
PDF https://www.aclweb.org/anthology/W16-3803
PWC https://paperswithcode.com/paper/microsyntactic-phenomena-as-a-computational
Repo
Framework

The Development of Multimodal Lexical Resources

Title The Development of Multimodal Lexical Resources
Authors James Pustejovsky, Tuan Do, Gitit Kehat, Nikhil Krishnaswamy
Abstract Human communication is a multimodal activity, involving not only speech and written expressions, but intonation, images, gestures, visual clues, and the interpretation of actions through perception. In this paper, we describe the design of a multimodal lexicon that is able to accommodate the diverse modalities that present themselves in NLP applications. We have been developing a multimodal semantic representation, VoxML, that integrates the encoding of semantic, visual, gestural, and action-based features associated with linguistic expressions.
Tasks Question Answering, Visual Question Answering
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3807/
PDF https://www.aclweb.org/anthology/W16-3807
PWC https://paperswithcode.com/paper/the-development-of-multimodal-lexical
Repo
Framework

Representing Support Verbs in FrameNet

Title Representing Support Verbs in FrameNet
Authors Miriam R. L. Petruck, Michael Ellsworth
Abstract
Tasks Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-1811/
PDF https://www.aclweb.org/anthology/W16-1811
PWC https://paperswithcode.com/paper/representing-support-verbs-in-framenet
Repo
Framework

The Grammar of English Deverbal Compounds and their Meaning

Title The Grammar of English Deverbal Compounds and their Meaning
Authors Gianina Iord{\u{a}}chioaia, Lonneke van der Plas, Glorianna Jagfeld
Abstract We present an interdisciplinary study on the interaction between the interpretation of noun-noun deverbal compounds (DCs; e.g., task assignment) and the morphosyntactic properties of their deverbal heads in English. Underlying hypotheses from theoretical linguistics are tested with tools and resources from computational linguistics. We start with Grimshaw{'}s (1990) insight that deverbal nouns are ambiguous between argument-supporting nominal (ASN) readings, which inherit verbal arguments (e.g., the assignment of the tasks), and the less verbal and more lexicalized Result Nominal and Simple Event readings (e.g., a two-page assignment). Following Grimshaw, our hypothesis is that the former will realize object arguments in DCs, while the latter will receive a wider range of interpretations like root compounds headed by non-derived nouns (e.g., chocolate box). Evidence from a large corpus assisted by machine learning techniques confirms this hypothesis, by showing that, besides other features, the realization of internal arguments by deverbal heads outside compounds (i.e., the most distinctive ASN-property in Grimshaw 1990) is a good predictor for an object interpretation of non-heads in DCs.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3811/
PDF https://www.aclweb.org/anthology/W16-3811
PWC https://paperswithcode.com/paper/the-grammar-of-english-deverbal-compounds-and
Repo
Framework

Identification of Flexible Multiword Expressions with the Help of Dependency Structure Annotation

Title Identification of Flexible Multiword Expressions with the Help of Dependency Structure Annotation
Authors Ayaka Morimoto, Akifumi Yoshimoto, Akihiko Kato, Hiroyuki Shindo, Yuji Matsumoto
Abstract This paper presents our ongoing work on compilation of English multi-word expression (MWE) lexicon. We are especially interested in collecting flexible MWEs, in which some other components can intervene the expression such as {}a number of{''} vs {}a large number of{''} where a modifier of {``}number{''} can be placed in the expression and inherit the original meaning. We fiest collect possible candidates of flexible English MWEs from the web, and annotate all of their occurrences in the Wall Street Journal portion of Ontonotes corpus. We make use of word dependency strcuture information of the sentences converted from the phrase structure annotation. This process enables semi-automatic annotation of MWEs in the corpus and simultanaously produces the internal and external dependency representation of flexible MWEs. |
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3813/
PDF https://www.aclweb.org/anthology/W16-3813
PWC https://paperswithcode.com/paper/identification-of-flexible-multiword
Repo
Framework

A new look at possessive reflexivization: A comparative study between Czech and Russian

Title A new look at possessive reflexivization: A comparative study between Czech and Russian
Authors Anna Nedoluzhko
Abstract The paper presents a contrastive description of reflexive possessive pronouns {}sv{\r{u}}j{''} in Czech and {}svoj{''} in Russian. The research concerns syntactic, semantic and pragmatic aspects. With our analysis, we shed a new light on the already investigated issue, which comes from a detailed comparison of the phenomenon of possessive reflexivization in two typologically and genetically similar languages. We show that whereas in Czech, the possessive reflexivization is mostly limited to syntactic functions and does not go beyond the grammar, in Russian it gets additional semantic meanings and moves substan-tially towards the lexicon. The obtained knowledge allows us to explain heretofore unclear marginal uses of reflexives in each language.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3814/
PDF https://www.aclweb.org/anthology/W16-3814
PWC https://paperswithcode.com/paper/a-new-look-at-possessive-reflexivization-a
Repo
Framework

Processing non-canonical or noisy text: fortuitous data to the rescue

Title Processing non-canonical or noisy text: fortuitous data to the rescue
Authors Barbara Plank
Abstract Real world data differs radically from the benchmark corpora we use in NLP, resulting in large performance drops. The reason for this problem is obvious: NLP models are trained on limited samples from canonical varieties considered standard. However, there are many dimensions, e.g., sociodemographic, language, genre, sentence type, etc. on which texts can differ from the standard. The solution is not obvious: we cannot control for all factors, and it is not clear how to best go beyond the current practice of training on homogeneous data from a single domain and language. In this talk, I review the notion of canonicity, and how it shapes our community{'}s approach to language. I argue for the use of fortuitous data. Fortuitous data is data out there that just waits to be harvested. It includes data which is in plain sight, but is often neglected, and more distant sources like behavioral data, which first need to be refined. They provide additional contexts and a myriad of opportunities to build more adaptive language technology, some of which I will explore in this talk.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3901/
PDF https://www.aclweb.org/anthology/W16-3901
PWC https://paperswithcode.com/paper/processing-non-canonical-or-noisy-text
Repo
Framework
comments powered by Disqus