Paper Group NANR 205
The ELRA License Wizard. How Factuality Determines Sentiment Inferences. Problematic Cases in the Annotation of Negation in Spanish. SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity. Evaluation of distributional semantic models: a holistic a …
The ELRA License Wizard
Title | The ELRA License Wizard |
Authors | Val{'e}rie Mapelli, Vladimir Popescu, Lin Liu, Meritxell Fern{'a}ndez Barrera, Khalid Choukri |
Abstract | To allow an easy understanding of the various licenses that exist for the use of Language Resources (ELRA{'}s, META-SHARE{'}s, Creative Commons{'}, etc.), ELRA has developed a License Wizardto help the right-holders share/distribute their resources under the appropriate license. It also aims to be exploited by users to better understand the legal obligations that apply in various licensing situations. The present paper elaborates on the License Wizard functionalities of this web configurator, which enables to select a number of legal features and obtain the user license adapted to the users selection, to define which user licenses they would like to select in order to distribute their Language Resources, to integrate the user license terms into a Distribution Agreement that could be proposed to ELRA or META-SHARE for further distribution through the ELRA Catalogue of Language Resources. Thanks to a flexible back office, the structure of the legal feature selection can easily be reviewed to include other features that may be relevant for other licenses. Integrating contributions from other initiatives thus aim to be one of the obvious next steps, with a special focus on CLARIN and Linked Data experiences. |
Tasks | Feature Selection |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1718/ |
https://www.aclweb.org/anthology/L16-1718 | |
PWC | https://paperswithcode.com/paper/the-elra-license-wizard |
Repo | |
Framework | |
How Factuality Determines Sentiment Inferences
Title | How Factuality Determines Sentiment Inferences |
Authors | Manfred Klenner, Simon Clematide |
Abstract | |
Tasks | Common Sense Reasoning |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/S16-2008/ |
https://www.aclweb.org/anthology/S16-2008 | |
PWC | https://paperswithcode.com/paper/how-factuality-determines-sentiment |
Repo | |
Framework | |
Problematic Cases in the Annotation of Negation in Spanish
Title | Problematic Cases in the Annotation of Negation in Spanish |
Authors | Salud Mar{'\i}a Jim{'e}nez-Zafra, Maite Martin, L. Alfonso Ure{~n}a-L{'o}pez, Toni Mart{'\i}, Mariona Taul{'e} |
Abstract | This paper presents the main sources of disagreement found during the annotation of the Spanish SFU Review Corpus with negation (SFU ReviewSP -NEG). Negation detection is a challenge in most of the task related to NLP, so the availability of corpora annotated with this phenomenon is essential in order to advance in tasks related to this area. A thorough analysis of the problems found during the annotation could help in the study of this phenomenon. |
Tasks | Negation Detection, Opinion Mining, Question Answering, Sentiment Analysis |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-5006/ |
https://www.aclweb.org/anthology/W16-5006 | |
PWC | https://paperswithcode.com/paper/problematic-cases-in-the-annotation-of |
Repo | |
Framework | |
SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity
Title | SERGIOJIMENEZ at SemEval-2016 Task 1: Effectively Combining Paraphrase Database, String Matching, WordNet, and Word Embedding for Semantic Textual Similarity |
Authors | Sergio Jimenez |
Abstract | |
Tasks | Negation Detection, Semantic Textual Similarity |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/S16-1116/ |
https://www.aclweb.org/anthology/S16-1116 | |
PWC | https://paperswithcode.com/paper/sergiojimenez-at-semeval-2016-task-1 |
Repo | |
Framework | |
Evaluation of distributional semantic models: a holistic approach
Title | Evaluation of distributional semantic models: a holistic approach |
Authors | Gabriel Bernier-Colborne, Patrick Drouin |
Abstract | We investigate how both model-related factors and application-related factors affect the accuracy of distributional semantic models (DSMs) in the context of specialized lexicography, and how these factors interact. This holistic approach to the evaluation of DSMs provides valuable guidelines for the use of these models and insight into the kind of semantic information they capture. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4707/ |
https://www.aclweb.org/anthology/W16-4707 | |
PWC | https://paperswithcode.com/paper/evaluation-of-distributional-semantic-models |
Repo | |
Framework | |
DISCO Nets : DISsimilarity COefficients Networks
Title | DISCO Nets : DISsimilarity COefficients Networks |
Authors | Diane Bouchacourt, Pawan K. Mudigonda, Sebastian Nowozin |
Abstract | We present a new type of probabilistic model which we call DISsimilarity COefficient Networks (DISCO Nets). DISCO Nets allow us to efficiently sample from a posterior distribution parametrised by a neural network. During training, DISCO Nets are learned by minimising the dissimilarity coefficient between the true distribution and the estimated distribution. This allows us to tailor the training to the loss related to the task at hand. We empirically show that (i) by modeling uncertainty on the output value, DISCO Nets outperform equivalent non-probabilistic predictive networks and (ii) DISCO Nets accurately model the uncertainty of the output, outperforming existing probabilistic models based on deep neural networks. |
Tasks | |
Published | 2016-12-01 |
URL | http://papers.nips.cc/paper/6143-disco-nets-dissimilarity-coefficients-networks |
http://papers.nips.cc/paper/6143-disco-nets-dissimilarity-coefficients-networks.pdf | |
PWC | https://paperswithcode.com/paper/disco-nets-dissimilarity-coefficients |
Repo | |
Framework | |
Human-like Natural Language Generation Using Monte Carlo Tree Search
Title | Human-like Natural Language Generation Using Monte Carlo Tree Search |
Authors | Kaori Kumagai, Ichiro Kobayashi, Daichi Mochihashi, Hideki Asoh, Tomoaki Nakamura, Takayuki Nagai |
Abstract | |
Tasks | Language Modelling, Text Generation |
Published | 2016-09-01 |
URL | https://www.aclweb.org/anthology/W16-5502/ |
https://www.aclweb.org/anthology/W16-5502 | |
PWC | https://paperswithcode.com/paper/human-like-natural-language-generation-using |
Repo | |
Framework | |
SemEval-2016 Task 2: Interpretable Semantic Textual Similarity
Title | SemEval-2016 Task 2: Interpretable Semantic Textual Similarity |
Authors | Eneko Agirre, Aitor Gonzalez-Agirre, I{~n}igo Lopez-Gazpio, Montse Maritxalar, German Rigau, Larraitz Uria |
Abstract | |
Tasks | Image Captioning, Semantic Textual Similarity |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/S16-1082/ |
https://www.aclweb.org/anthology/S16-1082 | |
PWC | https://paperswithcode.com/paper/semeval-2016-task-2-interpretable-semantic |
Repo | |
Framework | |
Microsyntactic Phenomena as a Computational Linguistics Issue
Title | Microsyntactic Phenomena as a Computational Linguistics Issue |
Authors | Leonid Iomdin |
Abstract | Microsyntactic linguistic units, such as syntactic idioms and non-standard syntactic constructions, are poorly represented in linguistic resources, mostly because the former are elements occupying an intermediate position between the lexicon and the grammar and the latter are too specific to be routinely tackled by general grammars. Consequently, many such units produce substantial gaps in systems intended to solve sophisticated computational linguistics tasks, such as parsing, deep semantic analysis, question answering, machine translation, or text generation. They also present obstacles for applying advanced techniques to these tasks, such as machine learning. The paper discusses an approach aimed at bridging such gaps, focusing on the development of monolingual and multilingual corpora where microsyntactic units are to be tagged. |
Tasks | Machine Translation, Question Answering, Text Generation |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-3803/ |
https://www.aclweb.org/anthology/W16-3803 | |
PWC | https://paperswithcode.com/paper/microsyntactic-phenomena-as-a-computational |
Repo | |
Framework | |
The Development of Multimodal Lexical Resources
Title | The Development of Multimodal Lexical Resources |
Authors | James Pustejovsky, Tuan Do, Gitit Kehat, Nikhil Krishnaswamy |
Abstract | Human communication is a multimodal activity, involving not only speech and written expressions, but intonation, images, gestures, visual clues, and the interpretation of actions through perception. In this paper, we describe the design of a multimodal lexicon that is able to accommodate the diverse modalities that present themselves in NLP applications. We have been developing a multimodal semantic representation, VoxML, that integrates the encoding of semantic, visual, gestural, and action-based features associated with linguistic expressions. |
Tasks | Question Answering, Visual Question Answering |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-3807/ |
https://www.aclweb.org/anthology/W16-3807 | |
PWC | https://paperswithcode.com/paper/the-development-of-multimodal-lexical |
Repo | |
Framework | |
Representing Support Verbs in FrameNet
Title | Representing Support Verbs in FrameNet |
Authors | Miriam R. L. Petruck, Michael Ellsworth |
Abstract | |
Tasks | Word Embeddings |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-1811/ |
https://www.aclweb.org/anthology/W16-1811 | |
PWC | https://paperswithcode.com/paper/representing-support-verbs-in-framenet |
Repo | |
Framework | |
The Grammar of English Deverbal Compounds and their Meaning
Title | The Grammar of English Deverbal Compounds and their Meaning |
Authors | Gianina Iord{\u{a}}chioaia, Lonneke van der Plas, Glorianna Jagfeld |
Abstract | We present an interdisciplinary study on the interaction between the interpretation of noun-noun deverbal compounds (DCs; e.g., task assignment) and the morphosyntactic properties of their deverbal heads in English. Underlying hypotheses from theoretical linguistics are tested with tools and resources from computational linguistics. We start with Grimshaw{'}s (1990) insight that deverbal nouns are ambiguous between argument-supporting nominal (ASN) readings, which inherit verbal arguments (e.g., the assignment of the tasks), and the less verbal and more lexicalized Result Nominal and Simple Event readings (e.g., a two-page assignment). Following Grimshaw, our hypothesis is that the former will realize object arguments in DCs, while the latter will receive a wider range of interpretations like root compounds headed by non-derived nouns (e.g., chocolate box). Evidence from a large corpus assisted by machine learning techniques confirms this hypothesis, by showing that, besides other features, the realization of internal arguments by deverbal heads outside compounds (i.e., the most distinctive ASN-property in Grimshaw 1990) is a good predictor for an object interpretation of non-heads in DCs. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-3811/ |
https://www.aclweb.org/anthology/W16-3811 | |
PWC | https://paperswithcode.com/paper/the-grammar-of-english-deverbal-compounds-and |
Repo | |
Framework | |
Identification of Flexible Multiword Expressions with the Help of Dependency Structure Annotation
Title | Identification of Flexible Multiword Expressions with the Help of Dependency Structure Annotation |
Authors | Ayaka Morimoto, Akifumi Yoshimoto, Akihiko Kato, Hiroyuki Shindo, Yuji Matsumoto |
Abstract | This paper presents our ongoing work on compilation of English multi-word expression (MWE) lexicon. We are especially interested in collecting flexible MWEs, in which some other components can intervene the expression such as {}a number of{''} vs { }a large number of{''} where a modifier of {``}number{''} can be placed in the expression and inherit the original meaning. We fiest collect possible candidates of flexible English MWEs from the web, and annotate all of their occurrences in the Wall Street Journal portion of Ontonotes corpus. We make use of word dependency strcuture information of the sentences converted from the phrase structure annotation. This process enables semi-automatic annotation of MWEs in the corpus and simultanaously produces the internal and external dependency representation of flexible MWEs. | |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-3813/ |
https://www.aclweb.org/anthology/W16-3813 | |
PWC | https://paperswithcode.com/paper/identification-of-flexible-multiword |
Repo | |
Framework | |
A new look at possessive reflexivization: A comparative study between Czech and Russian
Title | A new look at possessive reflexivization: A comparative study between Czech and Russian |
Authors | Anna Nedoluzhko |
Abstract | The paper presents a contrastive description of reflexive possessive pronouns {}sv{\r{u}}j{''} in Czech and { }svoj{''} in Russian. The research concerns syntactic, semantic and pragmatic aspects. With our analysis, we shed a new light on the already investigated issue, which comes from a detailed comparison of the phenomenon of possessive reflexivization in two typologically and genetically similar languages. We show that whereas in Czech, the possessive reflexivization is mostly limited to syntactic functions and does not go beyond the grammar, in Russian it gets additional semantic meanings and moves substan-tially towards the lexicon. The obtained knowledge allows us to explain heretofore unclear marginal uses of reflexives in each language. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-3814/ |
https://www.aclweb.org/anthology/W16-3814 | |
PWC | https://paperswithcode.com/paper/a-new-look-at-possessive-reflexivization-a |
Repo | |
Framework | |
Processing non-canonical or noisy text: fortuitous data to the rescue
Title | Processing non-canonical or noisy text: fortuitous data to the rescue |
Authors | Barbara Plank |
Abstract | Real world data differs radically from the benchmark corpora we use in NLP, resulting in large performance drops. The reason for this problem is obvious: NLP models are trained on limited samples from canonical varieties considered standard. However, there are many dimensions, e.g., sociodemographic, language, genre, sentence type, etc. on which texts can differ from the standard. The solution is not obvious: we cannot control for all factors, and it is not clear how to best go beyond the current practice of training on homogeneous data from a single domain and language. In this talk, I review the notion of canonicity, and how it shapes our community{'}s approach to language. I argue for the use of fortuitous data. Fortuitous data is data out there that just waits to be harvested. It includes data which is in plain sight, but is often neglected, and more distant sources like behavioral data, which first need to be refined. They provide additional contexts and a myriad of opportunities to build more adaptive language technology, some of which I will explore in this talk. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-3901/ |
https://www.aclweb.org/anthology/W16-3901 | |
PWC | https://paperswithcode.com/paper/processing-non-canonical-or-noisy-text |
Repo | |
Framework | |