May 5, 2019

2266 words 11 mins read

Paper Group NANR 149

Paper Group NANR 149

ULISBOA at SemEval-2016 Task 12: Extraction of temporal expressions, clinical events and relations using IBEnt. Contextual term equivalent search using domain-driven disambiguation. Exploring Distributional Representations and Machine Translation for Aspect-based Cross-lingual Sentiment Classification.. USFD at SemEval-2016 Task 6: Any-Target Stanc …

ULISBOA at SemEval-2016 Task 12: Extraction of temporal expressions, clinical events and relations using IBEnt

Title ULISBOA at SemEval-2016 Task 12: Extraction of temporal expressions, clinical events and relations using IBEnt
Authors Marcia Barros, Andre Lamurias, Gon{\c{c}}alo Figueiro, Marta Antunes, Joana Teixeira, Alex Pinheiro, re, Francisco M. Couto
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1196/
PDF https://www.aclweb.org/anthology/S16-1196
PWC https://paperswithcode.com/paper/ulisboa-at-semeval-2016-task-12-extraction-of
Repo
Framework

Contextual term equivalent search using domain-driven disambiguation

Title Contextual term equivalent search using domain-driven disambiguation
Authors Caroline Barri{`e}re, Pierre Andr{'e} M{'e}nard, Daphn{'e}e Azoulay
Abstract This article presents a domain-driven algorithm for the task of term sense disambiguation (TSD). TSD aims at automatically choosing which term record from a term bank best represents the meaning of a term occurring in a particular context. In a translation environment, finding the contextually appropriate term record is necessary to access the proper equivalent to be used in the target language text. The term bank TERMIUM Plus, recently published as an open access repository, is chosen as a domain-rich resource for testing our TSD algorithm, using English and French as source and target languages. We devise an experiment using over 1300 English terms found in scientific articles, and show that our domain-driven TSD algorithm is able to bring the best term record, and therefore the best French equivalent, at the average rank of 1.69 compared to a baseline random rank of 3.51.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4704/
PDF https://www.aclweb.org/anthology/W16-4704
PWC https://paperswithcode.com/paper/contextual-term-equivalent-search-using
Repo
Framework

Exploring Distributional Representations and Machine Translation for Aspect-based Cross-lingual Sentiment Classification.

Title Exploring Distributional Representations and Machine Translation for Aspect-based Cross-lingual Sentiment Classification.
Authors Jeremy Barnes, Patrik Lambert, Toni Badia
Abstract Cross-lingual sentiment classification (CLSC) seeks to use resources from a source language in order to detect sentiment and classify text in a target language. Almost all research into CLSC has been carried out at sentence and document level, although this level of granularity is often less useful. This paper explores methods for performing aspect-based cross-lingual sentiment classification (aspect-based CLSC) for under-resourced languages. Given the limited nature of parallel data for many languages, we would like to make the most of this resource for our task. We compare zero-shot learning, bilingual word embeddings, stacked denoising autoencoder representations and machine translation techniques for aspect-based CLSC. Each of these approaches requires differing amounts of parallel data. We show that models based on distributed semantics can achieve comparable results to machine translation on aspect-based CLSC and give an analysis of the errors found for each method.
Tasks Denoising, Machine Translation, Sentiment Analysis, Word Embeddings, Zero-Shot Learning
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1152/
PDF https://www.aclweb.org/anthology/C16-1152
PWC https://paperswithcode.com/paper/exploring-distributional-representations-and
Repo
Framework

USFD at SemEval-2016 Task 6: Any-Target Stance Detection on Twitter with Autoencoders

Title USFD at SemEval-2016 Task 6: Any-Target Stance Detection on Twitter with Autoencoders
Authors Isabelle Augenstein, Andreas Vlachos, Kalina Bontcheva
Abstract
Tasks Natural Language Inference, Sentiment Analysis, Stance Detection
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1063/
PDF https://www.aclweb.org/anthology/S16-1063
PWC https://paperswithcode.com/paper/usfd-at-semeval-2016-task-6-any-target-stance
Repo
Framework

Lightly Supervised Quality Estimation

Title Lightly Supervised Quality Estimation
Authors Matthias Sperber, Graham Neubig, Jan Niehues, Sebastian St{"u}ker, Alex Waibel
Abstract Evaluating the quality of output from language processing systems such as machine translation or speech recognition is an essential step in ensuring that they are sufficient for practical use. However, depending on the practical requirements, evaluation approaches can differ strongly. Often, reference-based evaluation measures (such as BLEU or WER) are appealing because they are cheap and allow rapid quantitative comparison. On the other hand, practitioners often focus on manual evaluation because they must deal with frequently changing domains and quality standards requested by customers, for which reference-based evaluation is insufficient or not possible due to missing in-domain reference data (Harris et al., 2016). In this paper, we attempt to bridge this gap by proposing a framework for lightly supervised quality estimation. We collect manually annotated scores for a small number of segments in a test corpus or document, and combine them with automatically predicted quality scores for the remaining segments to predict an overall quality estimate. An evaluation shows that our framework estimates quality more reliably than using fully automatic quality estimation approaches, while keeping annotation effort low by not requiring full references to be available for the particular domain.
Tasks Machine Translation, Speech Recognition
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1292/
PDF https://www.aclweb.org/anthology/C16-1292
PWC https://paperswithcode.com/paper/lightly-supervised-quality-estimation
Repo
Framework

Predicting the Evocation Relation between Lexicalized Concepts

Title Predicting the Evocation Relation between Lexicalized Concepts
Authors Yoshihiko Hayashi
Abstract Evocation is a directed yet weighted semantic relationship between lexicalized concepts. Although evocation relations are considered potentially useful in several semantic NLP tasks, the prediction of the evocation relation between an arbitrary pair of concepts remains difficult, since evocation relationships cover a broader range of semantic relations rooted in human perception and experience. This paper presents a supervised learning approach to predict the strength (by regression) and to determine the directionality (by classification) of the evocation relation that might hold between a pair of lexicalized concepts. Empirical results that were obtained by investigating useful features are shown, indicating that a combination of the proposed features largely outperformed individual baselines, and also suggesting that semantic relational vectors computed from existing semantic vectors for lexicalized concepts were indeed effective for both the prediction of strength and the determination of directionality.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1156/
PDF https://www.aclweb.org/anthology/C16-1156
PWC https://paperswithcode.com/paper/predicting-the-evocation-relation-between
Repo
Framework

Using Argument Mining to Assess the Argumentation Quality of Essays

Title Using Argument Mining to Assess the Argumentation Quality of Essays
Authors Henning Wachsmuth, Khalid Al-Khatib, Benno Stein
Abstract Argument mining aims to determine the argumentative structure of texts. Although it is said to be crucial for future applications such as writing support systems, the benefit of its output has rarely been evaluated. This paper puts the analysis of the output into the focus. In particular, we investigate to what extent the mined structure can be leveraged to assess the argumentation quality of persuasive essays. We find insightful statistical patterns in the structure of essays. From these, we derive novel features that we evaluate in four argumentation-related essay scoring tasks. Our results reveal the benefit of argument mining for assessing argumentation quality. Among others, we improve the state of the art in scoring an essay{'}s organization and its argument strength.
Tasks Argument Mining
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1158/
PDF https://www.aclweb.org/anthology/C16-1158
PWC https://paperswithcode.com/paper/using-argument-mining-to-assess-the
Repo
Framework

ltl.uni-due at SemEval-2016 Task 6: Stance Detection in Social Media Using Stacked Classifiers

Title ltl.uni-due at SemEval-2016 Task 6: Stance Detection in Social Media Using Stacked Classifiers
Authors Michael Wojatzki, Torsten Zesch
Abstract
Tasks Stance Detection
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1069/
PDF https://www.aclweb.org/anthology/S16-1069
PWC https://paperswithcode.com/paper/ltluni-due-at-semeval-2016-task-6-stance
Repo
Framework

``Beware the Jabberwock, dear reader!’’ Testing the distributional reality of construction semantics

Title ``Beware the Jabberwock, dear reader!’’ Testing the distributional reality of construction semantics |
Authors Gianluca Lebani, Aless Lenci, ro
Abstract Notwithstanding the success of the notion of construction, the computational tradition still lacks a way to represent the semantic content of these linguistic entities. Here we present a simple corpus-based model implementing the idea that the meaning of a syntactic construction is intimately related to the semantics of its typical verbs. It is a two-step process, that starts by identifying the typical verbs occurring with a given syntactic construction and building their distributional vectors. We then calculated the weighted centroid of these vectors in order to derive the distributional signature of a construction. In order to assess the goodness of our approach, we replicated the priming effect described by Johnson and Golberg (2013) as a function of the semantic distance between a construction and its prototypical verbs. Additional support for our view comes from a regression analysis showing that our distributional information can be used to model behavioral data collected with a crowdsourced elicitation experiment.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-5302/
PDF https://www.aclweb.org/anthology/W16-5302
PWC https://paperswithcode.com/paper/beware-the-jabberwock-dear-reader-testing-the
Repo
Framework

An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation

Title An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation
Authors Peter Viszlay, J{'a}n Sta{\v{s}}, Tom{'a}{\v{s}} Koct{'u}r, Martin Lojka, Jozef Juh{'a}r
Abstract In this paper, we introduce an extension of our previously released TUKE-BNews-SK corpus based on a semi-automatic annotation scheme. It firstly relies on the automatic transcription of the BN data performed by our Slovak large vocabulary continuous speech recognition system. The generated hypotheses are then manually corrected and completed by trained human annotators. The corpus is composed of 25 hours of fully-annotated spontaneous and prepared speech. In addition, we have acquired 900 hours of another BN data, part of which we plan to annotate semi-automatically. We present a preliminary corpus evaluation that gives very promising results.
Tasks Large Vocabulary Continuous Speech Recognition, Speech Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1743/
PDF https://www.aclweb.org/anthology/L16-1743
PWC https://paperswithcode.com/paper/an-extension-of-the-slovak-broadcast-news
Repo
Framework

Annotating Topic Development in Information Seeking Queries

Title Annotating Topic Development in Information Seeking Queries
Authors Marta Andersson, Adnan {"O}zt{"u}rel, Silvia Pareti
Abstract This paper contributes to the limited body of empirical research in the domain of discourse structure of information seeking queries. We describe the development of an annotation schema for coding topic development in information seeking queries and the initial observations from a pilot sample of query sessions. The main idea that we explore is the relationship between constant and variable discourse entities and their role in tracking changes in the topic progression. We argue that the topicalized entities remain stable across development of the discourse and can be identified by a simple mechanism where anaphora resolution is a precursor. We also claim that a corpus annotated in this framework can be used as training data for dialogue management and computational semantics systems.
Tasks Dialogue Management
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1277/
PDF https://www.aclweb.org/anthology/L16-1277
PWC https://paperswithcode.com/paper/annotating-topic-development-in-information
Repo
Framework

Is an Image Worth More than a Thousand Words? On the Fine-Grain Semantic Differences between Visual and Linguistic Representations

Title Is an Image Worth More than a Thousand Words? On the Fine-Grain Semantic Differences between Visual and Linguistic Representations
Authors Guillem Collell, Marie-Francine Moens
Abstract Human concept representations are often grounded with visual information, yet some aspects of meaning cannot be visually represented or are better described with language. Thus, vision and language provide complementary information that, properly combined, can potentially yield more complete concept representations. Recently, state-of-the-art distributional semantic models and convolutional neural networks have achieved great success in representing linguistic and visual knowledge respectively. In this paper, we compare both, visual and linguistic representations in their ability to capture different types of fine-grain semantic knowledge{—}or attributes{—}of concepts. Humans often describe objects using attributes, that is, properties such as shape, color or functionality, which often transcend the linguistic and visual modalities. In our setting, we evaluate how well attributes can be predicted by using the unimodal representations as inputs. We are interested in first, finding out whether attributes are generally better captured by either the vision or by the language modality; and second, if none of them is clearly superior (as we hypothesize), what type of attributes or semantic knowledge are better encoded from each modality. Ultimately, our study sheds light on the potential of combining visual and textual representations.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1264/
PDF https://www.aclweb.org/anthology/C16-1264
PWC https://paperswithcode.com/paper/is-an-image-worth-more-than-a-thousand-words
Repo
Framework

Topic-Informed Neural Machine Translation

Title Topic-Informed Neural Machine Translation
Authors Jian Zhang, Liangyou Li, Andy Way, Qun Liu
Abstract In recent years, neural machine translation (NMT) has demonstrated state-of-the-art machine translation (MT) performance. It is a new approach to MT, which tries to learn a set of parameters to maximize the conditional probability of target sentences given source sentences. In this paper, we present a novel approach to improve the translation performance in NMT by conveying topic knowledge during translation. The proposed topic-informed NMT can increase the likelihood of selecting words from the same topic and domain for translation. Experimentally, we demonstrate that topic-informed NMT can achieve a 1.15 (3.3{%} relative) and 1.67 (5.4{%} relative) absolute improvement in BLEU score on the Chinese-to-English language pair using NIST 2004 and 2005 test sets, respectively, compared to NMT without topic information.
Tasks Machine Translation, Topic Models
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1170/
PDF https://www.aclweb.org/anthology/C16-1170
PWC https://paperswithcode.com/paper/topic-informed-neural-machine-translation
Repo
Framework

Off-topic Response Detection for Spontaneous Spoken English Assessment

Title Off-topic Response Detection for Spontaneous Spoken English Assessment
Authors Andrey Malinin, Rogier Van Dalen, Kate Knill, Yu Wang, Mark Gales
Abstract
Tasks Semantic Textual Similarity, Speech Recognition
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1102/
PDF https://www.aclweb.org/anthology/P16-1102
PWC https://paperswithcode.com/paper/off-topic-response-detection-for-spontaneous
Repo
Framework

A Meaning-based English Math Word Problem Solver with Understanding, Reasoning and Explanation

Title A Meaning-based English Math Word Problem Solver with Understanding, Reasoning and Explanation
Authors Chao-Chun Liang, Shih-Hong Tsai, Ting-Yun Chang, Yi-Chung Lin, Keh-Yih Su
Abstract This paper presents a meaning-based statistical math word problem (MWP) solver with understanding, reasoning and explanation. It comprises a web user interface and pipelined modules for analysing the text, transforming both body and question parts into their logic forms, and then performing inference on them. The associated context of each quantity is represented with proposed role-tags (e.g., nsubj, verb, etc.), which provides the flexibility for annotating the extracted math quantity with its associated syntactic and semantic information (which specifies the physical meaning of that quantity). Those role-tags are then used to identify the desired operands and filter out irrelevant quantities (so that the answer can be obtained precisely). Since the physical meaning of each quantity is explicitly represented with those role-tags and used in the inference process, the proposed approach could explain how the answer is obtained in a human comprehensible way.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2032/
PDF https://www.aclweb.org/anthology/C16-2032
PWC https://paperswithcode.com/paper/a-meaning-based-english-math-word-problem
Repo
Framework
comments powered by Disqus