May 5, 2019

2266 words 11 mins read

Paper Group NANR 149

ULISBOA at SemEval-2016 Task 12: Extraction of temporal expressions, clinical events and relations using IBEnt. Contextual term equivalent search using domain-driven disambiguation. Exploring Distributional Representations and Machine Translation for Aspect-based Cross-lingual Sentiment Classification.. USFD at SemEval-2016 Task 6: Any-Target Stanc …

ULISBOA at SemEval-2016 Task 12: Extraction of temporal expressions, clinical events and relations using IBEnt


Title	ULISBOA at SemEval-2016 Task 12: Extraction of temporal expressions, clinical events and relations using IBEnt
Authors	Marcia Barros, Andre Lamurias, Gon{\c{c}}alo Figueiro, Marta Antunes, Joana Teixeira, Alex Pinheiro, re, Francisco M. Couto
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1196/
PDF	https://www.aclweb.org/anthology/S16-1196
PWC	https://paperswithcode.com/paper/ulisboa-at-semeval-2016-task-12-extraction-of
Repo
Framework

Contextual term equivalent search using domain-driven disambiguation


Title	Contextual term equivalent search using domain-driven disambiguation
Authors	Caroline Barri{`e}re, Pierre Andr{'e} M{'e}nard, Daphn{'e}e Azoulay
Abstract	This article presents a domain-driven algorithm for the task of term sense disambiguation (TSD). TSD aims at automatically choosing which term record from a term bank best represents the meaning of a term occurring in a particular context. In a translation environment, finding the contextually appropriate term record is necessary to access the proper equivalent to be used in the target language text. The term bank TERMIUM Plus, recently published as an open access repository, is chosen as a domain-rich resource for testing our TSD algorithm, using English and French as source and target languages. We devise an experiment using over 1300 English terms found in scientific articles, and show that our domain-driven TSD algorithm is able to bring the best term record, and therefore the best French equivalent, at the average rank of 1.69 compared to a baseline random rank of 3.51.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4704/
PDF	https://www.aclweb.org/anthology/W16-4704
PWC	https://paperswithcode.com/paper/contextual-term-equivalent-search-using
Repo
Framework

Exploring Distributional Representations and Machine Translation for Aspect-based Cross-lingual Sentiment Classification.


Title	Exploring Distributional Representations and Machine Translation for Aspect-based Cross-lingual Sentiment Classification.
Authors	Jeremy Barnes, Patrik Lambert, Toni Badia
Abstract	Cross-lingual sentiment classification (CLSC) seeks to use resources from a source language in order to detect sentiment and classify text in a target language. Almost all research into CLSC has been carried out at sentence and document level, although this level of granularity is often less useful. This paper explores methods for performing aspect-based cross-lingual sentiment classification (aspect-based CLSC) for under-resourced languages. Given the limited nature of parallel data for many languages, we would like to make the most of this resource for our task. We compare zero-shot learning, bilingual word embeddings, stacked denoising autoencoder representations and machine translation techniques for aspect-based CLSC. Each of these approaches requires differing amounts of parallel data. We show that models based on distributed semantics can achieve comparable results to machine translation on aspect-based CLSC and give an analysis of the errors found for each method.
Tasks	Denoising, Machine Translation, Sentiment Analysis, Word Embeddings, Zero-Shot Learning
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1152/
PDF	https://www.aclweb.org/anthology/C16-1152
PWC	https://paperswithcode.com/paper/exploring-distributional-representations-and
Repo
Framework

USFD at SemEval-2016 Task 6: Any-Target Stance Detection on Twitter with Autoencoders


Title	USFD at SemEval-2016 Task 6: Any-Target Stance Detection on Twitter with Autoencoders
Authors	Isabelle Augenstein, Andreas Vlachos, Kalina Bontcheva
Abstract
Tasks	Natural Language Inference, Sentiment Analysis, Stance Detection
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1063/
PDF	https://www.aclweb.org/anthology/S16-1063
PWC	https://paperswithcode.com/paper/usfd-at-semeval-2016-task-6-any-target-stance
Repo
Framework

Lightly Supervised Quality Estimation


Title	Lightly Supervised Quality Estimation
Authors	Matthias Sperber, Graham Neubig, Jan Niehues, Sebastian St{"u}ker, Alex Waibel
Abstract	Evaluating the quality of output from language processing systems such as machine translation or speech recognition is an essential step in ensuring that they are sufficient for practical use. However, depending on the practical requirements, evaluation approaches can differ strongly. Often, reference-based evaluation measures (such as BLEU or WER) are appealing because they are cheap and allow rapid quantitative comparison. On the other hand, practitioners often focus on manual evaluation because they must deal with frequently changing domains and quality standards requested by customers, for which reference-based evaluation is insufficient or not possible due to missing in-domain reference data (Harris et al., 2016). In this paper, we attempt to bridge this gap by proposing a framework for lightly supervised quality estimation. We collect manually annotated scores for a small number of segments in a test corpus or document, and combine them with automatically predicted quality scores for the remaining segments to predict an overall quality estimate. An evaluation shows that our framework estimates quality more reliably than using fully automatic quality estimation approaches, while keeping annotation effort low by not requiring full references to be available for the particular domain.
Tasks	Machine Translation, Speech Recognition
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1292/
PDF	https://www.aclweb.org/anthology/C16-1292
PWC	https://paperswithcode.com/paper/lightly-supervised-quality-estimation
Repo
Framework

Predicting the Evocation Relation between Lexicalized Concepts


Title	Predicting the Evocation Relation between Lexicalized Concepts
Authors	Yoshihiko Hayashi
Abstract	Evocation is a directed yet weighted semantic relationship between lexicalized concepts. Although evocation relations are considered potentially useful in several semantic NLP tasks, the prediction of the evocation relation between an arbitrary pair of concepts remains difficult, since evocation relationships cover a broader range of semantic relations rooted in human perception and experience. This paper presents a supervised learning approach to predict the strength (by regression) and to determine the directionality (by classification) of the evocation relation that might hold between a pair of lexicalized concepts. Empirical results that were obtained by investigating useful features are shown, indicating that a combination of the proposed features largely outperformed individual baselines, and also suggesting that semantic relational vectors computed from existing semantic vectors for lexicalized concepts were indeed effective for both the prediction of strength and the determination of directionality.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1156/
PDF	https://www.aclweb.org/anthology/C16-1156
PWC	https://paperswithcode.com/paper/predicting-the-evocation-relation-between
Repo
Framework

Using Argument Mining to Assess the Argumentation Quality of Essays


Title	Using Argument Mining to Assess the Argumentation Quality of Essays
Authors	Henning Wachsmuth, Khalid Al-Khatib, Benno Stein
Abstract	Argument mining aims to determine the argumentative structure of texts. Although it is said to be crucial for future applications such as writing support systems, the benefit of its output has rarely been evaluated. This paper puts the analysis of the output into the focus. In particular, we investigate to what extent the mined structure can be leveraged to assess the argumentation quality of persuasive essays. We find insightful statistical patterns in the structure of essays. From these, we derive novel features that we evaluate in four argumentation-related essay scoring tasks. Our results reveal the benefit of argument mining for assessing argumentation quality. Among others, we improve the state of the art in scoring an essay{'}s organization and its argument strength.
Tasks	Argument Mining
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1158/
PDF	https://www.aclweb.org/anthology/C16-1158
PWC	https://paperswithcode.com/paper/using-argument-mining-to-assess-the
Repo
Framework


Title	ltl.uni-due at SemEval-2016 Task 6: Stance Detection in Social Media Using Stacked Classifiers
Authors	Michael Wojatzki, Torsten Zesch
Abstract
Tasks	Stance Detection
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1069/
PDF	https://www.aclweb.org/anthology/S16-1069
PWC	https://paperswithcode.com/paper/ltluni-due-at-semeval-2016-task-6-stance
Repo
Framework

``Beware the Jabberwock, dear reader!’’ Testing the distributional reality of construction semantics


Title	``Beware the Jabberwock, dear reader!’’ Testing the distributional reality of construction semantics \|
Authors	Gianluca Lebani, Aless Lenci, ro
Abstract	Notwithstanding the success of the notion of construction, the computational tradition still lacks a way to represent the semantic content of these linguistic entities. Here we present a simple corpus-based model implementing the idea that the meaning of a syntactic construction is intimately related to the semantics of its typical verbs. It is a two-step process, that starts by identifying the typical verbs occurring with a given syntactic construction and building their distributional vectors. We then calculated the weighted centroid of these vectors in order to derive the distributional signature of a construction. In order to assess the goodness of our approach, we replicated the priming effect described by Johnson and Golberg (2013) as a function of the semantic distance between a construction and its prototypical verbs. Additional support for our view comes from a regression analysis showing that our distributional information can be used to model behavioral data collected with a crowdsourced elicitation experiment.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-5302/
PDF	https://www.aclweb.org/anthology/W16-5302
PWC	https://paperswithcode.com/paper/beware-the-jabberwock-dear-reader-testing-the
Repo
Framework

An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation


Title	An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation
Authors	Peter Viszlay, J{'a}n Sta{\v{s}}, Tom{'a}{\v{s}} Koct{'u}r, Martin Lojka, Jozef Juh{'a}r
Abstract	In this paper, we introduce an extension of our previously released TUKE-BNews-SK corpus based on a semi-automatic annotation scheme. It firstly relies on the automatic transcription of the BN data performed by our Slovak large vocabulary continuous speech recognition system. The generated hypotheses are then manually corrected and completed by trained human annotators. The corpus is composed of 25 hours of fully-annotated spontaneous and prepared speech. In addition, we have acquired 900 hours of another BN data, part of which we plan to annotate semi-automatically. We present a preliminary corpus evaluation that gives very promising results.
Tasks	Large Vocabulary Continuous Speech Recognition, Speech Recognition
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1743/
PDF	https://www.aclweb.org/anthology/L16-1743
PWC	https://paperswithcode.com/paper/an-extension-of-the-slovak-broadcast-news
Repo
Framework

Annotating Topic Development in Information Seeking Queries


Title	Annotating Topic Development in Information Seeking Queries
Authors	Marta Andersson, Adnan {"O}zt{"u}rel, Silvia Pareti
Abstract	This paper contributes to the limited body of empirical research in the domain of discourse structure of information seeking queries. We describe the development of an annotation schema for coding topic development in information seeking queries and the initial observations from a pilot sample of query sessions. The main idea that we explore is the relationship between constant and variable discourse entities and their role in tracking changes in the topic progression. We argue that the topicalized entities remain stable across development of the discourse and can be identified by a simple mechanism where anaphora resolution is a precursor. We also claim that a corpus annotated in this framework can be used as training data for dialogue management and computational semantics systems.
Tasks	Dialogue Management
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1277/
PDF	https://www.aclweb.org/anthology/L16-1277
PWC	https://paperswithcode.com/paper/annotating-topic-development-in-information
Repo
Framework

Is an Image Worth More than a Thousand Words? On the Fine-Grain Semantic Differences between Visual and Linguistic Representations


Title	Is an Image Worth More than a Thousand Words? On the Fine-Grain Semantic Differences between Visual and Linguistic Representations
Authors	Guillem Collell, Marie-Francine Moens
Abstract	Human concept representations are often grounded with visual information, yet some aspects of meaning cannot be visually represented or are better described with language. Thus, vision and language provide complementary information that, properly combined, can potentially yield more complete concept representations. Recently, state-of-the-art distributional semantic models and convolutional neural networks have achieved great success in representing linguistic and visual knowledge respectively. In this paper, we compare both, visual and linguistic representations in their ability to capture different types of fine-grain semantic knowledge{—}or attributes{—}of concepts. Humans often describe objects using attributes, that is, properties such as shape, color or functionality, which often transcend the linguistic and visual modalities. In our setting, we evaluate how well attributes can be predicted by using the unimodal representations as inputs. We are interested in first, finding out whether attributes are generally better captured by either the vision or by the language modality; and second, if none of them is clearly superior (as we hypothesize), what type of attributes or semantic knowledge are better encoded from each modality. Ultimately, our study sheds light on the potential of combining visual and textual representations.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1264/
PDF	https://www.aclweb.org/anthology/C16-1264
PWC	https://paperswithcode.com/paper/is-an-image-worth-more-than-a-thousand-words
Repo
Framework

Topic-Informed Neural Machine Translation


Title	Topic-Informed Neural Machine Translation
Authors	Jian Zhang, Liangyou Li, Andy Way, Qun Liu
Abstract	In recent years, neural machine translation (NMT) has demonstrated state-of-the-art machine translation (MT) performance. It is a new approach to MT, which tries to learn a set of parameters to maximize the conditional probability of target sentences given source sentences. In this paper, we present a novel approach to improve the translation performance in NMT by conveying topic knowledge during translation. The proposed topic-informed NMT can increase the likelihood of selecting words from the same topic and domain for translation. Experimentally, we demonstrate that topic-informed NMT can achieve a 1.15 (3.3{%} relative) and 1.67 (5.4{%} relative) absolute improvement in BLEU score on the Chinese-to-English language pair using NIST 2004 and 2005 test sets, respectively, compared to NMT without topic information.
Tasks	Machine Translation, Topic Models
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1170/
PDF	https://www.aclweb.org/anthology/C16-1170
PWC	https://paperswithcode.com/paper/topic-informed-neural-machine-translation
Repo
Framework

Off-topic Response Detection for Spontaneous Spoken English Assessment


Title	Off-topic Response Detection for Spontaneous Spoken English Assessment
Authors	Andrey Malinin, Rogier Van Dalen, Kate Knill, Yu Wang, Mark Gales
Abstract
Tasks	Semantic Textual Similarity, Speech Recognition
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1102/
PDF	https://www.aclweb.org/anthology/P16-1102
PWC	https://paperswithcode.com/paper/off-topic-response-detection-for-spontaneous
Repo
Framework

A Meaning-based English Math Word Problem Solver with Understanding, Reasoning and Explanation


Title	A Meaning-based English Math Word Problem Solver with Understanding, Reasoning and Explanation
Authors	Chao-Chun Liang, Shih-Hong Tsai, Ting-Yun Chang, Yi-Chung Lin, Keh-Yih Su
Abstract	This paper presents a meaning-based statistical math word problem (MWP) solver with understanding, reasoning and explanation. It comprises a web user interface and pipelined modules for analysing the text, transforming both body and question parts into their logic forms, and then performing inference on them. The associated context of each quantity is represented with proposed role-tags (e.g., nsubj, verb, etc.), which provides the flexibility for annotating the extracted math quantity with its associated syntactic and semantic information (which specifies the physical meaning of that quantity). Those role-tags are then used to identify the desired operands and filter out irrelevant quantities (so that the answer can be obtained precisely). Since the physical meaning of each quantity is explicitly represented with those role-tags and used in the inference process, the proposed approach could explain how the answer is obtained in a human comprehensible way.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2032/
PDF	https://www.aclweb.org/anthology/C16-2032
PWC	https://paperswithcode.com/paper/a-meaning-based-english-math-word-problem
Repo
Framework