May 5, 2019

1741 words 9 mins read

Paper Group NANR 106

Paper Group NANR 106

Controlling the Voice of a Sentence in Japanese-to-English Neural Machine Translation. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Proceedings of the Workshop on Structured Prediction for NLP. Annotating Named Entities in Consumer Health Questions. Catching heuristics are optimal control policies. A Study …

Controlling the Voice of a Sentence in Japanese-to-English Neural Machine Translation

Title Controlling the Voice of a Sentence in Japanese-to-English Neural Machine Translation
Authors Hayahide Yamagishi, Shin Kanouchi, Takayuki Sato, Mamoru Komachi
Abstract In machine translation, we must consider the difference in expression between languages. For example, the active/passive voice may change in Japanese-English translation. The same verb in Japanese may be translated into different voices at each translation because the voice of a generated sentence cannot be determined using only the information of the Japanese sentence. Machine translation systems should consider the information structure to improve the coherence of the output by using several topicalization techniques such as passivization. Therefore, this paper reports on our attempt to control the voice of the sentence generated by an encoder-decoder model. To control the voice of the generated sentence, we added the voice information of the target sentence to the source sentence during the training. We then generated sentences with a specified voice by appending the voice information to the source sentence. We observed experimentally whether the voice could be controlled. The results showed that, we could control the voice of the generated sentence with 85.0{%} accuracy on average. In the evaluation of Japanese-English translation, we obtained a 0.73-point improvement in BLEU score by using gold voice labels.
Tasks Machine Translation, Text Summarization
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4620/
PDF https://www.aclweb.org/anthology/W16-4620
PWC https://paperswithcode.com/paper/controlling-the-voice-of-a-sentence-in
Repo
Framework

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing

Title Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
Authors
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1000/
PDF https://www.aclweb.org/anthology/D16-1000
PWC https://paperswithcode.com/paper/proceedings-of-the-2016-conference-on
Repo
Framework

Proceedings of the Workshop on Structured Prediction for NLP

Title Proceedings of the Workshop on Structured Prediction for NLP
Authors
Abstract
Tasks Structured Prediction
Published 2016-11-01
URL https://www.aclweb.org/anthology/W16-5900/
PDF https://www.aclweb.org/anthology/W16-5900
PWC https://paperswithcode.com/paper/proceedings-of-the-workshop-on-structured
Repo
Framework

Annotating Named Entities in Consumer Health Questions

Title Annotating Named Entities in Consumer Health Questions
Authors Halil Kilicoglu, Asma Ben Abacha, Yassine Mrabet, Kirk Roberts, Laritza Rodriguez, Sonya Shooshan, Dina Demner-Fushman
Abstract We describe a corpus of consumer health questions annotated with named entities. The corpus consists of 1548 de-identified questions about diseases and drugs, written in English. We defined 15 broad categories of biomedical named entities for annotation. A pilot annotation phase in which a small portion of the corpus was double-annotated by four annotators was followed by a main phase in which double annotation was carried out by six annotators, and a reconciliation phase in which all annotations were reconciled by an expert. We conducted the annotation in two modes, manual and assisted, to assess the effect of automatic pre-annotation and calculated inter-annotator agreement. We obtained moderate inter-annotator agreement; assisted annotation yielded slightly better agreement and fewer missed annotations than manual annotation. Due to complex nature of biomedical entities, we paid particular attention to nested entities for which we obtained slightly lower inter-annotator agreement, confirming that annotating nested entities is somewhat more challenging. To our knowledge, the corpus is the first of its kind for consumer health text and is publicly available.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1530/
PDF https://www.aclweb.org/anthology/L16-1530
PWC https://paperswithcode.com/paper/annotating-named-entities-in-consumer-health
Repo
Framework

Catching heuristics are optimal control policies

Title Catching heuristics are optimal control policies
Authors Boris Belousov, Gerhard Neumann, Constantin A. Rothkopf, Jan R. Peters
Abstract Two seemingly contradictory theories attempt to explain how humans move to intercept an airborne ball. One theory posits that humans predict the ball trajectory to optimally plan future actions; the other claims that, instead of performing such complicated computations, humans employ heuristics to reactively choose appropriate actions based on immediate visual feedback. In this paper, we show that interception strategies appearing to be heuristics can be understood as computational solutions to the optimal control problem faced by a ball-catching agent acting under uncertainty. Modeling catching as a continuous partially observable Markov decision process and employing stochastic optimal control theory, we discover that the four main heuristics described in the literature are optimal solutions if the catcher has sufficient time to continuously visually track the ball. Specifically, by varying model parameters such as noise, time to ground contact, and perceptual latency, we show that different strategies arise under different circumstances. The catcher’s policy switches between generating reactive and predictive behavior based on the ratio of system to observation noise and the ratio between reaction time and task duration. Thus, we provide a rational account of human ball-catching behavior and a unifying explanation for seemingly contradictory theories of target interception on the basis of stochastic optimal control.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6548-catching-heuristics-are-optimal-control-policies
PDF http://papers.nips.cc/paper/6548-catching-heuristics-are-optimal-control-policies.pdf
PWC https://paperswithcode.com/paper/catching-heuristics-are-optimal-control
Repo
Framework

A Study on the Interplay Between the Corpus Size and Parameters of a Distributional Model for Term Classification

Title A Study on the Interplay Between the Corpus Size and Parameters of a Distributional Model for Term Classification
Authors Behrang QasemiZadeh
Abstract We propose and evaluate a method for identifying co-hyponym lexical units in a terminological resource. The principles of term recognition and distributional semantics are combined to extract terms from a similar category of concept. Given a set of candidate terms, random projections are employed to represent them as low-dimensional vectors. These vectors are derived automatically from the frequency of the co-occurrences of the candidate terms and words that appear within windows of text in their proximity (context-windows). In a $k$-nearest neighbours framework, these vectors are classified using a small set of manually annotated terms which exemplify concept categories. We then investigate the interplay between the size of the corpus that is used for collecting the co-occurrences and a number of factors that play roles in the performance of the proposed method: the configuration of context-windows for collecting co-occurrences, the selection of neighbourhood size ($k$), and the choice of similarity metric.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4708/
PDF https://www.aclweb.org/anthology/W16-4708
PWC https://paperswithcode.com/paper/a-study-on-the-interplay-between-the-corpus
Repo
Framework

Non-sentential Question Resolution using Sequence to Sequence Learning

Title Non-sentential Question Resolution using Sequence to Sequence Learning
Authors Vineet Kumar, Sachindra Joshi
Abstract An interactive Question Answering (QA) system frequently encounters non-sentential (incomplete) questions. These non-sentential questions may not make sense to the system when a user asks them without the context of conversation. The system thus needs to take into account the conversation context to process the question. In this work, we present a recurrent neural network (RNN) based encoder decoder network that can generate a complete (intended) question, given an incomplete question and conversation context. RNN encoder decoder networks have been show to work well when trained on a parallel corpus with millions of sentences, however it is extremely hard to obtain conversation data of this magnitude. We therefore propose to decompose the original problem into two separate simplified problems where each problem focuses on an abstraction. Specifically, we train a semantic sequence model to learn semantic patterns, and a syntactic sequence model to learn linguistic patterns. We further combine syntactic and semantic sequence models to generate an ensemble model. Our model achieves a BLEU score of 30.15 as compared to 18.54 using a standard RNN encoder decoder model.
Tasks Question Answering
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1190/
PDF https://www.aclweb.org/anthology/C16-1190
PWC https://paperswithcode.com/paper/non-sentential-question-resolution-using
Repo
Framework

Proceedings of the Third Workshop on Computational Linguistics and Clinical Psychology

Title Proceedings of the Third Workshop on Computational Linguistics and Clinical Psychology
Authors
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0300/
PDF https://www.aclweb.org/anthology/W16-0300
PWC https://paperswithcode.com/paper/proceedings-of-the-third-workshop-on-4
Repo
Framework

Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers

Title Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Authors
Abstract
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1000/
PDF https://www.aclweb.org/anthology/C16-1000
PWC https://paperswithcode.com/paper/proceedings-of-coling-2016-the-26th-1
Repo
Framework

SV000gg at SemEval-2016 Task 11: Heavy Gauge Complex Word Identification with System Voting

Title SV000gg at SemEval-2016 Task 11: Heavy Gauge Complex Word Identification with System Voting
Authors Gustavo Paetzold, Lucia Specia
Abstract
Tasks Complex Word Identification, Lexical Simplification
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1149/
PDF https://www.aclweb.org/anthology/S16-1149
PWC https://paperswithcode.com/paper/sv000gg-at-semeval-2016-task-11-heavy-gauge
Repo
Framework

PLUJAGH at SemEval-2016 Task 11: Simple System for Complex Word Identification

Title PLUJAGH at SemEval-2016 Task 11: Simple System for Complex Word Identification
Authors Krzysztof Wr{'o}bel
Abstract
Tasks Complex Word Identification, Lexical Simplification, Word Sense Disambiguation
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1146/
PDF https://www.aclweb.org/anthology/S16-1146
PWC https://paperswithcode.com/paper/plujagh-at-semeval-2016-task-11-simple-system
Repo
Framework

Creating Annotated Dialogue Resources: Cross-domain Dialogue Act Classification

Title Creating Annotated Dialogue Resources: Cross-domain Dialogue Act Classification
Authors Dilafruz Amanova, Volha Petukhova, Dietrich Klakow
Abstract This paper describes a method to automatically create dialogue resources annotated with dialogue act information by reusing existing dialogue corpora. Numerous dialogue corpora are available for research purposes and many of them are annotated with dialogue act information that captures the intentions encoded in user utterances. Annotated dialogue resources, however, differ in various respects: data collection settings and modalities used, dialogue task domains and scenarios (if any) underlying the collection, number and roles of dialogue participants involved and dialogue act annotation schemes applied. The presented study encompasses three phases of data-driven investigation. We, first, assess the importance of various types of features and their combinations for effective cross-domain dialogue act classification. Second, we establish the best predictive model comparing various cross-corpora training settings. Finally, we specify models adaptation procedures and explore late fusion approaches to optimize the overall classification decision taking process. The proposed methodology accounts for empirically motivated and technically sound classification procedures that may reduce annotation and training costs significantly.
Tasks Dialogue Act Classification
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1017/
PDF https://www.aclweb.org/anthology/L16-1017
PWC https://paperswithcode.com/paper/creating-annotated-dialogue-resources-cross
Repo
Framework

DiegoLab16 at SemEval-2016 Task 4: Sentiment Analysis in Twitter using Centroids, Clusters, and Sentiment Lexicons

Title DiegoLab16 at SemEval-2016 Task 4: Sentiment Analysis in Twitter using Centroids, Clusters, and Sentiment Lexicons
Authors Abeed Sarker, Graciela Gonzalez
Abstract
Tasks Sentiment Analysis
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1031/
PDF https://www.aclweb.org/anthology/S16-1031
PWC https://paperswithcode.com/paper/diegolab16-at-semeval-2016-task-4-sentiment
Repo
Framework

UniPI at SemEval-2016 Task 4: Convolutional Neural Networks for Sentiment Classification

Title UniPI at SemEval-2016 Task 4: Convolutional Neural Networks for Sentiment Classification
Authors Giuseppe Attardi, Daniele Sartiano
Abstract
Tasks Sentiment Analysis, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1033/
PDF https://www.aclweb.org/anthology/S16-1033
PWC https://paperswithcode.com/paper/unipi-at-semeval-2016-task-4-convolutional
Repo
Framework

Know-Center at SemEval-2016 Task 5: Using Word Vectors with Typed Dependencies for Opinion Target Expression Extraction

Title Know-Center at SemEval-2016 Task 5: Using Word Vectors with Typed Dependencies for Opinion Target Expression Extraction
Authors Stefan Falk, Andi Rexha, Roman Kern
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1042/
PDF https://www.aclweb.org/anthology/S16-1042
PWC https://paperswithcode.com/paper/know-center-at-semeval-2016-task-5-using-word
Repo
Framework
comments powered by Disqus