July 26, 2019

1618 words 8 mins read

Paper Group NANR 25

Paper Group NANR 25

Sentence Embedding for Neural Machine Translation Domain Adaptation. Semantics-Enhanced Task-Oriented Dialogue Translation: A Case Study on Hotel Booking. ADoCS: Automatic Designer of Conference Schedules. Efficient Benchmarking of NLP APIs using Multi-armed Bandits. Goal-Oriented Design for Ethical Machine Learning and NLP. TDB 1.1: Extensions on …

Sentence Embedding for Neural Machine Translation Domain Adaptation

Title Sentence Embedding for Neural Machine Translation Domain Adaptation
Authors Rui Wang, Andrew Finch, Masao Utiyama, Eiichiro Sumita
Abstract Although new corpora are becoming increasingly available for machine translation, only those that belong to the same or similar domains are typically able to improve translation performance. Recently Neural Machine Translation (NMT) has become prominent in the field. However, most of the existing domain adaptation methods only focus on phrase-based machine translation. In this paper, we exploit the NMT{'}s internal embedding of the source sentence and use the sentence embedding similarity to select the sentences which are close to in-domain data. The empirical adaptation results on the IWSLT English-French and NIST Chinese-English tasks show that the proposed methods can substantially improve NMT performance by 2.4-9.0 BLEU points, outperforming the existing state-of-the-art baseline by 2.3-4.5 BLEU points.
Tasks Domain Adaptation, Language Modelling, Machine Translation, Sentence Embedding
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-2089/
PDF https://www.aclweb.org/anthology/P17-2089
PWC https://paperswithcode.com/paper/sentence-embedding-for-neural-machine
Repo
Framework

Semantics-Enhanced Task-Oriented Dialogue Translation: A Case Study on Hotel Booking

Title Semantics-Enhanced Task-Oriented Dialogue Translation: A Case Study on Hotel Booking
Authors Longyue Wang, Jinhua Du, Liangyou Li, Zhaopeng Tu, Andy Way, Qun Liu
Abstract We showcase TODAY, a semantics-enhanced task-oriented dialogue translation system, whose novelties are: (i) task-oriented named entity (NE) definition and a hybrid strategy for NE recognition and translation; and (ii) a novel grounded semantic method for dialogue understanding and task-order management. TODAY is a case-study demo which can efficiently and accurately assist customers and agents in different languages to reach an agreement in a dialogue for the hotel booking.
Tasks Dialogue Understanding, Machine Translation, Named Entity Recognition
Published 2017-11-01
URL https://www.aclweb.org/anthology/I17-3009/
PDF https://www.aclweb.org/anthology/I17-3009
PWC https://paperswithcode.com/paper/semantics-enhanced-task-oriented-dialogue
Repo
Framework

ADoCS: Automatic Designer of Conference Schedules

Title ADoCS: Automatic Designer of Conference Schedules
Authors Diego Fern Vallejo Huanga, o, Paulina Adriana Morillo Alc{'\i}var, C{`e}sar Ferri Ram{'\i}rez
Abstract Distributing papers into sessions in scientific conferences is a task consisting in grouping papers with common topics and considering the size restrictions imposed by the conference schedule. This problem can be seen as a semi-supervised clustering of scientific papers based on their features. This paper presents a web tool called ADoCS that solves the problem of configuring conference schedules by an automatic clustering of articles by similarity using a new algorithm considering size constraints.
Tasks Information Retrieval, Tokenization
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-3011/
PDF https://www.aclweb.org/anthology/E17-3011
PWC https://paperswithcode.com/paper/adocs-automatic-designer-of-conference
Repo
Framework

Efficient Benchmarking of NLP APIs using Multi-armed Bandits

Title Efficient Benchmarking of NLP APIs using Multi-armed Bandits
Authors Gholamreza Haffari, Tuan Dung Tran, Mark Carman
Abstract Comparing NLP systems to select the best one for a task of interest, such as named entity recognition, is critical for practitioners and researchers. A rigorous approach involves setting up a hypothesis testing scenario using the performance of the systems on query documents. However, often the hypothesis testing approach needs to send a lot of document queries to the systems, which can be problematic. In this paper, we present an effective alternative based on the multi-armed bandit (MAB). We propose a hierarchical generative model to represent the uncertainty in the performance measures of the competing systems, to be used by Thompson Sampling to solve the resulting MAB. Experimental results on both synthetic and real data show that our approach requires significantly fewer queries compared to the standard benchmarking technique to identify the best system according to F-measure.
Tasks Multi-Armed Bandits, Named Entity Recognition
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-1039/
PDF https://www.aclweb.org/anthology/E17-1039
PWC https://paperswithcode.com/paper/efficient-benchmarking-of-nlp-apis-using
Repo
Framework

Goal-Oriented Design for Ethical Machine Learning and NLP

Title Goal-Oriented Design for Ethical Machine Learning and NLP
Authors Tyler Schnoebelen
Abstract The argument made in this paper is that to act ethically in machine learning and NLP requires focusing on goals. NLP projects are often classificatory systems that deal with human subjects, which means that goals from people affected by the systems should be included. The paper takes as its core example a model that detects criminality, showing the problems of training data, categories, and outcomes. The paper is oriented to the kinds of critiques on power and the reproduction of inequality that are found in social theory, but it also includes concrete suggestions on how to put goal-oriented design into practice.
Tasks
Published 2017-04-01
URL https://www.aclweb.org/anthology/W17-1611/
PDF https://www.aclweb.org/anthology/W17-1611
PWC https://paperswithcode.com/paper/goal-oriented-design-for-ethical-machine
Repo
Framework

TDB 1.1: Extensions on Turkish Discourse Bank

Title TDB 1.1: Extensions on Turkish Discourse Bank
Authors Deniz Zeyrek, Murathan Kurfal{\i}
Abstract This paper presents the recent developments on Turkish Discourse Bank (TDB). First, the resource is summarized and an evaluation is presented. Then, TDB 1.1, i.e. enrichments on 10{%} of the corpus are described (namely, senses for explicit discourse connectives, and new annotations for three discourse relation types - implicit relations, entity relations and alternative lexicalizations). The method of annotation is explained and the data are evaluated.
Tasks
Published 2017-04-01
URL https://www.aclweb.org/anthology/W17-0809/
PDF https://www.aclweb.org/anthology/W17-0809
PWC https://paperswithcode.com/paper/tdb-11-extensions-on-turkish-discourse-bank
Repo
Framework

Action Centered Contextual Bandits

Title Action Centered Contextual Bandits
Authors Kristjan Greenewald, Ambuj Tewari, Susan Murphy, Predag Klasnja
Abstract Contextual bandits have become popular as they offer a middle ground between very simple approaches based on multi-armed bandits and very complex approaches using the full power of reinforcement learning. They have demonstrated success in web applications and have a rich body of associated theoretical guarantees. Linear models are well understood theoretically and preferred by practitioners because they are not only easily interpretable but also simple to implement and debug. Furthermore, if the linear model is true, we get very strong performance guarantees. Unfortunately, in emerging applications in mobile health, the time-invariant linear model assumption is untenable. We provide an extension of the linear model for contextual bandits that has two parts: baseline reward and treatment effect. We allow the former to be complex but keep the latter simple. We argue that this model is plausible for mobile health applications. At the same time, it leads to algorithms with strong performance guarantees as in the linear model setting, while still allowing for complex nonlinear baseline modeling. Our theory is supported by experiments on data gathered in a recently concluded mobile health study.
Tasks Multi-Armed Bandits
Published 2017-12-01
URL http://papers.nips.cc/paper/7179-action-centered-contextual-bandits
PDF http://papers.nips.cc/paper/7179-action-centered-contextual-bandits.pdf
PWC https://paperswithcode.com/paper/action-centered-contextual-bandits
Repo
Framework

End-to-End Non-Factoid Question Answering with an Interactive Visualization of Neural Attention Weights

Title End-to-End Non-Factoid Question Answering with an Interactive Visualization of Neural Attention Weights
Authors Andreas R{"u}ckl{'e}, Iryna Gurevych
Abstract
Tasks Answer Selection, Machine Translation, Question Answering
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-4004/
PDF https://www.aclweb.org/anthology/P17-4004
PWC https://paperswithcode.com/paper/end-to-end-non-factoid-question-answering
Repo
Framework

Handling Cold-Start Problem in Review Spam Detection by Jointly Embedding Texts and Behaviors

Title Handling Cold-Start Problem in Review Spam Detection by Jointly Embedding Texts and Behaviors
Authors Xuepeng Wang, Kang Liu, Jun Zhao
Abstract Solving cold-start problem in review spam detection is an urgent and significant task. It can help the on-line review websites to relieve the damage of spammers in time, but has never been investigated by previous work. This paper proposes a novel neural network model to detect review spam for cold-start problem, by learning to represent the new reviewers{'} review with jointly embedded textual and behavioral information. Experimental results prove the proposed model achieves an effective performance and possesses preferable domain-adaptability. It is also applicable to a large scale dataset in an unsupervised way.
Tasks
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-1034/
PDF https://www.aclweb.org/anthology/P17-1034
PWC https://paperswithcode.com/paper/handling-cold-start-problem-in-review-spam
Repo
Framework

Book Review: Biomedical Natural Language Processing by Kevin Bretonnel Cohen and Dina Demner-Fushman

Title Book Review: Biomedical Natural Language Processing by Kevin Bretonnel Cohen and Dina Demner-Fushman
Authors Jin-Dong Kim
Abstract
Tasks Named Entity Recognition, Relation Extraction
Published 2017-04-01
URL https://www.aclweb.org/anthology/J17-1007/
PDF https://www.aclweb.org/anthology/J17-1007
PWC https://paperswithcode.com/paper/book-review-biomedical-natural-language
Repo
Framework

Transliterated Mobile Keyboard Input via Weighted Finite-State Transducers

Title Transliterated Mobile Keyboard Input via Weighted Finite-State Transducers
Authors Lars Hellsten, Brian Roark, Prasoon Goyal, Cyril Allauzen, Fran{\c{c}}oise Beaufays, Tom Ouyang, Michael Riley, David Rybach
Abstract
Tasks Language Modelling, Optical Character Recognition, Speech Recognition, Transliteration
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-4002/
PDF https://www.aclweb.org/anthology/W17-4002
PWC https://paperswithcode.com/paper/transliterated-mobile-keyboard-input-via
Repo
Framework

Integrated Learning of Dialog Strategies and Semantic Parsing

Title Integrated Learning of Dialog Strategies and Semantic Parsing
Authors Aishwarya Padmakumar, Jesse Thomason, Raymond J. Mooney
Abstract Natural language understanding and dialog management are two integral components of interactive dialog systems. Previous research has used machine learning techniques to individually optimize these components, with different forms of direct and indirect supervision. We present an approach to integrate the learning of both a dialog strategy using reinforcement learning, and a semantic parser for robust natural language understanding, using only natural dialog interaction for supervision. Experimental results on a simulated task of robot instruction demonstrate that joint learning of both components improves dialog performance over learning either of these components alone.
Tasks Semantic Parsing
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-1052/
PDF https://www.aclweb.org/anthology/E17-1052
PWC https://paperswithcode.com/paper/integrated-learning-of-dialog-strategies-and
Repo
Framework

Data Augmentation for Morphological Reinflection

Title Data Augmentation for Morphological Reinflection
Authors Miikka Silfverberg, Adam Wiemerslage, Ling Liu, Lingshuang Jack Mao
Abstract
Tasks Data Augmentation
Published 2017-08-01
URL https://www.aclweb.org/anthology/K17-2010/
PDF https://www.aclweb.org/anthology/K17-2010
PWC https://paperswithcode.com/paper/data-augmentation-for-morphological
Repo
Framework

The LMU System for the CoNLL-SIGMORPHON 2017 Shared Task on Universal Morphological Reinflection

Title The LMU System for the CoNLL-SIGMORPHON 2017 Shared Task on Universal Morphological Reinflection
Authors Katharina Kann, Hinrich Sch{"u}tze
Abstract
Tasks Data Augmentation, Domain Adaptation, Morphological Analysis
Published 2017-08-01
URL https://www.aclweb.org/anthology/K17-2003/
PDF https://www.aclweb.org/anthology/K17-2003
PWC https://paperswithcode.com/paper/the-lmu-system-for-the-conll-sigmorphon-2017
Repo
Framework

Linguistically Rich Vector Representations of Supertags for TAG Parsing

Title Linguistically Rich Vector Representations of Supertags for TAG Parsing
Authors Dan Friedman, Jungo Kasai, R. Thomas McCoy, Robert Frank, Forrest Davis, Owen Rambow
Abstract
Tasks CCG Supertagging
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-6213/
PDF https://www.aclweb.org/anthology/W17-6213
PWC https://paperswithcode.com/paper/linguistically-rich-vector-representations-of
Repo
Framework
comments powered by Disqus