October 15, 2019

2187 words 11 mins read

Paper Group NANR 173

Paper Group NANR 173

Pretraining Sentiment Classifiers with Unlabeled Dialog Data. Researching Less-Resourced Languages – the DigiSami Corpus. Efficient Global Point Cloud Registration by Matching Rotation Invariant Features Through Translation Search. Variance Regularized Counterfactual Risk Minimization via Variational Divergence Minimization. When does deep multi-t …

Pretraining Sentiment Classifiers with Unlabeled Dialog Data

Title Pretraining Sentiment Classifiers with Unlabeled Dialog Data
Authors Toru Shimizu, Nobuyuki Shimizu, Hayato Kobayashi
Abstract The huge cost of creating labeled training data is a common problem for supervised learning tasks such as sentiment classification. Recent studies showed that pretraining with unlabeled data via a language model can improve the performance of classification models. In this paper, we take the concept a step further by using a conditional language model, instead of a language model. Specifically, we address a sentiment classification task for a tweet analysis service as a case study and propose a pretraining strategy with unlabeled dialog data (tweet-reply pairs) via an encoder-decoder model. Experimental results show that our strategy can improve the performance of sentiment classifiers and outperform several state-of-the-art strategies including language model pretraining.
Tasks Language Modelling, Sentiment Analysis
Published 2018-07-01
URL https://www.aclweb.org/anthology/P18-2121/
PDF https://www.aclweb.org/anthology/P18-2121
PWC https://paperswithcode.com/paper/pretraining-sentiment-classifiers-with
Repo
Framework

Researching Less-Resourced Languages – the DigiSami Corpus

Title Researching Less-Resourced Languages – the DigiSami Corpus
Authors Kristiina Jokinen
Abstract
Tasks
Published 2018-05-01
URL https://www.aclweb.org/anthology/L18-1534/
PDF https://www.aclweb.org/anthology/L18-1534
PWC https://paperswithcode.com/paper/researching-less-resourced-languages-a-the
Repo
Framework
Title Efficient Global Point Cloud Registration by Matching Rotation Invariant Features Through Translation Search
Authors Yinlong Liu, Chen Wang, Zhijian Song, Manning Wang
Abstract Three-dimensional rigid point cloud registration has many applications in computer vision and robotics. Local methods tend to fail, causing global methods to be needed, when the relative transformation is large or the overlap ratio is small. Most existing global methods utilize BnB optimization over the 6D parameter space of SE(3). Such methods are usually very slow because the time complexity of BnB optimization is exponential in the dimensionality of the parameter space. In this paper, we decouple the optimization of translation and rotation, and we propose a fast BnB algorithm to globally optimize the 3D translation parameter first. The optimal rotation is then calculated by utilizing the global optimal translation found by the BnB algorithm. The separate optimization of translation and rotation is realized by using a newly proposed rotation invariant feature. Experiments on challenging data sets demonstrate that the proposed method outperforms state-of-the-art global methods in terms of both speed and accuracy.
Tasks Point Cloud Registration
Published 2018-09-01
URL http://openaccess.thecvf.com/content_ECCV_2018/html/Yinlong_Liu_Efficient_Global_Point_ECCV_2018_paper.html
PDF http://openaccess.thecvf.com/content_ECCV_2018/papers/Yinlong_Liu_Efficient_Global_Point_ECCV_2018_paper.pdf
PWC https://paperswithcode.com/paper/efficient-global-point-cloud-registration-by
Repo
Framework

Variance Regularized Counterfactual Risk Minimization via Variational Divergence Minimization

Title Variance Regularized Counterfactual Risk Minimization via Variational Divergence Minimization
Authors Hang Wu
Abstract Off-policy learning, the task of evaluating and improving policies using historic data collected from a logging policy, is important because on-policy evaluation is usually expensive and has adverse impacts. One of the major challenge of off-policy learning is to derive counterfactual estimators that also has low variance and thus low generalization error. In this work, inspired by learning bounds for importance sampling problems, we present a new counterfactual learning principle for off-policy learning with bandit feedbacks.Our method regularizes the generalization error by minimizing the distribution divergence between the logging policy and the new policy, and removes the need for iterating through all training samples to compute sample variance regularization in prior work. With neural network policies, our end-to-end training algorithms using variational divergence minimization showed significant improvement over conventional baseline algorithms and is also consistent with our theoretical results.
Tasks
Published 2018-01-01
URL https://openreview.net/forum?id=SyPMT6gAb
PDF https://openreview.net/pdf?id=SyPMT6gAb
PWC https://paperswithcode.com/paper/variance-regularized-counterfactual-risk
Repo
Framework
Title When does deep multi-task learning work for loosely related document classification tasks?
Authors Emma Kerinec, Chlo{'e} Braud, Anders S{\o}gaard
Abstract This work aims to contribute to our understanding of \textit{when} multi-task learning through parameter sharing in deep neural networks leads to improvements over single-task learning. We focus on the setting of learning from \textit{loosely related} tasks, for which no theoretical guarantees exist. We therefore approach the question empirically, studying which properties of datasets and single-task learning characteristics correlate with improvements from multi-task learning. We are the first to study this in a text classification setting and across more than 500 different task pairs.
Tasks Document Classification, Machine Translation, Multi-Task Learning, Sentence Compression, Sentiment Analysis, Text Classification
Published 2018-11-01
URL https://www.aclweb.org/anthology/W18-5401/
PDF https://www.aclweb.org/anthology/W18-5401
PWC https://paperswithcode.com/paper/when-does-deep-multi-task-learning-work-for
Repo
Framework

DocUNet: Document Image Unwarping via a Stacked U-Net

Title DocUNet: Document Image Unwarping via a Stacked U-Net
Authors Ke Ma, Zhixin Shu, Xue Bai, Jue Wang, Dimitris Samaras
Abstract Capturing document images is a common way for digitizing and recording physical documents due to the ubiquitousness of mobile cameras. To make text recognition easier, it is often desirable to digitally flatten a document image when the physical document sheet is folded or curved. In this paper, we develop the first learning-based method to achieve this goal. We propose a stacked U-Net with intermediate supervision to directly predict the forward mapping from a distorted image to its rectified version. Because large-scale real-world data with ground truth deformation is difficult to obtain, we create a synthetic dataset with approximately 100 thousand images by warping non-distorted document images. The network is trained on this dataset with various data augmentations to improve its generalization ability. We further create a comprehensive benchmark that covers various real-world conditions. We evaluate the proposed model quantitatively and qualitatively on the proposed benchmark, and compare it with previous non-learning-based methods.
Tasks
Published 2018-06-01
URL http://openaccess.thecvf.com/content_cvpr_2018/html/Ma_DocUNet_Document_Image_CVPR_2018_paper.html
PDF http://openaccess.thecvf.com/content_cvpr_2018/papers/Ma_DocUNet_Document_Image_CVPR_2018_paper.pdf
PWC https://paperswithcode.com/paper/docunet-document-image-unwarping-via-a
Repo
Framework

Building Dialogue Structure from Discourse Tree of a Question

Title Building Dialogue Structure from Discourse Tree of a Question
Authors Boris Galitsky, Dmitry Ilvovsky
Abstract In this section we propose a reasoning-based approach to a dialogue management for a customer support chat bot. To build a dialogue scenario, we analyze the discourse tree (DT) of an initial query of a customer support dialogue that is frequently complex and multi-sentence. We then enforce rhetorical agreement between DT of the initial query and that of the answers, requests and responses. The chat bot finds answers, which are not only relevant by topic but also suitable for a given step of a conversation and match the question by style, communication means, experience level and other domain-independent attributes. We evaluate a performance of proposed algorithm in car repair domain and observe a 5 to 10{%} improvement for single and three-step dialogues respectively, in comparison with baseline approaches to dialogue management.
Tasks Dialogue Management
Published 2018-10-01
URL https://www.aclweb.org/anthology/W18-5703/
PDF https://www.aclweb.org/anthology/W18-5703
PWC https://paperswithcode.com/paper/building-dialogue-structure-from-discourse
Repo
Framework

A Methodology for Evaluating Interaction Strategies of Task-Oriented Conversational Agents

Title A Methodology for Evaluating Interaction Strategies of Task-Oriented Conversational Agents
Authors Marco Guerini, Sara Falcone, Bernardo Magnini
Abstract In task-oriented conversational agents, more attention has been usually devoted to assessing task effectiveness, rather than to \textit{how} the task is achieved. However, conversational agents are moving towards more complex and human-like interaction capabilities (e.g. the ability to use a formal/informal register, to show an empathetic behavior), for which standard evaluation methodologies may not suffice. In this paper, we provide a novel methodology to assess - in a completely controlled way - the impact on the quality of experience of agent{'}s interaction strategies. The methodology is based on a within subject design, where two slightly different transcripts of the same interaction with a conversational agent are presented to the user. Through a series of pilot experiments we prove that this methodology allows fast and cheap experimentation/evaluation, focusing on aspects that are overlooked by current methods.
Tasks
Published 2018-10-01
URL https://www.aclweb.org/anthology/W18-5704/
PDF https://www.aclweb.org/anthology/W18-5704
PWC https://paperswithcode.com/paper/a-methodology-for-evaluating-interaction
Repo
Framework

From Lexical Functional Grammar to Enhanced Universal Dependencies

Title From Lexical Functional Grammar to Enhanced Universal Dependencies
Authors Adam Przepi{'o}rkowski, Agnieszka Patejuk
Abstract This is a summary of an invited talk.
Tasks
Published 2018-08-01
URL https://www.aclweb.org/anthology/W18-4902/
PDF https://www.aclweb.org/anthology/W18-4902
PWC https://paperswithcode.com/paper/from-lexical-functional-grammar-to-enhanced
Repo
Framework

Iterative Back-Translation for Neural Machine Translation

Title Iterative Back-Translation for Neural Machine Translation
Authors Vu Cong Duy Hoang, Philipp Koehn, Gholamreza Haffari, Trevor Cohn
Abstract We present iterative back-translation, a method for generating increasingly better synthetic parallel data from monolingual data to train neural machine translation systems. Our proposed method is very simple yet effective and highly applicable in practice. We demonstrate improvements in neural machine translation quality in both high and low resourced scenarios, including the best reported BLEU scores for the WMT 2017 German↔English tasks.
Tasks Machine Translation
Published 2018-07-01
URL https://www.aclweb.org/anthology/W18-2703/
PDF https://www.aclweb.org/anthology/W18-2703
PWC https://paperswithcode.com/paper/iterative-back-translation-for-neural-machine
Repo
Framework

Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management

Title Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management
Authors Atsushi Saito
Abstract Learning from sparse and delayed reward is a central issue in reinforcement learning. In this paper, to tackle reward sparseness problem of task oriented dialogue management, we propose a curriculum based approach on the number of slots of user goals. This curriculum makes it possible to learn dialogue management for sets of user goals with large number of slots. We also propose a dialogue policy based on progressive neural networks whose modules with parameters are appended with previous parameters fixed as the curriculum proceeds, and this policy improves performances over the one with single set of parameters.
Tasks Dialogue Management, Information Retrieval, Question Answering, Text Generation
Published 2018-10-01
URL https://www.aclweb.org/anthology/W18-5707/
PDF https://www.aclweb.org/anthology/W18-5707
PWC https://paperswithcode.com/paper/curriculum-learning-based-on-reward
Repo
Framework

A theory on the absence of spurious solutions for nonconvex and nonsmooth optimization

Title A theory on the absence of spurious solutions for nonconvex and nonsmooth optimization
Authors Cedric Josz, Yi Ouyang, Richard Zhang, Javad Lavaei, Somayeh Sojoudi
Abstract We study the set of continuous functions that admit no spurious local optima (i.e. local minima that are not global minima) which we term global functions. They satisfy various powerful properties for analyzing nonconvex and nonsmooth optimization problems. For instance, they satisfy a theorem akin to the fundamental uniform limit theorem in the analysis regarding continuous functions. Global functions are also endowed with useful properties regarding the composition of functions and change of variables. Using these new results, we show that a class of non-differentiable nonconvex optimization problems arising in tensor decomposition applications are global functions. This is the first result concerning nonconvex methods for nonsmooth objective functions. Our result provides a theoretical guarantee for the widely-used $\ell_1$ norm to avoid outliers in nonconvex optimization.
Tasks
Published 2018-12-01
URL http://papers.nips.cc/paper/7511-a-theory-on-the-absence-of-spurious-solutions-for-nonconvex-and-nonsmooth-optimization
PDF http://papers.nips.cc/paper/7511-a-theory-on-the-absence-of-spurious-solutions-for-nonconvex-and-nonsmooth-optimization.pdf
PWC https://paperswithcode.com/paper/a-theory-on-the-absence-of-spurious-solutions
Repo
Framework

Uncalibrated Photometric Stereo Under Natural Illumination

Title Uncalibrated Photometric Stereo Under Natural Illumination
Authors Zhipeng Mo, Boxin Shi, Feng Lu, Sai-Kit Yeung, Yasuyuki Matsushita
Abstract This paper presents a photometric stereo method that works with unknown natural illuminations without any calibration object. To solve this challenging problem, we propose the use of an equivalent directional lighting model for small surface patches consisting of slowly varying normals, and solve each patch up to an arbitrary rotation ambiguity. Our method connects the resulting patches and unifies the local ambiguities to a global rotation one through angular distance propagation defined over the whole surface. After applying the integrability constraint, our final solution contains only a binary ambiguity, which could be easily removed. Experiments using both synthetic and real-world datasets show our method provides even comparable results to calibrated methods
Tasks Calibration
Published 2018-06-01
URL http://openaccess.thecvf.com/content_cvpr_2018/html/Mo_Uncalibrated_Photometric_Stereo_CVPR_2018_paper.html
PDF http://openaccess.thecvf.com/content_cvpr_2018/papers/Mo_Uncalibrated_Photometric_Stereo_CVPR_2018_paper.pdf
PWC https://paperswithcode.com/paper/uncalibrated-photometric-stereo-under-natural
Repo
Framework

Geometrically Coupled Monte Carlo Sampling

Title Geometrically Coupled Monte Carlo Sampling
Authors Mark Rowland, Krzysztof M. Choromanski, François Chalus, Aldo Pacchiano, Tamas Sarlos, Richard E. Turner, Adrian Weller
Abstract Monte Carlo sampling in high-dimensional, low-sample settings is important in many machine learning tasks. We improve current methods for sampling in Euclidean spaces by avoiding independence, and instead consider ways to couple samples. We show fundamental connections to optimal transport theory, leading to novel sampling algorithms, and providing new theoretical grounding for existing strategies. We compare our new strategies against prior methods for improving sample efficiency, including QMC, by studying discrepancy. We explore our findings empirically, and observe benefits of our sampling schemes for reinforcement learning and generative modelling.
Tasks
Published 2018-12-01
URL http://papers.nips.cc/paper/7304-geometrically-coupled-monte-carlo-sampling
PDF http://papers.nips.cc/paper/7304-geometrically-coupled-monte-carlo-sampling.pdf
PWC https://paperswithcode.com/paper/geometrically-coupled-monte-carlo-sampling
Repo
Framework

A Computational Approach to Feature Extraction for Identification of Suicidal Ideation in Tweets

Title A Computational Approach to Feature Extraction for Identification of Suicidal Ideation in Tweets
Authors Ramit Sawhney, Manch, Prachi a, Raj Singh, Swati Aggarwal
Abstract Technological advancements in the World Wide Web and social networks in particular coupled with an increase in social media usage has led to a positive correlation between the exhibition of Suicidal ideation on websites such as Twitter and cases of suicide. This paper proposes a novel supervised approach for detecting suicidal ideation in content on Twitter. A set of features is proposed for training both linear and ensemble classifiers over a dataset of manually annotated tweets. The performance of the proposed methodology is compared against four baselines that utilize varying approaches to validate its utility. The results are finally summarized by reflecting on the effect of the inclusion of the proposed features one by one for suicidal ideation detection.
Tasks
Published 2018-07-01
URL https://www.aclweb.org/anthology/P18-3013/
PDF https://www.aclweb.org/anthology/P18-3013
PWC https://paperswithcode.com/paper/a-computational-approach-to-feature
Repo
Framework
comments powered by Disqus