July 26, 2019

2239 words 11 mins read

Paper Group NANR 184

Minimizing a Submodular Function from Samples. An Empirical Study on End-to-End Sentence Modelling. NLP for Precision Medicine. Consistent Classification of Translation Revisions: A Case Study of English-Japanese Student Translations. Adaptive Sampling Probabilities for Non-Smooth Optimization. Variable Importance Using Decision Trees. Neural Seque …

Minimizing a Submodular Function from Samples


Title	Minimizing a Submodular Function from Samples
Authors	Eric Balkanski, Yaron Singer
Abstract	In this paper we consider the problem of minimizing a submodular function from training data. Submodular functions can be efficiently minimized and are conse- quently heavily applied in machine learning. There are many cases, however, in which we do not know the function we aim to optimize, but rather have access to training data that is used to learn the function. In this paper we consider the question of whether submodular functions can be minimized in such cases. We show that even learnable submodular functions cannot be minimized within any non-trivial approximation when given access to polynomially-many samples. Specifically, we show that there is a class of submodular functions with range in [0, 1] such that, despite being PAC-learnable and minimizable in polynomial-time, no algorithm can obtain an approximation strictly better than 1/2 − o(1) using polynomially-many samples drawn from any distribution. Furthermore, we show that this bound is tight using a trivial algorithm that obtains an approximation of 1/2.
Tasks
Published	2017-12-01
URL	http://papers.nips.cc/paper/6683-minimizing-a-submodular-function-from-samples
PDF	http://papers.nips.cc/paper/6683-minimizing-a-submodular-function-from-samples.pdf
PWC	https://paperswithcode.com/paper/minimizing-a-submodular-function-from-samples
Repo
Framework

An Empirical Study on End-to-End Sentence Modelling


Title	An Empirical Study on End-to-End Sentence Modelling
Authors	Kurt Junshean Espinosa
Abstract
Tasks	Natural Language Inference
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-3021/
PDF	https://www.aclweb.org/anthology/P17-3021
PWC	https://paperswithcode.com/paper/an-empirical-study-on-end-to-end-sentence
Repo
Framework

NLP for Precision Medicine


Title	NLP for Precision Medicine
Authors	Hoifung Poon, Chris Quirk, Kristina Toutanova, Wen-tau Yih
Abstract	We will introduce precision medicine and showcase the vast opportunities for NLP in this burgeoning field with great societal impact. We will review pressing NLP problems, state-of-the art methods, and important applications, as well as datasets, medical resources, and practical issues. The tutorial will provide an accessible overview of biomedicine, and does not presume knowledge in biology or healthcare. The ultimate goal is to reduce the entry barrier for NLP researchers to contribute to this exciting domain.
Tasks	Decision Making, Entity Linking, Relation Extraction, Semantic Parsing
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-5001/
PDF	https://www.aclweb.org/anthology/P17-5001
PWC	https://paperswithcode.com/paper/nlp-for-precision-medicine
Repo
Framework

Consistent Classification of Translation Revisions: A Case Study of English-Japanese Student Translations


Title	Consistent Classification of Translation Revisions: A Case Study of English-Japanese Student Translations
Authors	Atsushi Fujita, Kikuko Tanabe, Chiho Toyoshima, Mayuka Yamamoto, Kyo Kageura, Anthony Hartley
Abstract	Consistency is a crucial requirement in text annotation. It is especially important in educational applications, as lack of consistency directly affects learners{'} motivation and learning performance. This paper presents a quality assessment scheme for English-to-Japanese translations produced by learner translators at university. We constructed a revision typology and a decision tree manually through an application of the OntoNotes method, i.e., an iteration of assessing learners{'} translations and hypothesizing the conditions for consistent decision making, as well as re-organizing the typology. Intrinsic evaluation of the created scheme confirmed its potential contribution to the consistent classification of identified erroneous text spans, achieving visibly higher Cohen{'}s kappa values, up to 0.831, than previous work. This paper also describes an application of our scheme to an English-to-Japanese translation exercise course for undergraduate students at a university in Japan.
Tasks	Decision Making
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-0807/
PDF	https://www.aclweb.org/anthology/W17-0807
PWC	https://paperswithcode.com/paper/consistent-classification-of-translation-1
Repo
Framework

Adaptive Sampling Probabilities for Non-Smooth Optimization


Title	Adaptive Sampling Probabilities for Non-Smooth Optimization
Authors	Hongseok Namkoong, Aman Sinha, Steve Yadlowsky, John C. Duchi
Abstract	Standard forms of coordinate and stochastic gradient methods do not adapt to structure in data; their good behavior under random sampling is predicated on uniformity in data. When gradients in certain blocks of features (for coordinate descent) or examples (for SGD) are larger than others, there is a natural structure that can be exploited for quicker convergence. Yet adaptive variants often suffer nontrivial computational overhead. We present a framework that discovers and leverages such structural properties at a low computational cost. We employ a bandit optimization procedure that “learns” probabilities for sampling coordinates or examples in (non-smooth) optimization problems, allowing us to guarantee performance close to that of the optimal stationary sampling distribution. When such structures exist, our algorithms achieve tighter convergence guarantees than their non-adaptive counterparts, and we complement our analysis with experiments on several datasets.
Tasks
Published	2017-08-01
URL	https://icml.cc/Conferences/2017/Schedule?showEvent=562
PDF	http://proceedings.mlr.press/v70/namkoong17a/namkoong17a.pdf
PWC	https://paperswithcode.com/paper/adaptive-sampling-probabilities-for-non
Repo
Framework

Variable Importance Using Decision Trees


Title	Variable Importance Using Decision Trees
Authors	Jalil Kazemitabar, Arash Amini, Adam Bloniarz, Ameet S. Talwalkar
Abstract	Decision trees and random forests are well established models that not only offer good predictive performance, but also provide rich feature importance information. While practitioners often employ variable importance methods that rely on this impurity-based information, these methods remain poorly characterized from a theoretical perspective. We provide novel insights into the performance of these methods by deriving finite sample performance guarantees in a high-dimensional setting under various modeling assumptions. We further demonstrate the effectiveness of these impurity-based methods via an extensive set of simulations.
Tasks	Feature Importance
Published	2017-12-01
URL	http://papers.nips.cc/paper/6646-variable-importance-using-decision-trees
PDF	http://papers.nips.cc/paper/6646-variable-importance-using-decision-trees.pdf
PWC	https://paperswithcode.com/paper/variable-importance-using-decision-trees
Repo
Framework

Neural Sequence-Labelling Models for Grammatical Error Correction


Title	Neural Sequence-Labelling Models for Grammatical Error Correction
Authors	Helen Yannakoudakis, Marek Rei, {\O}istein E. Andersen, Zheng Yuan
Abstract	We propose an approach to N-best list reranking using neural sequence-labelling models. We train a compositional model for error detection that calculates the probability of each token in a sentence being correct or incorrect, utilising the full sentence as context. Using the error detection model, we then re-rank the N best hypotheses generated by statistical machine translation systems. Our approach achieves state-of-the-art results on error correction for three different datasets, and it has the additional advantage of only using a small set of easily computed features that require no linguistic input.
Tasks	Grammatical Error Correction, Machine Translation
Published	2017-09-01
URL	https://www.aclweb.org/anthology/D17-1297/
PDF	https://www.aclweb.org/anthology/D17-1297
PWC	https://paperswithcode.com/paper/neural-sequence-labelling-models-for
Repo
Framework

Linguistically Regularized LSTM for Sentiment Classification


Title	Linguistically Regularized LSTM for Sentiment Classification
Authors	Qiao Qian, Minlie Huang, Jinhao Lei, Xiaoyan Zhu
Abstract	This paper deals with sentence-level sentiment classification. Though a variety of neural network models have been proposed recently, however, previous models either depend on expensive phrase-level annotation, most of which has remarkably degraded performance when trained with only sentence-level annotation; or do not fully employ linguistic resources (e.g., sentiment lexicons, negation words, intensity words). In this paper, we propose simple models trained with sentence-level annotation, but also attempt to model the linguistic role of sentiment lexicons, negation words, and intensity words. Results show that our models are able to capture the linguistic role of sentiment words, negation words, and intensity words in sentiment expression.
Tasks	Sentiment Analysis
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-1154/
PDF	https://www.aclweb.org/anthology/P17-1154
PWC	https://paperswithcode.com/paper/linguistically-regularized-lstm-for-sentiment
Repo
Framework

Neural Architectures for Multilingual Semantic Parsing


Title	Neural Architectures for Multilingual Semantic Parsing
Authors	Raymond Hendy Susanto, Wei Lu
Abstract	In this paper, we address semantic parsing in a multilingual context. We train one multilingual model that is capable of parsing natural language sentences from multiple different languages into their corresponding formal semantic representations. We extend an existing sequence-to-tree model to a multi-task learning framework which shares the decoder for generating semantic representations. We report evaluation results on the multilingual GeoQuery corpus and introduce a new multilingual version of the ATIS corpus.
Tasks	Machine Translation, Multi-Task Learning, Semantic Parsing
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-2007/
PDF	https://www.aclweb.org/anthology/P17-2007
PWC	https://paperswithcode.com/paper/neural-architectures-for-multilingual
Repo
Framework

Tutorial: Making Better Use of the Crowd


Title	Tutorial: Making Better Use of the Crowd
Authors	Jennifer Wortman Vaughan
Abstract	Over the last decade, crowdsourcing has been used to harness the power of human computation to solve tasks that are notoriously difficult to solve with computers alone, such as determining whether or not an image contains a tree, rating the relevance of a website, or verifying the phone number of a business. The natural language processing community was early to embrace crowdsourcing as a tool for quickly and inexpensively obtaining annotated data to train NLP systems. Once this data is collected, it can be handed off to algorithms that learn to perform basic NLP tasks such as translation or parsing. Usually this handoff is where interaction with the crowd ends. The crowd provides the data, but the ultimate goal is to eventually take humans out of the loop. Are there better ways to make use of the crowd?In this tutorial, I will begin with a showcase of innovative uses of crowdsourcing that go beyond data collection and annotation. I will discuss applications to natural language processing and machine learning, hybrid intelligence or {``}human in the loop{''} AI systems that leverage the complementary strengths of humans and machines in order to achieve more than either could achieve alone, and large scale studies of human behavior online. I will then spend the majority of the tutorial diving into recent research aimed at understanding who crowdworkers are, how they behave, and what this should teach us about best practices for interacting with the crowd. \|
Tasks
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-5006/
PDF	https://www.aclweb.org/anthology/P17-5006
PWC	https://paperswithcode.com/paper/tutorial-making-better-use-of-the-crowd
Repo
Framework

Lightly-Supervised Modeling of Argument Persuasiveness


Title	Lightly-Supervised Modeling of Argument Persuasiveness
Authors	Isaac Persing, Vincent Ng
Abstract	We propose the first lightly-supervised approach to scoring an argument{'}s persuasiveness. Key to our approach is the novel hypothesis that lightly-supervised persuasiveness scoring is possible by explicitly modeling the major errors that negatively impact persuasiveness. In an evaluation on a new annotated corpus of online debate arguments, our approach rivals its fully-supervised counterparts in performance by four scoring metrics when using only 10{%} of the available training instances.
Tasks	Argument Mining
Published	2017-11-01
URL	https://www.aclweb.org/anthology/I17-1060/
PDF	https://www.aclweb.org/anthology/I17-1060
PWC	https://paperswithcode.com/paper/lightly-supervised-modeling-of-argument
Repo
Framework

Texture image retrieval using DTCWT-SVD and local binary pattern features


Title	Texture image retrieval using DTCWT-SVD and local binary pattern features
Authors	Dayou, Jiang; Jongweon, Kim
Abstract	The combination texture feature extraction approach for texture image retrieval is proposed in this paper. Two kinds of low-level texture features were combined in the approach. One of them was extracted from singular value decomposition (SVD) based dual-tree complex wavelet transform (DTCWT) coefficients, and the other one was extracted from multi-scale local binary patterns (LBPs). The fusion features of SVD based multi-directional wavelet features and multi-scale LBP features have short dimensions of the feature vector. The comparing experiments are conducted on Brodatz and Vistex datasets. According to the experimental results, the proposed method has a relatively better performance in aspect of retrieval accuracy and time complexity upon the existing methods.
Tasks	Image Retrieval, Texture Image Retrieval
Published	2017-12-01
URL	http://jips-k.org/q.jips?cp=pp&pn=523
PDF	http://jips-k.org/q.jips?cp=pp&pn=523
PWC	https://paperswithcode.com/paper/texture-image-retrieval-using-dtcwt-svd-and
Repo
Framework

Alignment at Work: Using Language to Distinguish the Internalization and Self-Regulation Components of Cultural Fit in Organizations


Title	Alignment at Work: Using Language to Distinguish the Internalization and Self-Regulation Components of Cultural Fit in Organizations
Authors	Gabriel Doyle, Amir Goldberg, Sameer Srivastava, Michael Frank
Abstract	Cultural fit is widely believed to affect the success of individuals and the groups to which they belong. Yet it remains an elusive, poorly measured construct. Recent research draws on computational linguistics to measure cultural fit but overlooks asymmetries in cultural adaptation. By contrast, we develop a directed, dynamic measure of cultural fit based on linguistic alignment, which estimates the influence of one person{'}s word use on another{'}s and distinguishes between two enculturation mechanisms: internalization and self-regulation. We use this measure to trace employees{'} enculturation trajectories over a large, multi-year corpus of corporate emails and find that patterns of alignment in the first six months of employment are predictive of individuals{'} downstream outcomes, especially involuntary exit. Further predictive analyses suggest referential alignment plays an overlooked role in linguistic alignment.
Tasks	Language Modelling
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-1056/
PDF	https://www.aclweb.org/anthology/P17-1056
PWC	https://paperswithcode.com/paper/alignment-at-work-using-language-to
Repo
Framework

Spindle Net: Person Re-Identification With Human Body Region Guided Feature Decomposition and Fusion


Title	Spindle Net: Person Re-Identification With Human Body Region Guided Feature Decomposition and Fusion
Authors	Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, Xiaoou Tang
Abstract	Person re-identification (ReID) is an important task in video surveillance and has various applications. It is non-trivial due to complex background clutters, varying illumination conditions, and uncontrollable camera settings. Moreover, the person body misalignment caused by detectors or pose variations is sometimes too severe for feature matching across images. In this study, we propose a novel Convolutional Neural Network (CNN), called Spindle Net, based on human body region guided multi-stage feature decomposition and tree-structured competitive feature fusion. It is the first time human body structure information is considered in a CNN framework to facilitate feature learning. The proposed Spindle Net brings unique advantages: 1) it separately captures semantic features from different body regions thus the macro- and micro-body features can be well aligned across images, 2) the learned region features from different semantic regions are merged with a competitive scheme and discriminative features can be well preserved. State of the art performance can be achieved on multiple datasets by large margins. We further demonstrate the robustness and effectiveness of the proposed Spindle Net on our proposed dataset SenseReID without fine-tuning.
Tasks	Person Re-Identification
Published	2017-07-01
URL	http://openaccess.thecvf.com/content_cvpr_2017/html/Zhao_Spindle_Net_Person_CVPR_2017_paper.html
PDF	http://openaccess.thecvf.com/content_cvpr_2017/papers/Zhao_Spindle_Net_Person_CVPR_2017_paper.pdf
PWC	https://paperswithcode.com/paper/spindle-net-person-re-identification-with
Repo
Framework

Learning with learner corpora: Using the TLE for native language identification


Title	Learning with learner corpora: Using the TLE for native language identification
Authors	Allison Adams, Sara Stymne
Abstract
Tasks	Language Acquisition, Language Identification, Native Language Identification, Text Classification
Published	2017-05-01
URL	https://www.aclweb.org/anthology/W17-0301/
PDF	https://www.aclweb.org/anthology/W17-0301
PWC	https://paperswithcode.com/paper/learning-with-learner-corpora-using-the-tle
Repo
Framework