May 7, 2019

2924 words 14 mins read

Paper Group ANR 25

XCSP3: An Integrated Format for Benchmarking Combinatorial Constrained Problems. Semi-supervised and Unsupervised Methods for Categorizing Posts in Web Discussion Forums. Bootstrapping Face Detection with Hard Negative Examples. End-to-End Data Visualization by Metric Learning and Coordinate Transformation. Tree Space Prototypes: Another Look at Ma …

XCSP3: An Integrated Format for Benchmarking Combinatorial Constrained Problems


Title	XCSP3: An Integrated Format for Benchmarking Combinatorial Constrained Problems
Authors	Frederic Boussemart, Christophe Lecoutre, Gilles Audemard, Cédric Piette
Abstract	We propose a major revision of the format XCSP 2.1, called XCSP3, to build integrated representations of combinatorial constrained problems. This new format is able to deal with mono/multi optimization, many types of variables, cost functions, reification, views, annotations, variable quantification, distributed, probabilistic and qualitative reasoning. The new format is made compact, highly readable, and rather easy to parse. Interestingly, it captures the structure of the problem models, through the possibilities of declaring arrays of variables, and identifying syntactic and semantic groups of constraints. The number of constraints is kept under control by introducing a limited set of basic constraint forms, and producing almost automatically some of their variations through lifting, restriction, sliding, logical combination and relaxation mechanisms. As a result, XCSP3 encompasses practically all constraints that can be found in major constraint solvers developed by the CP community. A website, which is developed conjointly with the format, contains many models and series of instances. The user can make sophisticated queries for selecting instances from very precise criteria. The objective of XCSP3 is to ease the effort required to test and compare different algorithms by providing a common test-bed of combinatorial constrained instances.
Tasks
Published	2016-11-10
URL	http://arxiv.org/abs/1611.03398v2
PDF	http://arxiv.org/pdf/1611.03398v2.pdf
PWC	https://paperswithcode.com/paper/xcsp3-an-integrated-format-for-benchmarking
Repo
Framework

Semi-supervised and Unsupervised Methods for Categorizing Posts in Web Discussion Forums


Title	Semi-supervised and Unsupervised Methods for Categorizing Posts in Web Discussion Forums
Authors	Krish Perumal
Abstract	Web discussion forums are used by millions of people worldwide to share information belonging to a variety of domains such as automotive vehicles, pets, sports, etc. They typically contain posts that fall into different categories such as problem, solution, feedback, spam, etc. Automatic identification of these categories can aid information retrieval that is tailored for specific user requirements. Previously, a number of supervised methods have attempted to solve this problem; however, these depend on the availability of abundant training data. A few existing unsupervised and semi-supervised approaches are either focused on identifying a single category or do not report category-specific performance. In contrast, this work proposes unsupervised and semi-supervised methods that require no or minimal training data to achieve this objective without compromising on performance. A fine-grained analysis is also carried out to discuss their limitations. The proposed methods are based on sequence models (specifically, Hidden Markov Models) that can model language for each category using word and part-of-speech probability distributions, and manually specified features. Empirical evaluations across domains demonstrate that the proposed methods are better suited for this task than existing ones.
Tasks	Information Retrieval
Published	2016-04-01
URL	http://arxiv.org/abs/1604.00119v3
PDF	http://arxiv.org/pdf/1604.00119v3.pdf
PWC	https://paperswithcode.com/paper/semi-supervised-and-unsupervised-methods-for
Repo
Framework

Bootstrapping Face Detection with Hard Negative Examples


Title	Bootstrapping Face Detection with Hard Negative Examples
Authors	Shaohua Wan, Zhijun Chen, Tao Zhang, Bo Zhang, Kong-kat Wong
Abstract	Recently significant performance improvement in face detection was made possible by deeply trained convolutional networks. In this report, a novel approach for training state-of-the-art face detector is described. The key is to exploit the idea of hard negative mining and iteratively update the Faster R-CNN based face detector with the hard negatives harvested from a large set of background examples. We demonstrate that our face detector outperforms state-of-the-art detectors on the FDDB dataset, which is the de facto standard for evaluating face detection algorithms.
Tasks	Face Detection
Published	2016-08-07
URL	http://arxiv.org/abs/1608.02236v1
PDF	http://arxiv.org/pdf/1608.02236v1.pdf
PWC	https://paperswithcode.com/paper/bootstrapping-face-detection-with-hard
Repo
Framework

End-to-End Data Visualization by Metric Learning and Coordinate Transformation


Title	End-to-End Data Visualization by Metric Learning and Coordinate Transformation
Authors	Lilei Zheng, Ying Zhang, Stefan Duffner, Khalid Idrissi, Christophe Garcia, Atilla Baskurt
Abstract	This paper presents a deep nonlinear metric learning framework for data visualization on an image dataset. We propose the Triangular Similarity and prove its equivalence to the Cosine Similarity in measuring a data pair. Based on this novel similarity, a geometrically motivated loss function - the triangular loss - is then developed for optimizing a metric learning system comprising two identical CNNs. It is shown that this deep nonlinear system can be efficiently trained by a hybrid algorithm based on the conventional backpropagation algorithm. More interestingly, benefiting from classical manifold learning theories, the proposed system offers two different views to visualize the outputs, the second of which provides better classification results than the state-of-the-art methods in the visualizable spaces.
Tasks	Metric Learning
Published	2016-12-27
URL	http://arxiv.org/abs/1612.08499v1
PDF	http://arxiv.org/pdf/1612.08499v1.pdf
PWC	https://paperswithcode.com/paper/end-to-end-data-visualization-by-metric
Repo
Framework

Tree Space Prototypes: Another Look at Making Tree Ensembles Interpretable


Title	Tree Space Prototypes: Another Look at Making Tree Ensembles Interpretable
Authors	Sarah Tan, Matvey Soloviev, Giles Hooker, Martin T. Wells
Abstract	Ensembles of decision trees are known to perform well on many problems, but are not interpretable. In contrast to existing explanations of tree ensembles that explain relationships between features and predictions, we propose an alternative approach to interpreting tree ensembles by surfacing representative points for each class, in which we explain a prediction by presenting points with similar predictions – prototypes. We introduce a new distance for Gradient Boosted Tree models, and propose new prototype selection methods with theoretical guarantees, with the flexibility to choose a different number of prototypes in each class. We demonstrate our methods on random forests and gradient boosted trees, showing that our found prototypes perform as well as or even better than the original tree ensemble when used as a nearest-prototype classifier. We also present a use case of debugging dataset errors using our proposed methods.
Tasks
Published	2016-11-22
URL	https://arxiv.org/abs/1611.07115v2
PDF	https://arxiv.org/pdf/1611.07115v2.pdf
PWC	https://paperswithcode.com/paper/tree-space-prototypes-another-look-at-making
Repo
Framework

Nonnegative Matrix Factorization for identification of unknown number of sources emitting delayed signals


Title	Nonnegative Matrix Factorization for identification of unknown number of sources emitting delayed signals
Authors	Filip L. Iliev, Valentin G. Stanev, Velimir V. Vesselinov, Boian S. Alexandrov
Abstract	Factor analysis is broadly used as a powerful unsupervised machine learning tool for reconstruction of hidden features in recorded mixtures of signals. In the case of a linear approximation, the mixtures can be decomposed by a variety of model-free Blind Source Separation (BSS) algorithms. Most of the available BSS algorithms consider an instantaneous mixing of signals, while the case when the mixtures are linear combinations of signals with delays is less explored. Especially difficult is the case when the number of sources of the signals with delays is unknown and has to be determined from the data as well. To address this problem, in this paper, we present a new method based on Nonnegative Matrix Factorization (NMF) that is capable of identifying: (a) the unknown number of the sources, (b) the delays and speed of propagation of the signals, and (c) the locations of the sources. Our method can be used to decompose records of mixtures of signals with delays emitted by an unknown number of sources in a nondispersive medium, based only on recorded data. This is the case, for example, when electromagnetic signals from multiple antennas are received asynchronously; or mixtures of acoustic or seismic signals recorded by sensors located at different positions; or when a shift in frequency is induced by the Doppler effect. By applying our method to synthetic datasets, we demonstrate its ability to identify the unknown number of sources as well as the waveforms, the delays, and the strengths of the signals. Using Bayesian analysis, we also evaluate estimation uncertainties and identify the region of likelihood where the positions of the sources can be found.
Tasks
Published	2016-12-12
URL	http://arxiv.org/abs/1612.03950v2
PDF	http://arxiv.org/pdf/1612.03950v2.pdf
PWC	https://paperswithcode.com/paper/nonnegative-matrix-factorization-for
Repo
Framework

Predicting health inspection results from online restaurant reviews


Title	Predicting health inspection results from online restaurant reviews
Authors	Samantha Wong, Hamidreza Chinaei, Frank Rudzicz
Abstract	Informatics around public health are increasingly shifting from the professional to the public spheres. In this work, we apply linguistic analytics to restaurant reviews, from Yelp, in order to automatically predict official health inspection reports. We consider two types of feature sets, i.e., keyword detection and topic model features, and use these in several classification methods. Our empirical analysis shows that these extracted features can predict public health inspection reports with over 90% accuracy using simple support vector machines.
Tasks
Published	2016-03-17
URL	http://arxiv.org/abs/1603.05673v1
PDF	http://arxiv.org/pdf/1603.05673v1.pdf
PWC	https://paperswithcode.com/paper/predicting-health-inspection-results-from
Repo
Framework

General Vector Machine


Title	General Vector Machine
Authors	Hong Zhao
Abstract	The support vector machine (SVM) is an important class of learning machines for function approach, pattern recognition, and time-serious prediction, etc. It maps samples into the feature space by so-called support vectors of selected samples, and then feature vectors are separated by maximum margin hyperplane. The present paper presents the general vector machine (GVM) to replace the SVM. The support vectors are replaced by general project vectors selected from the usual vector space, and a Monte Carlo (MC) algorithm is developed to find the general vectors. The general project vectors improves the feature-extraction ability, and the MC algorithm can control the width of the separation margin of the hyperplane. By controlling the separation margin, we show that the maximum margin hyperplane can usually induce the overlearning, and the best learning machine is achieved with a proper separation margin. Applications in function approach, pattern recognition, and classification indicate that the developed method is very successful, particularly for small-set training problems. Additionally, our algorithm may induce some particular applications, such as for the transductive inference.
Tasks
Published	2016-02-12
URL	http://arxiv.org/abs/1602.03950v1
PDF	http://arxiv.org/pdf/1602.03950v1.pdf
PWC	https://paperswithcode.com/paper/general-vector-machine
Repo
Framework

Learning to Push by Grasping: Using multiple tasks for effective learning


Title	Learning to Push by Grasping: Using multiple tasks for effective learning
Authors	Lerrel Pinto, Abhinav Gupta
Abstract	Recently, end-to-end learning frameworks are gaining prevalence in the field of robot control. These frameworks input states/images and directly predict the torques or the action parameters. However, these approaches are often critiqued due to their huge data requirements for learning a task. The argument of the difficulty in scalability to multiple tasks is well founded, since training these tasks often require hundreds or thousands of examples. But do end-to-end approaches need to learn a unique model for every task? Intuitively, it seems that sharing across tasks should help since all tasks require some common understanding of the environment. In this paper, we attempt to take the next step in data-driven end-to-end learning frameworks: move from the realm of task-specific models to joint learning of multiple robot tasks. In an astonishing result we show that models with multi-task learning tend to perform better than task-specific models trained with same amounts of data. For example, a deep-network learned with 2.5K grasp and 2.5K push examples performs better on grasping than a network trained on 5K grasp examples.
Tasks	Multi-Task Learning
Published	2016-09-28
URL	http://arxiv.org/abs/1609.09025v1
PDF	http://arxiv.org/pdf/1609.09025v1.pdf
PWC	https://paperswithcode.com/paper/learning-to-push-by-grasping-using-multiple
Repo
Framework

Multiple-View Spectral Clustering for Group-wise Functional Community Detection


Title	Multiple-View Spectral Clustering for Group-wise Functional Community Detection
Authors	Nathan D. Cahill, Harmeet Singh, Chao Zhang, Daryl A. Corcoran, Alison M. Prengaman, Paul S. Wenger, John F. Hamilton, Peter Bajorski, Andrew M. Michael
Abstract	Functional connectivity analysis yields powerful insights into our understanding of the human brain. Group-wise functional community detection aims to partition the brain into clusters, or communities, in which functional activity is inter-regionally correlated in a common manner across a group of subjects. In this article, we show how to use multiple-view spectral clustering to perform group-wise functional community detection. In a series of experiments on 291 subjects from the Human Connectome Project, we compare three versions of multiple-view spectral clustering: MVSC (uniform weights), MVSCW (weights based on subject-specific embedding quality), and AASC (weights optimized along with the embedding) with the competing technique of Joint Diagonalization of Laplacians (JDL). Results show that multiple-view spectral clustering not only yields group-wise functional communities that are more consistent than JDL when using randomly selected subsets of individual brains, but it is several orders of magnitude faster than JDL.
Tasks	Community Detection
Published	2016-11-21
URL	http://arxiv.org/abs/1611.06981v1
PDF	http://arxiv.org/pdf/1611.06981v1.pdf
PWC	https://paperswithcode.com/paper/multiple-view-spectral-clustering-for-group
Repo
Framework

Pairwise Quantization


Title	Pairwise Quantization
Authors	Artem Babenko, Relja Arandjelović, Victor Lempitsky
Abstract	We consider the task of lossy compression of high-dimensional vectors through quantization. We propose the approach that learns quantization parameters by minimizing the distortion of scalar products and squared distances between pairs of points. This is in contrast to previous works that obtain these parameters through the minimization of the reconstruction error of individual points. The proposed approach proceeds by finding a linear transformation of the data that effectively reduces the minimization of the pairwise distortions to the minimization of individual reconstruction errors. After such transformation, any of the previously-proposed quantization approaches can be used. Despite the simplicity of this transformation, the experiments demonstrate that it achieves considerable reduction of the pairwise distortions compared to applying quantization directly to the untransformed data.
Tasks	Quantization
Published	2016-06-05
URL	http://arxiv.org/abs/1606.01550v1
PDF	http://arxiv.org/pdf/1606.01550v1.pdf
PWC	https://paperswithcode.com/paper/pairwise-quantization
Repo
Framework

Attention Correctness in Neural Image Captioning


Title	Attention Correctness in Neural Image Captioning
Authors	Chenxi Liu, Junhua Mao, Fei Sha, Alan Yuille
Abstract	Attention mechanisms have recently been introduced in deep learning for various tasks in natural language processing and computer vision. But despite their popularity, the “correctness” of the implicitly-learned attention maps has only been assessed qualitatively by visualization of several examples. In this paper we focus on evaluating and improving the correctness of attention in neural image captioning models. Specifically, we propose a quantitative evaluation metric for the consistency between the generated attention maps and human annotations, using recently released datasets with alignment between regions in images and entities in captions. We then propose novel models with different levels of explicit supervision for learning attention maps during training. The supervision can be strong when alignment between regions and caption entities are available, or weak when only object segments and categories are provided. We show on the popular Flickr30k and COCO datasets that introducing supervision of attention maps during training solidly improves both attention correctness and caption quality, showing the promise of making machine perception more human-like.
Tasks	Image Captioning
Published	2016-05-31
URL	http://arxiv.org/abs/1605.09553v2
PDF	http://arxiv.org/pdf/1605.09553v2.pdf
PWC	https://paperswithcode.com/paper/attention-correctness-in-neural-image
Repo
Framework

Representing Verbs with Rich Contexts: an Evaluation on Verb Similarity


Title	Representing Verbs with Rich Contexts: an Evaluation on Verb Similarity
Authors	Emmanuele Chersoni, Enrico Santus, Alessandro Lenci, Philippe Blache, Chu-Ren Huang
Abstract	Several studies on sentence processing suggest that the mental lexicon keeps track of the mutual expectations between words. Current DSMs, however, represent context words as separate features, thereby loosing important information for word expectations, such as word interrelations. In this paper, we present a DSM that addresses this issue by defining verb contexts as joint syntactic dependencies. We test our representation in a verb similarity task on two datasets, showing that joint contexts achieve performances comparable to single dependencies or even better. Moreover, they are able to overcome the data sparsity problem of joint feature spaces, in spite of the limited size of our training corpus.
Tasks
Published	2016-07-07
URL	http://arxiv.org/abs/1607.02061v2
PDF	http://arxiv.org/pdf/1607.02061v2.pdf
PWC	https://paperswithcode.com/paper/representing-verbs-with-rich-contexts-an
Repo
Framework

Partially blind domain adaptation for age prediction from DNA methylation data


Title	Partially blind domain adaptation for age prediction from DNA methylation data
Authors	Lisa Handl, Adrin Jalali, Michael Scherer, Nico Pfeifer
Abstract	Over the last years, huge resources of biological and medical data have become available for research. This data offers great chances for machine learning applications in health care, e.g. for precision medicine, but is also challenging to analyze. Typical challenges include a large number of possibly correlated features and heterogeneity in the data. One flourishing field of biological research in which this is relevant is epigenetics. Here, especially large amounts of DNA methylation data have emerged. This epigenetic mark has been used to predict a donor’s ‘epigenetic age’ and increased epigenetic aging has been linked to lifestyle and disease history. In this paper we propose an adaptive model which performs feature selection for each test sample individually based on the distribution of the input data. The method can be seen as partially blind domain adaptation. We apply the model to the problem of age prediction based on DNA methylation data from a variety of tissues, and compare it to a standard model, which does not take heterogeneity into account. The standard approach has particularly bad performance on one tissue type on which we show substantial improvement with our new adaptive approach even though no samples of that tissue were part of the training data.
Tasks	Domain Adaptation, Feature Selection
Published	2016-12-20
URL	http://arxiv.org/abs/1612.06650v1
PDF	http://arxiv.org/pdf/1612.06650v1.pdf
PWC	https://paperswithcode.com/paper/partially-blind-domain-adaptation-for-age
Repo
Framework

Towards Unifying Hamiltonian Monte Carlo and Slice Sampling


Title	Towards Unifying Hamiltonian Monte Carlo and Slice Sampling
Authors	Yizhe Zhang, Xiangyu Wang, Changyou Chen, Ricardo Henao, Kai Fan, Lawrence Carin
Abstract	We unify slice sampling and Hamiltonian Monte Carlo (HMC) sampling, demonstrating their connection via the Hamiltonian-Jacobi equation from Hamiltonian mechanics. This insight enables extension of HMC and slice sampling to a broader family of samplers, called Monomial Gamma Samplers (MGS). We provide a theoretical analysis of the mixing performance of such samplers, proving that in the limit of a single parameter, the MGS draws decorrelated samples from the desired target distribution. We further show that as this parameter tends toward this limit, performance gains are achieved at a cost of increasing numerical difficulty and some practical convergence issues. Our theoretical results are validated with synthetic data and real-world applications.
Tasks
Published	2016-02-25
URL	http://arxiv.org/abs/1602.07800v5
PDF	http://arxiv.org/pdf/1602.07800v5.pdf
PWC	https://paperswithcode.com/paper/towards-unifying-hamiltonian-monte-carlo-and
Repo
Framework