May 6, 2019

3006 words 15 mins read

Paper Group ANR 355

Variable Neighborhood Search Algorithms for the multi-depot dial-a-ride problem with heterogeneous vehicles and users. Path planning with Inventory-driven Jump-Point-Search. Splitting matters: how monotone transformation of predictor variables may improve the predictions of decision tree models. A New Android Malware Detection Approach Using Bayesi …

Variable Neighborhood Search Algorithms for the multi-depot dial-a-ride problem with heterogeneous vehicles and users


Title	Variable Neighborhood Search Algorithms for the multi-depot dial-a-ride problem with heterogeneous vehicles and users
Authors	Paolo Detti, Garazi Zabalo Manrique de Lara
Abstract	In this work, a study on Variable Neighborhood Search algorithms for multi-depot dial-a-ride problems is presented. In dial-a-ride problems patients need to be transported from pre-specified pickup locations to pre-specified delivery locations, under different considerations. The addressed problem presents several constraints and features, such as heterogeneous vehicles, distributed in different depots, and heterogeneous patients. The aim is of minimizing the total routing cost, while respecting time-window, ride-time, capacity and route duration constraints. The objective of the study is of determining the best algorithm configuration in terms of initial solution, neighborhood and local search procedures. At this aim, two different procedures for the computation of an initial solution, six different type of neighborhoods and five local search procedures, where only intra-route changes are made, have been considered and compared. We have also evaluated an “adjusting procedure” that aims to produce feasible solutions from infeasible solutions with small constraints violations. The different VNS algorithms have been tested on instances from literature as well as on random instances arising from a real-world healthcare application.
Tasks
Published	2016-11-16
URL	http://arxiv.org/abs/1611.05187v1
PDF	http://arxiv.org/pdf/1611.05187v1.pdf
PWC	https://paperswithcode.com/paper/variable-neighborhood-search-algorithms-for
Repo
Framework

Path planning with Inventory-driven Jump-Point-Search


Title	Path planning with Inventory-driven Jump-Point-Search
Authors	Davide Aversa, Sebastian Sardina, Stavros Vassos
Abstract	In many navigational domains the traversability of cells is conditioned on the path taken. This is often the case in video-games, in which a character may need to acquire a certain object (i.e., a key or a flying suit) to be able to traverse specific locations (e.g., doors or high walls). In order for non-player characters to handle such scenarios we present invJPS, an “inventory-driven” pathfinding approach based on the highly successful grid-based Jump-Point-Search (JPS) algorithm. We show, formally and experimentally, that the invJPS preserves JPS’s optimality guarantees and its symmetry breaking advantages in inventory-based variants of game maps.
Tasks
Published	2016-07-04
URL	http://arxiv.org/abs/1607.00715v1
PDF	http://arxiv.org/pdf/1607.00715v1.pdf
PWC	https://paperswithcode.com/paper/path-planning-with-inventory-driven-jump
Repo
Framework

Splitting matters: how monotone transformation of predictor variables may improve the predictions of decision tree models


Title	Splitting matters: how monotone transformation of predictor variables may improve the predictions of decision tree models
Authors	Tal Galili, Isaac Meilijson
Abstract	It is widely believed that the prediction accuracy of decision tree models is invariant under any strictly monotone transformation of the individual predictor variables. However, this statement may be false when predicting new observations with values that were not seen in the training-set and are close to the location of the split point of a tree rule. The sensitivity of the prediction error to the split point interpolation is high when the split point of the tree is estimated based on very few observations, reaching 9% misclassification error when only 10 observations are used for constructing a split, and shrinking to 1% when relying on 100 observations. This study compares the performance of alternative methods for split point interpolation and concludes that the best choice is taking the mid-point between the two closest points to the split point of the tree. Furthermore, if the (continuous) distribution of the predictor variable is known, then using its probability integral for transforming the variable (“quantile transformation”) will reduce the model’s interpolation error by up to about a half on average. Accordingly, this study provides guidelines for both developers and users of decision tree models (including bagging and random forest).
Tasks
Published	2016-11-14
URL	http://arxiv.org/abs/1611.04561v1
PDF	http://arxiv.org/pdf/1611.04561v1.pdf
PWC	https://paperswithcode.com/paper/splitting-matters-how-monotone-transformation
Repo
Framework

A New Android Malware Detection Approach Using Bayesian Classification


Title	A New Android Malware Detection Approach Using Bayesian Classification
Authors	Suleiman Y. Yerima, Sakir Sezer, Gavin McWilliams, Igor Muttik
Abstract	Mobile malware has been growing in scale and complexity as smartphone usage continues to rise. Android has surpassed other mobile platforms as the most popular whilst also witnessing a dramatic increase in malware targeting the platform. A worrying trend that is emerging is the increasing sophistication of Android malware to evade detection by traditional signature-based scanners. As such, Android app marketplaces remain at risk of hosting malicious apps that could evade detection before being downloaded by unsuspecting users. Hence, in this paper we present an effective approach to alleviate this problem based on Bayesian classification models obtained from static code analysis. The models are built from a collection of code and app characteristics that provide indicators of potential malicious activities. The models are evaluated with real malware samples in the wild and results of experiments are presented to demonstrate the effectiveness of the proposed approach.
Tasks	Android Malware Detection, Malware Detection
Published	2016-08-02
URL	http://arxiv.org/abs/1608.00848v1
PDF	http://arxiv.org/pdf/1608.00848v1.pdf
PWC	https://paperswithcode.com/paper/a-new-android-malware-detection-approach
Repo
Framework

When Fashion Meets Big Data: Discriminative Mining of Best Selling Clothing Features


Title	When Fashion Meets Big Data: Discriminative Mining of Best Selling Clothing Features
Authors	Kuan-Ting Chen, Jiebo Luo
Abstract	With the prevalence of e-commence websites and the ease of online shopping, consumers are embracing huge amounts of various options in products. Undeniably, shopping is one of the most essential activities in our society and studying consumer’s shopping behavior is important for the industry as well as sociology and psychology. Indisputable, one of the most popular e-commerce categories is clothing business. There arises the needs for analysis of popular and attractive clothing features which could further boost many emerging applications, such as clothing recommendation and advertising. In this work, we design a novel system that consists of three major components: 1) exploring and organizing a large-scale clothing dataset from a online shopping website, 2) pruning and extracting images of best-selling products in clothing item data and user transaction history, and 3) utilizing a machine learning based approach to discovering fine-grained clothing attributes as the representative and discriminative characteristics of popular clothing style elements. Through the experiments over a large-scale online clothing shopping dataset, we demonstrate the effectiveness of our proposed system, and obtain useful insights on clothing consumption trends and profitable clothing features.
Tasks
Published	2016-11-11
URL	http://arxiv.org/abs/1611.03915v2
PDF	http://arxiv.org/pdf/1611.03915v2.pdf
PWC	https://paperswithcode.com/paper/when-fashion-meets-big-data-discriminative
Repo
Framework

Geometric deep learning: going beyond Euclidean data


Title	Geometric deep learning: going beyond Euclidean data
Authors	Michael M. Bronstein, Joan Bruna, Yann LeCun, Arthur Szlam, Pierre Vandergheynst
Abstract	Many scientific fields study data with an underlying structure that is a non-Euclidean space. Some examples include social networks in computational social sciences, sensor networks in communications, functional networks in brain imaging, regulatory networks in genetics, and meshed surfaces in computer graphics. In many applications, such geometric data are large and complex (in the case of social networks, on the scale of billions), and are natural targets for machine learning techniques. In particular, we would like to use deep neural networks, which have recently proven to be powerful tools for a broad range of problems from computer vision, natural language processing, and audio analysis. However, these tools have been most successful on data with an underlying Euclidean or grid-like structure, and in cases where the invariances of these structures are built into networks used to model them. Geometric deep learning is an umbrella term for emerging techniques attempting to generalize (structured) deep neural models to non-Euclidean domains such as graphs and manifolds. The purpose of this paper is to overview different examples of geometric deep learning problems and present available solutions, key difficulties, applications, and future research directions in this nascent field.
Tasks
Published	2016-11-24
URL	http://arxiv.org/abs/1611.08097v2
PDF	http://arxiv.org/pdf/1611.08097v2.pdf
PWC	https://paperswithcode.com/paper/geometric-deep-learning-going-beyond
Repo
Framework

A Surrogate-based Generic Classifier for Chinese TV Series Reviews


Title	A Surrogate-based Generic Classifier for Chinese TV Series Reviews
Authors	Yufeng Ma, Long Xia, Wenqi Shen, Mi Zhou, Weiguo Fan
Abstract	With the emerging of various online video platforms like Youtube, Youku and LeTV, online TV series’ reviews become more and more important both for viewers and producers. Customers rely heavily on these reviews before selecting TV series, while producers use them to improve the quality. As a result, automatically classifying reviews according to different requirements evolves as a popular research topic and is essential in our daily life. In this paper, we focused on reviews of hot TV series in China and successfully trained generic classifiers based on eight predefined categories. The experimental results showed promising performance and effectiveness of its generalization to different TV series.
Tasks
Published	2016-11-08
URL	http://arxiv.org/abs/1611.02378v2
PDF	http://arxiv.org/pdf/1611.02378v2.pdf
PWC	https://paperswithcode.com/paper/a-surrogate-based-generic-classifier-for
Repo
Framework

Stabilized Sparse Online Learning for Sparse Data


Title	Stabilized Sparse Online Learning for Sparse Data
Authors	Yuting Ma, Tian Zheng
Abstract	Stochastic gradient descent (SGD) is commonly used for optimization in large-scale machine learning problems. Langford et al. (2009) introduce a sparse online learning method to induce sparsity via truncated gradient. With high-dimensional sparse data, however, the method suffers from slow convergence and high variance due to the heterogeneity in feature sparsity. To mitigate this issue, we introduce a stabilized truncated stochastic gradient descent algorithm. We employ a soft-thresholding scheme on the weight vector where the imposed shrinkage is adaptive to the amount of information available in each feature. The variability in the resulted sparse weight vector is further controlled by stability selection integrated with the informative truncation. To facilitate better convergence, we adopt an annealing strategy on the truncation rate, which leads to a balanced trade-off between exploration and exploitation in learning a sparse weight vector. Numerical experiments show that our algorithm compares favorably with the original algorithm in terms of prediction accuracy, achieved sparsity and stability.
Tasks
Published	2016-04-21
URL	http://arxiv.org/abs/1604.06498v3
PDF	http://arxiv.org/pdf/1604.06498v3.pdf
PWC	https://paperswithcode.com/paper/stabilized-sparse-online-learning-for-sparse
Repo
Framework

Efficient Estimation of Compressible State-Space Models with Application to Calcium Signal Deconvolution


Title	Efficient Estimation of Compressible State-Space Models with Application to Calcium Signal Deconvolution
Authors	Abbas Kazemipour, Ji Liu, Patrick Kanold, Min Wu, Behtash Babadi
Abstract	In this paper, we consider linear state-space models with compressible innovations and convergent transition matrices in order to model spatiotemporally sparse transient events. We perform parameter and state estimation using a dynamic compressed sensing framework and develop an efficient solution consisting of two nested Expectation-Maximization (EM) algorithms. Under suitable sparsity assumptions on the innovations, we prove recovery guarantees and derive confidence bounds for the state estimates. We provide simulation studies as well as application to spike deconvolution from calcium imaging data which verify our theoretical results and show significant improvement over existing algorithms.
Tasks
Published	2016-10-20
URL	http://arxiv.org/abs/1610.06461v1
PDF	http://arxiv.org/pdf/1610.06461v1.pdf
PWC	https://paperswithcode.com/paper/efficient-estimation-of-compressible-state
Repo
Framework

PixelCNN Models with Auxiliary Variables for Natural Image Modeling


Title	PixelCNN Models with Auxiliary Variables for Natural Image Modeling
Authors	Alexander Kolesnikov, Christoph H. Lampert
Abstract	We study probabilistic models of natural images and extend the autoregressive family of PixelCNN architectures by incorporating auxiliary variables. Subsequently, we describe two new generative image models that exploit different image transformations as auxiliary variables: a quantized grayscale view of the image or a multi-resolution image pyramid. The proposed models tackle two known shortcomings of existing PixelCNN models: 1) their tendency to focus on low-level image details, while largely ignoring high-level image information, such as object shapes, and 2) their computationally costly procedure for image sampling. We experimentally demonstrate benefits of the proposed models, in particular showing that they produce much more realistically looking image samples than previous state-of-the-art probabilistic models.
Tasks	Image Generation
Published	2016-12-24
URL	http://arxiv.org/abs/1612.08185v4
PDF	http://arxiv.org/pdf/1612.08185v4.pdf
PWC	https://paperswithcode.com/paper/pixelcnn-models-with-auxiliary-variables-for
Repo
Framework

Learning to Distill: The Essence Vector Modeling Framework


Title	Learning to Distill: The Essence Vector Modeling Framework
Authors	Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang
Abstract	In the context of natural language processing, representation learning has emerged as a newly active research subject because of its excellent performance in many applications. Learning representations of words is a pioneering study in this school of research. However, paragraph (or sentence and document) embedding learning is more suitable/reasonable for some tasks, such as sentiment classification and document summarization. Nevertheless, as far as we are aware, there is relatively less work focusing on the development of unsupervised paragraph embedding methods. Classic paragraph embedding methods infer the representation of a given paragraph by considering all of the words occurring in the paragraph. Consequently, those stop or function words that occur frequently may mislead the embedding learning process to produce a misty paragraph representation. Motivated by these observations, our major contributions in this paper are twofold. First, we propose a novel unsupervised paragraph embedding method, named the essence vector (EV) model, which aims at not only distilling the most representative information from a paragraph but also excluding the general background information to produce a more informative low-dimensional vector representation for the paragraph. Second, in view of the increasing importance of spoken content processing, an extension of the EV model, named the denoising essence vector (D-EV) model, is proposed. The D-EV model not only inherits the advantages of the EV model but also can infer a more robust representation for a given spoken paragraph against imperfect speech recognition.
Tasks	Denoising, Document Embedding, Document Summarization, Representation Learning, Sentiment Analysis, Speech Recognition
Published	2016-11-22
URL	http://arxiv.org/abs/1611.07206v1
PDF	http://arxiv.org/pdf/1611.07206v1.pdf
PWC	https://paperswithcode.com/paper/learning-to-distill-the-essence-vector
Repo
Framework

Experimental and causal view on information integration in autonomous agents


Title	Experimental and causal view on information integration in autonomous agents
Authors	Philipp Geiger, Katja Hofmann, Bernhard Schölkopf
Abstract	The amount of digitally available but heterogeneous information about the world is remarkable, and new technologies such as self-driving cars, smart homes, or the internet of things may further increase it. In this paper we present preliminary ideas about certain aspects of the problem of how such heterogeneous information can be harnessed by autonomous agents. After discussing potentials and limitations of some existing approaches, we investigate how \emph{experiments} can help to obtain a better understanding of the problem. Specifically, we present a simple agent that integrates video data from a different agent, and implement and evaluate a version of it on the novel experimentation platform \emph{Malmo}. The focus of a second investigation is on how information about the hardware of different agents, the agents’ sensory data, and \emph{causal} information can be utilized for knowledge transfer between agents and subsequently more data-efficient decision making. Finally, we discuss potential future steps w.r.t.\ theory and experimentation, and formulate open questions.
Tasks	Decision Making, Self-Driving Cars, Transfer Learning
Published	2016-06-14
URL	http://arxiv.org/abs/1606.04250v3
PDF	http://arxiv.org/pdf/1606.04250v3.pdf
PWC	https://paperswithcode.com/paper/experimental-and-causal-view-on-information
Repo
Framework

Efficient 2D and 3D Facade Segmentation using Auto-Context


Title	Efficient 2D and 3D Facade Segmentation using Auto-Context
Authors	Raghudeep Gadde, Varun Jampani, Renaud Marlet, Peter V. Gehler
Abstract	This paper introduces a fast and efficient segmentation technique for 2D images and 3D point clouds of building facades. Facades of buildings are highly structured and consequently most methods that have been proposed for this problem aim to make use of this strong prior information. Contrary to most prior work, we are describing a system that is almost domain independent and consists of standard segmentation methods. We train a sequence of boosted decision trees using auto-context features. This is learned using stacked generalization. We find that this technique performs better, or comparable with all previous published methods and present empirical results on all available 2D and 3D facade benchmark datasets. The proposed method is simple to implement, easy to extend, and very efficient at test-time inference.
Tasks
Published	2016-06-21
URL	http://arxiv.org/abs/1606.06437v1
PDF	http://arxiv.org/pdf/1606.06437v1.pdf
PWC	https://paperswithcode.com/paper/efficient-2d-and-3d-facade-segmentation-using
Repo
Framework

Towards A Virtual Assistant That Can Be Taught New Tasks In Any Domain By Its End-Users


Title	Towards A Virtual Assistant That Can Be Taught New Tasks In Any Domain By Its End-Users
Authors	I. Dan Melamed, Nobal B. Niraula
Abstract	The challenge stated in the title can be divided into two main problems. The first problem is to reliably mimic the way that users interact with user interfaces. The second problem is to build an instructible agent, i.e. one that can be taught to execute tasks expressed as previously unseen natural language commands. This paper proposes a solution to the second problem, a system we call Helpa. End-users can teach Helpa arbitrary new tasks whose level of complexity is similar to the tasks available from today’s most popular virtual assistants. Teaching Helpa does not involve any programming. Instead, users teach Helpa by providing just one example of a command paired with a demonstration of how to execute that command. Helpa does not rely on any pre-existing domain-specific knowledge. It is therefore completely domain-independent. Our usability study showed that end-users can teach Helpa many new tasks in less than a minute each, often much less.
Tasks
Published	2016-06-30
URL	http://arxiv.org/abs/1607.00061v1
PDF	http://arxiv.org/pdf/1607.00061v1.pdf
PWC	https://paperswithcode.com/paper/towards-a-virtual-assistant-that-can-be
Repo
Framework

Locally Epistatic Models for Genome-wide Prediction and Association by Importance Sampling


Title	Locally Epistatic Models for Genome-wide Prediction and Association by Importance Sampling
Authors	Deniz Akdemir, Jean-Luc Jannink
Abstract	In statistical genetics an important task involves building predictive models for the genotype-phenotype relationships and thus attribute a proportion of the total phenotypic variance to the variation in genotypes. Numerous models have been proposed to incorporate additive genetic effects into models for prediction or association. However, there is a scarcity of models that can adequately account for gene by gene or other forms of genetical interactions. In addition, there is an increased interest in using marker annotations in genome-wide prediction and association. In this paper, we discuss an hybrid modeling methodology which combines the parametric mixed modeling approach and the non-parametric rule ensembles. This approach gives us a flexible class of models that can be used to capture additive, locally epistatic genetic effects, gene x background interactions and allows us to incorporate one or more annotations into the genomic selection or association models. We use benchmark data sets covering a range of organisms and traits in addition to simulated data sets to illustrate the strengths of this approach. The improvement of model accuracies and association results suggest that a part of the “missing heritability” in complex traits can be captured by modeling local epistasis.
Tasks
Published	2016-03-29
URL	http://arxiv.org/abs/1603.08813v1
PDF	http://arxiv.org/pdf/1603.08813v1.pdf
PWC	https://paperswithcode.com/paper/locally-epistatic-models-for-genome-wide
Repo
Framework