May 7, 2019

2760 words 13 mins read

Paper Group ANR 144

Robustness of Bayesian Pool-based Active Learning Against Prior Misspecification. Learning Shapes by Convex Composition. An Overview on Data Representation Learning: From Traditional Feature Learning to Recent Deep Learning. On-line Bayesian System Identification. A metric for sets of trajectories that is practical and mathematically consistent. No …

Robustness of Bayesian Pool-based Active Learning Against Prior Misspecification


Title	Robustness of Bayesian Pool-based Active Learning Against Prior Misspecification
Authors	Nguyen Viet Cuong, Nan Ye, Wee Sun Lee
Abstract	We study the robustness of active learning (AL) algorithms against prior misspecification: whether an algorithm achieves similar performance using a perturbed prior as compared to using the true prior. In both the average and worst cases of the maximum coverage setting, we prove that all $\alpha$-approximate algorithms are robust (i.e., near $\alpha$-approximate) if the utility is Lipschitz continuous in the prior. We further show that robustness may not be achieved if the utility is non-Lipschitz. This suggests we should use a Lipschitz utility for AL if robustness is required. For the minimum cost setting, we can also obtain a robustness result for approximate AL algorithms. Our results imply that many commonly used AL algorithms are robust against perturbed priors. We then propose the use of a mixture prior to alleviate the problem of prior misspecification. We analyze the robustness of the uniform mixture prior and show experimentally that it performs reasonably well in practice.
Tasks	Active Learning
Published	2016-03-30
URL	http://arxiv.org/abs/1603.09050v1
PDF	http://arxiv.org/pdf/1603.09050v1.pdf
PWC	https://paperswithcode.com/paper/robustness-of-bayesian-pool-based-active
Repo
Framework

Learning Shapes by Convex Composition


Title	Learning Shapes by Convex Composition
Authors	Alireza Aghasi, Justin Romberg
Abstract	We present a mathematical and algorithmic scheme for learning the principal geometric elements in an image or 3D object. We build on recent work that convexifies the basic problem of finding a combination of a small number shapes that overlap and occlude one another in such a way that they “match” a given scene as closely as possible. This paper derives general sufficient conditions under which this convex shape composition identifies a target composition. From a computational standpoint, we present two different methods for solving the associated optimization programs. The first method simply recasts the problem as a linear program, while the second uses the alternating direction method of multipliers with a series of easily computed proximal operators.
Tasks
Published	2016-02-23
URL	http://arxiv.org/abs/1602.07613v2
PDF	http://arxiv.org/pdf/1602.07613v2.pdf
PWC	https://paperswithcode.com/paper/learning-shapes-by-convex-composition
Repo
Framework

An Overview on Data Representation Learning: From Traditional Feature Learning to Recent Deep Learning


Title	An Overview on Data Representation Learning: From Traditional Feature Learning to Recent Deep Learning
Authors	Guoqiang Zhong, Li-Na Wang, Junyu Dong
Abstract	Since about 100 years ago, to learn the intrinsic structure of data, many representation learning approaches have been proposed, including both linear ones and nonlinear ones, supervised ones and unsupervised ones. Particularly, deep architectures are widely applied for representation learning in recent years, and have delivered top results in many tasks, such as image classification, object detection and speech recognition. In this paper, we review the development of data representation learning methods. Specifically, we investigate both traditional feature learning algorithms and state-of-the-art deep learning models. The history of data representation learning is introduced, while available resources (e.g. online course, tutorial and book information) and toolboxes are provided. Finally, we conclude this paper with remarks and some interesting research directions on data representation learning.
Tasks	Image Classification, Object Detection, Representation Learning, Speech Recognition
Published	2016-11-25
URL	http://arxiv.org/abs/1611.08331v1
PDF	http://arxiv.org/pdf/1611.08331v1.pdf
PWC	https://paperswithcode.com/paper/an-overview-on-data-representation-learning
Repo
Framework

On-line Bayesian System Identification


Title	On-line Bayesian System Identification
Authors	Diego Romeres, Giulia Prando, Gianluigi Pillonetto, Alessandro Chiuso
Abstract	We consider an on-line system identification setting, in which new data become available at given time steps. In order to meet real-time estimation requirements, we propose a tailored Bayesian system identification procedure, in which the hyper-parameters are still updated through Marginal Likelihood maximization, but after only one iteration of a suitable iterative optimization algorithm. Both gradient methods and the EM algorithm are considered for the Marginal Likelihood optimization. We compare this “1-step” procedure with the standard one, in which the optimization method is run until convergence to a local minimum. The experiments we perform confirm the effectiveness of the approach we propose.
Tasks
Published	2016-01-17
URL	http://arxiv.org/abs/1601.04251v1
PDF	http://arxiv.org/pdf/1601.04251v1.pdf
PWC	https://paperswithcode.com/paper/on-line-bayesian-system-identification
Repo
Framework

A metric for sets of trajectories that is practical and mathematically consistent


Title	A metric for sets of trajectories that is practical and mathematically consistent
Authors	José Bento, Jia Jie Zhu
Abstract	Metrics on the space of sets of trajectories are important for scientists in the field of computer vision, machine learning, robotics, and general artificial intelligence. However, existing notions of closeness between sets of trajectories are either mathematically inconsistent or of limited practical use. In this paper, we outline the limitations in the current mathematically-consistent metrics, which are based on OSPA (Schuhmacher et al. 2008); and the inconsistencies in the heuristic notions of closeness used in practice, whose main ideas are common to the CLEAR MOT measures (Keni and Rainer 2008) widely used in computer vision. In two steps, we then propose a new intuitive metric between sets of trajectories and address these limitations. First, we explain a solution that leads to a metric that is hard to compute. Then we modify this formulation to obtain a metric that is easy to compute while keeping the useful properties of the previous metric. Our notion of closeness is the first demonstrating the following three features: the metric 1) can be quickly computed, 2) incorporates confusion of trajectories’ identity in an optimal way, and 3) is a metric in the mathematical sense.
Tasks
Published	2016-01-12
URL	http://arxiv.org/abs/1601.03094v2
PDF	http://arxiv.org/pdf/1601.03094v2.pdf
PWC	https://paperswithcode.com/paper/a-metric-for-sets-of-trajectories-that-is
Repo
Framework

Non-Gaussian Component Analysis with Log-Density Gradient Estimation


Title	Non-Gaussian Component Analysis with Log-Density Gradient Estimation
Authors	Hiroaki Sasaki, Gang Niu, Masashi Sugiyama
Abstract	Non-Gaussian component analysis (NGCA) is aimed at identifying a linear subspace such that the projected data follows a non-Gaussian distribution. In this paper, we propose a novel NGCA algorithm based on log-density gradient estimation. Unlike existing methods, the proposed NGCA algorithm identifies the linear subspace by using the eigenvalue decomposition without any iterative procedures, and thus is computationally reasonable. Furthermore, through theoretical analysis, we prove that the identified subspace converges to the true subspace at the optimal parametric rate. Finally, the practical performance of the proposed algorithm is demonstrated on both artificial and benchmark datasets.
Tasks
Published	2016-01-28
URL	http://arxiv.org/abs/1601.07665v1
PDF	http://arxiv.org/pdf/1601.07665v1.pdf
PWC	https://paperswithcode.com/paper/non-gaussian-component-analysis-with-log
Repo
Framework

Low-Rank Matrix Recovery using Gabidulin Codes in Characteristic Zero


Title	Low-Rank Matrix Recovery using Gabidulin Codes in Characteristic Zero
Authors	Sven Müelich, Sven Puchinger, Martin Bossert
Abstract	We present a new approach on low-rank matrix recovery (LRMR) based on Gabidulin Codes. Since most applications of LRMR deal with matrices over infinite fields, we use the recently introduced generalization of Gabidulin codes to fields of characterstic zero. We show that LRMR can be reduced to decoding of Gabidulin codes and discuss which field extensions can be used in the code construction.
Tasks
Published	2016-04-15
URL	http://arxiv.org/abs/1604.04397v2
PDF	http://arxiv.org/pdf/1604.04397v2.pdf
PWC	https://paperswithcode.com/paper/low-rank-matrix-recovery-using-gabidulin
Repo
Framework

A coarse-to-fine algorithm for registration in 3D street-view cross-source point clouds


Title	A coarse-to-fine algorithm for registration in 3D street-view cross-source point clouds
Authors	Xiaoshui Huang, Jian Zhang, Qiang Wu, Lixin Fan, Chun Yuan
Abstract	With the development of numerous 3D sensing technologies, object registration on cross-source point cloud has aroused researchers’ interests. When the point clouds are captured from different kinds of sensors, there are large and different kinds of variations. In this study, we address an even more challenging case in which the differently-source point clouds are acquired from a real street view. One is produced directly by the LiDAR system and the other is generated by using VSFM software on image sequence captured from RGB cameras. When it confronts to large scale point clouds, previous methods mostly focus on point-to-point level registration, and the methods have many limitations.The reason is that the least mean error strategy shows poor ability in registering large variable cross-source point clouds. In this paper, different from previous ICP-based methods, and from a statistic view, we propose a effective coarse-to-fine algorithm to detect and register a small scale SFM point cloud in a large scale Lidar point cloud. Seen from the experimental results, the model can successfully run on LiDAR and SFM point clouds, hence it can make a contribution to many applications, such as robotics and smart city development.
Tasks
Published	2016-10-24
URL	http://arxiv.org/abs/1610.07324v1
PDF	http://arxiv.org/pdf/1610.07324v1.pdf
PWC	https://paperswithcode.com/paper/a-coarse-to-fine-algorithm-for-registration
Repo
Framework

Interactive Bayesian Hierarchical Clustering


Title	Interactive Bayesian Hierarchical Clustering
Authors	Sharad Vikram, Sanjoy Dasgupta
Abstract	Clustering is a powerful tool in data analysis, but it is often difficult to find a grouping that aligns with a user’s needs. To address this, several methods incorporate constraints obtained from users into clustering algorithms, but unfortunately do not apply to hierarchical clustering. We design an interactive Bayesian algorithm that incorporates user interaction into hierarchical clustering while still utilizing the geometry of the data by sampling a constrained posterior distribution over hierarchies. We also suggest several ways to intelligently query a user. The algorithm, along with the querying schemes, shows promising results on real data.
Tasks
Published	2016-02-10
URL	http://arxiv.org/abs/1602.03258v3
PDF	http://arxiv.org/pdf/1602.03258v3.pdf
PWC	https://paperswithcode.com/paper/interactive-bayesian-hierarchical-clustering
Repo
Framework

On numerical approximation schemes for expectation propagation


Title	On numerical approximation schemes for expectation propagation
Authors	Alexis Roche
Abstract	Several numerical approximation strategies for the expectation-propagation algorithm are studied in the context of large-scale learning: the Laplace method, a faster variant of it, Gaussian quadrature, and a deterministic version of variational sampling (i.e., combining quadrature with variational approximation). Experiments in training linear binary classifiers show that the expectation-propagation algorithm converges best using variational sampling, while it also converges well using Laplace-style methods with smooth factors but tends to be unstable with non-differentiable ones. Gaussian quadrature yields unstable behavior or convergence to a sub-optimal solution in most experiments.
Tasks
Published	2016-11-14
URL	http://arxiv.org/abs/1611.04416v1
PDF	http://arxiv.org/pdf/1611.04416v1.pdf
PWC	https://paperswithcode.com/paper/on-numerical-approximation-schemes-for
Repo
Framework

Incremental Nonparametric Weighted Feature Extraction for OnlineSubspace Pattern Classification


Title	Incremental Nonparametric Weighted Feature Extraction for OnlineSubspace Pattern Classification
Authors	Hamid Abrishami Moghaddam, Elaheh Raisi
Abstract	In this paper, a new online method based on nonparametric weighted feature extraction (NWFE) is proposed. NWFE was introduced to enjoy optimum characteristics of linear discriminant analysis (LDA) and nonparametric discriminant analysis (NDA) while rectifying their drawbacks. It emphasizes the points near decision boundary by putting greater weights on them and deemphasizes other points. Incremental nonparametric weighted feature extraction (INWFE) is the online version of NWFE. INWFE has advantages of NWFE method such as extracting more than L-1 features in contrast to LDA. It is independent of the class distribution and performs well in complex distributed data. The effects of outliers are reduced due to the nature of its nonparametric scatter matrix. Furthermore, it is possible to add new samples asynchronously, i.e. whenever a new sample becomes available at any given time, it can be added to the algorithm. This is useful for many real world applications since all data cannot be available in advance. This method is implemented on Gaussian and non-Gaussian multidimensional data, a number of UCI datasets and Indian Pine dataset. Results are compared with NWFE in terms of classification accuracy and execution time. For nearest neighbour classifier it shows that this technique converges to NWFE at the end of learning process. In addition, the computational complexity is reduced in comparison with NWFE in terms of execution time.
Tasks
Published	2016-10-26
URL	http://arxiv.org/abs/1610.08133v1
PDF	http://arxiv.org/pdf/1610.08133v1.pdf
PWC	https://paperswithcode.com/paper/incremental-nonparametric-weighted-feature
Repo
Framework

POSEidon: Face-from-Depth for Driver Pose Estimation


Title	POSEidon: Face-from-Depth for Driver Pose Estimation
Authors	Guido Borghi, Marco Venturelli, Roberto Vezzani, Rita Cucchiara
Abstract	Fast and accurate upper-body and head pose estimation is a key task for automatic monitoring of driver attention, a challenging context characterized by severe illumination changes, occlusions and extreme poses. In this work, we present a new deep learning framework for head localization and pose estimation on depth images. The core of the proposal is a regression neural network, called POSEidon, which is composed of three independent convolutional nets followed by a fusion layer, specially conceived for understanding the pose by depth. In addition, to recover the intrinsic value of face appearance for understanding head position and orientation, we propose a new Face-from-Depth approach for learning image faces from depth. Results in face reconstruction are qualitatively impressive. We test the proposed framework on two public datasets, namely Biwi Kinect Head Pose and ICT-3DHP, and on Pandora, a new challenging dataset mainly inspired by the automotive setup. Results show that our method overcomes all recent state-of-art works, running in real time at more than 30 frames per second.
Tasks	Face Reconstruction, Head Pose Estimation, Pose Estimation
Published	2016-11-30
URL	http://arxiv.org/abs/1611.10195v3
PDF	http://arxiv.org/pdf/1611.10195v3.pdf
PWC	https://paperswithcode.com/paper/poseidon-face-from-depth-for-driver-pose
Repo
Framework

The Myopia of Crowds: A Study of Collective Evaluation on Stack Exchange


Title	The Myopia of Crowds: A Study of Collective Evaluation on Stack Exchange
Authors	Keith Burghardt, Emanuel F. Alsina, Michelle Girvan, William Rand, Kristina Lerman
Abstract	Crowds can often make better decisions than individuals or small groups of experts by leveraging their ability to aggregate diverse information. Question answering sites, such as Stack Exchange, rely on the “wisdom of crowds” effect to identify the best answers to questions asked by users. We analyze data from 250 communities on the Stack Exchange network to pinpoint factors affecting which answers are chosen as the best answers. Our results suggest that, rather than evaluate all available answers to a question, users rely on simple cognitive heuristics to choose an answer to vote for or accept. These cognitive heuristics are linked to an answer’s salience, such as the order in which it is listed and how much screen space it occupies. While askers appear to depend more on heuristics, compared to voting users, when choosing an answer to accept as the most helpful one, voters use acceptance itself as a heuristic: they are more likely to choose the answer after it is accepted than before that very same answer was accepted. These heuristics become more important in explaining and predicting behavior as the number of available answers increases. Our findings suggest that crowd judgments may become less reliable as the number of answers grow.
Tasks	Question Answering
Published	2016-02-24
URL	http://arxiv.org/abs/1602.07388v1
PDF	http://arxiv.org/pdf/1602.07388v1.pdf
PWC	https://paperswithcode.com/paper/the-myopia-of-crowds-a-study-of-collective
Repo
Framework

Dictionary Learning for Robotic Grasp Recognition and Detection


Title	Dictionary Learning for Robotic Grasp Recognition and Detection
Authors	Ludovic Trottier, Philippe Giguère, Brahim Chaib-draa
Abstract	The ability to grasp ordinary and potentially never-seen objects is an important feature in both domestic and industrial robotics. For a system to accomplish this, it must autonomously identify grasping locations by using information from various sensors, such as Microsoft Kinect 3D camera. Despite numerous progress, significant work still remains to be done in this field. To this effect, we propose a dictionary learning and sparse representation (DLSR) framework for representing RGBD images from 3D sensors in the context of determining such good grasping locations. In contrast to previously proposed approaches that relied on sophisticated regularization or very large datasets, the derived perception system has a fast training phase and can work with small datasets. It is also theoretically founded for dealing with masked-out entries, which are common with 3D sensors. We contribute by presenting a comparative study of several DLSR approach combinations for recognizing and detecting grasp candidates on the standard Cornell dataset. Importantly, experimental results show a performance improvement of 1.69% in detection and 3.16% in recognition over current state-of-the-art convolutional neural network (CNN). Even though nowadays most popular vision-based approach is CNN, this suggests that DLSR is also a viable alternative with interesting advantages that CNN has not.
Tasks	Dictionary Learning
Published	2016-06-02
URL	http://arxiv.org/abs/1606.00538v1
PDF	http://arxiv.org/pdf/1606.00538v1.pdf
PWC	https://paperswithcode.com/paper/dictionary-learning-for-robotic-grasp
Repo
Framework

Modeling Group Dynamics Using Probabilistic Tensor Decompositions


Title	Modeling Group Dynamics Using Probabilistic Tensor Decompositions
Authors	Lin Li, Ananthram Swami, Anna Scaglione
Abstract	We propose a probabilistic modeling framework for learning the dynamic patterns in the collective behaviors of social agents and developing profiles for different behavioral groups, using data collected from multiple information sources. The proposed model is based on a hierarchical Bayesian process, in which each observation is a finite mixture of an set of latent groups and the mixture proportions (i.e., group probabilities) are drawn randomly. Each group is associated with some distributions over a finite set of outcomes. Moreover, as time evolves, the structure of these groups also changes; we model the change in the group structure by a hidden Markov model (HMM) with a fixed transition probability. We present an efficient inference method based on tensor decompositions and the expectation-maximization (EM) algorithm for parameter estimation.
Tasks
Published	2016-06-24
URL	http://arxiv.org/abs/1606.07840v1
PDF	http://arxiv.org/pdf/1606.07840v1.pdf
PWC	https://paperswithcode.com/paper/modeling-group-dynamics-using-probabilistic
Repo
Framework