May 5, 2019

2965 words 14 mins read

Paper Group ANR 523

Spoofing 2D Face Detection: Machines See People Who Aren’t There. Training Neural Networks Without Gradients: A Scalable ADMM Approach. Model-Free Trajectory-based Policy Optimization with Monotonic Improvement. A Neural Architecture Mimicking Humans End-to-End for Natural Language Inference. Automatic TM Cleaning through MT and POS Tagging: Autode …

Spoofing 2D Face Detection: Machines See People Who Aren’t There


Title	Spoofing 2D Face Detection: Machines See People Who Aren’t There
Authors	Michael McCoyd, David Wagner
Abstract	Machine learning is increasingly used to make sense of the physical world yet may suffer from adversarial manipulation. We examine the Viola-Jones 2D face detection algorithm to study whether images can be created that humans do not notice as faces yet the algorithm detects as faces. We show that it is possible to construct images that Viola-Jones recognizes as containing faces yet no human would consider a face. Moreover, we show that it is possible to construct images that fool facial detection even when they are printed and then photographed.
Tasks	Face Detection
Published	2016-08-06
URL	http://arxiv.org/abs/1608.02128v1
PDF	http://arxiv.org/pdf/1608.02128v1.pdf
PWC	https://paperswithcode.com/paper/spoofing-2d-face-detection-machines-see
Repo
Framework

Training Neural Networks Without Gradients: A Scalable ADMM Approach


Title	Training Neural Networks Without Gradients: A Scalable ADMM Approach
Authors	Gavin Taylor, Ryan Burmeister, Zheng Xu, Bharat Singh, Ankit Patel, Tom Goldstein
Abstract	With the growing importance of large network models and enormous training datasets, GPUs have become increasingly necessary to train neural networks. This is largely because conventional optimization algorithms rely on stochastic gradient methods that don’t scale well to large numbers of cores in a cluster setting. Furthermore, the convergence of all gradient methods, including batch methods, suffers from common problems like saturation effects, poor conditioning, and saddle points. This paper explores an unconventional training method that uses alternating direction methods and Bregman iteration to train networks without gradient descent steps. The proposed method reduces the network training problem to a sequence of minimization sub-steps that can each be solved globally in closed form. The proposed method is advantageous because it avoids many of the caveats that make gradient methods slow on highly non-convex problems. The method exhibits strong scaling in the distributed setting, yielding linear speedups even when split over thousands of cores.
Tasks
Published	2016-05-06
URL	http://arxiv.org/abs/1605.02026v1
PDF	http://arxiv.org/pdf/1605.02026v1.pdf
PWC	https://paperswithcode.com/paper/training-neural-networks-without-gradients-a
Repo
Framework

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement


Title	Model-Free Trajectory-based Policy Optimization with Monotonic Improvement
Authors	Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Jan Peters, Gerhard Neumann
Abstract	Many of the recent trajectory optimization algorithms alternate between linear approximation of the system dynamics around the mean trajectory and conservative policy update. One way of constraining the policy change is by bounding the Kullback-Leibler (KL) divergence between successive policies. These approaches already demonstrated great experimental success in challenging problems such as end-to-end control of physical systems. However, the linear approximation of the system dynamics can introduce a bias in the policy update and prevent convergence to the optimal policy. In this article, we propose a new model-free trajectory-based policy optimization algorithm with guaranteed monotonic improvement. The algorithm backpropagates a local, quadratic and time-dependent \qfunc~learned from trajectory data instead of a model of the system dynamics. Our policy update ensures exact KL-constraint satisfaction without simplifying assumptions on the system dynamics. We experimentally demonstrate on highly non-linear control tasks the improvement in performance of our algorithm in comparison to approaches linearizing the system dynamics. In order to show the monotonic improvement of our algorithm, we additionally conduct a theoretical analysis of our policy update scheme to derive a lower bound of the change in policy return between successive iterations.
Tasks
Published	2016-06-29
URL	http://arxiv.org/abs/1606.09197v4
PDF	http://arxiv.org/pdf/1606.09197v4.pdf
PWC	https://paperswithcode.com/paper/model-free-trajectory-based-policy
Repo
Framework

A Neural Architecture Mimicking Humans End-to-End for Natural Language Inference


Title	A Neural Architecture Mimicking Humans End-to-End for Natural Language Inference
Authors	Biswajit Paria, K. M. Annervaz, Ambedkar Dukkipati, Ankush Chatterjee, Sanjay Podder
Abstract	In this work we use the recent advances in representation learning to propose a neural architecture for the problem of natural language inference. Our approach is aligned to mimic how a human does the natural language inference process given two statements. The model uses variants of Long Short Term Memory (LSTM), attention mechanism and composable neural networks, to carry out the task. Each part of our model can be mapped to a clear functionality humans do for carrying out the overall task of natural language inference. The model is end-to-end differentiable enabling training by stochastic gradient descent. On Stanford Natural Language Inference(SNLI) dataset, the proposed model achieves better accuracy numbers than all published models in literature.
Tasks	Natural Language Inference, Representation Learning
Published	2016-11-15
URL	http://arxiv.org/abs/1611.04741v2
PDF	http://arxiv.org/pdf/1611.04741v2.pdf
PWC	https://paperswithcode.com/paper/a-neural-architecture-mimicking-humans-end-to
Repo
Framework

Automatic TM Cleaning through MT and POS Tagging: Autodesk’s Submission to the NLP4TM 2016 Shared Task


Title	Automatic TM Cleaning through MT and POS Tagging: Autodesk’s Submission to the NLP4TM 2016 Shared Task
Authors	Alena Zwahlen, Olivier Carnal, Samuel Läubli
Abstract	We describe a machine learning based method to identify incorrect entries in translation memories. It extends previous work by Barbu (2015) through incorporating recall-based machine translation and part-of-speech-tagging features. Our system ranked first in the Binary Classification (II) task for two out of three language pairs: English-Italian and English-Spanish.
Tasks	Machine Translation, Part-Of-Speech Tagging
Published	2016-05-19
URL	http://arxiv.org/abs/1605.05906v1
PDF	http://arxiv.org/pdf/1605.05906v1.pdf
PWC	https://paperswithcode.com/paper/automatic-tm-cleaning-through-mt-and-pos
Repo
Framework

Optimization of Convolutional Neural Network using Microcanonical Annealing Algorithm


Title	Optimization of Convolutional Neural Network using Microcanonical Annealing Algorithm
Authors	Vina Ayumi, L. M. Rasdi Rere, Mohamad Ivan Fanany, Aniati Murni Arymurthy
Abstract	Convolutional neural network (CNN) is one of the most prominent architectures and algorithm in Deep Learning. It shows a remarkable improvement in the recognition and classification of objects. This method has also been proven to be very effective in a variety of computer vision and machine learning problems. As in other deep learning, however, training the CNN is interesting yet challenging. Recently, some metaheuristic algorithms have been used to optimize CNN using Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing and Harmony Search. In this paper, another type of metaheuristic algorithms with different strategy has been proposed, i.e. Microcanonical Annealing to optimize Convolutional Neural Network. The performance of the proposed method is tested using the MNIST and CIFAR-10 datasets. Although experiment results of MNIST dataset indicate the increase in computation time (1.02x - 1.38x), nevertheless this proposed method can considerably enhance the performance of the original CNN (up to 4.60%). On the CIFAR10 dataset, currently, state of the art is 96.53% using fractional pooling, while this proposed method achieves 99.14%.
Tasks
Published	2016-10-07
URL	http://arxiv.org/abs/1610.02306v1
PDF	http://arxiv.org/pdf/1610.02306v1.pdf
PWC	https://paperswithcode.com/paper/optimization-of-convolutional-neural-network
Repo
Framework

Distributed Estimation of the Operating State of a Single-Bus DC MicroGrid without an External Communication Interface


Title	Distributed Estimation of the Operating State of a Single-Bus DC MicroGrid without an External Communication Interface
Authors	Marko Angjelichinoski, Anna Scaglione, Petar Popovski, Cedomir Stefanovic
Abstract	We propose a decentralized Maximum Likelihood solution for estimating the stochastic renewable power generation and demand in single bus Direct Current (DC) MicroGrids (MGs), with high penetration of droop controlled power electronic converters. The solution relies on the fact that the primary control parameters are set in accordance with the local power generation status of the generators. Therefore, the steady state voltage is inherently dependent on the generation capacities and the load, through a non-linear parametric model, which can be estimated. To have a well conditioned estimation problem, our solution avoids the use of an external communication interface and utilizes controlled voltage disturbances to perform distributed training. Using this tool, we develop an efficient, decentralized Maximum Likelihood Estimator (MLE) and formulate the sufficient condition for the existence of the globally optimal solution. The numerical results illustrate the promising performance of our MLE algorithm.
Tasks
Published	2016-09-14
URL	http://arxiv.org/abs/1609.04623v1
PDF	http://arxiv.org/pdf/1609.04623v1.pdf
PWC	https://paperswithcode.com/paper/distributed-estimation-of-the-operating-state
Repo
Framework

Robust Structure from Motion in the Presence of Outliers and Missing Data


Title	Robust Structure from Motion in the Presence of Outliers and Missing Data
Authors	Guanghui Wang
Abstract	Structure from motion is an import theme in computer vision. Although great progress has been made both in theory and applications, most of the algorithms only work for static scenes and rigid objects. In recent years, structure and motion recovery of non-rigid objects and dynamic scenes have received a lot of attention. In this paper, the state-of-the-art techniques for structure and motion factorization of non-rigid objects are reviewed and discussed. First, an introduction of the structure from motion problem is presented, followed by a general formulation of non-rigid structure from motion. Second, an augmented affined factorization framework, by using homogeneous representation, is presented to solve the registration issue in the presence of outlying and missing data. Third, based on the observation that the reprojection residuals of outliers are significantly larger than those of inliers, a robust factorization strategy with outlier rejection is proposed by means of the reprojection residuals, followed by some comparative experimental evaluations. Finally, some future research topics in non-rigid structure from motion are discussed.
Tasks
Published	2016-09-09
URL	http://arxiv.org/abs/1609.02638v2
PDF	http://arxiv.org/pdf/1609.02638v2.pdf
PWC	https://paperswithcode.com/paper/robust-structure-from-motion-in-the-presence
Repo
Framework

A New Type-II Fuzzy Logic Based Controller for Non-linear Dynamical Systems with Application to a 3-PSP Parallel Robot


Title	A New Type-II Fuzzy Logic Based Controller for Non-linear Dynamical Systems with Application to a 3-PSP Parallel Robot
Authors	Hamid Reza Hassanzadeh
Abstract	The concept of uncertainty is posed in almost any complex system including parallel robots as an outstanding instance of dynamical robotics systems. As suggested by the name, uncertainty, is some missing information that is beyond the knowledge of human thus we may tend to handle it properly to minimize the side-effects through the control process. Type-II fuzzy logic has shown its superiority over traditional fuzzy logic when dealing with uncertainty. Type-II fuzzy logic controllers are however newer and more promising approaches that have been recently applied to various fields due to their significant contribution especially when noise (as an important instance of uncertainty) emerges. During the design of Type-I fuzzy logic systems, we presume that we are almost certain about the fuzzy membership functions which is not true in many cases. Thus T2FLS as a more realistic approach dealing with practical applications might have a lot to offer. Type-II fuzzy logic takes into account a higher level of uncertainty, in other words, the membership grade for a type-II fuzzy variable is no longer a crisp number but rather is itself a type-I linguistic term. In this thesis the effects of uncertainty in dynamic control of a parallel robot is considered. More specifically, it is intended to incorporate the Type-II Fuzzy Logic paradigm into a model based controller, the so-called computed torque control method, and apply the result to a 3 degrees of freedom parallel manipulator. …
Tasks
Published	2016-12-05
URL	http://arxiv.org/abs/1612.01399v1
PDF	http://arxiv.org/pdf/1612.01399v1.pdf
PWC	https://paperswithcode.com/paper/a-new-type-ii-fuzzy-logic-based-controller
Repo
Framework


Title	Subspace Clustering Based Tag Sharing for Inductive Tag Matrix Refinement with Complex Errors
Authors	Yuqing Hou, Zhouchen Lin, Jin-ge Yao
Abstract	Annotating images with tags is useful for indexing and retrieving images. However, many available annotation data include missing or inaccurate annotations. In this paper, we propose an image annotation framework which sequentially performs tag completion and refinement. We utilize the subspace property of data via sparse subspace clustering for tag completion. Then we propose a novel matrix completion model for tag refinement, integrating visual correlation, semantic correlation and the novelly studied property of complex errors. The proposed method outperforms the state-of-the-art approaches on multiple benchmark datasets even when they contain certain levels of annotation noise.
Tasks	Matrix Completion
Published	2016-01-12
URL	http://arxiv.org/abs/1601.03055v3
PDF	http://arxiv.org/pdf/1601.03055v3.pdf
PWC	https://paperswithcode.com/paper/subspace-clustering-based-tag-sharing-for
Repo
Framework


Title	Utilizing High-level Visual Feature for Indoor Shopping Mall Navigation
Authors	Ziwei Xu, Haitian Zheng, Minjian Pang, Yangchun Zhu, Xiongfei Su, Guyue Zhou, Lu Fang
Abstract	Towards robust and convenient indoor shopping mall navigation, we propose a novel learning-based scheme to utilize the high-level visual information from the storefront images captured by personal devices of users. Specifically, we decompose the visual navigation problem into localization and map generation respectively. Given a storefront input image, a novel feature fusion scheme (denoted as FusionNet) is proposed by fusing the distinguishing DNN-based appearance feature and text feature for robust recognition of store brands, which serves for accurate localization. Regarding the map generation, we convert the user-captured indicator map of the shopping mall into a topological map by parsing the stores and their connectivity. Experimental results conducted on the real shopping malls demonstrate that the proposed system achieves robust localization and precise map generation, enabling accurate navigation.
Tasks	Visual Navigation
Published	2016-10-06
URL	http://arxiv.org/abs/1610.01906v4
PDF	http://arxiv.org/pdf/1610.01906v4.pdf
PWC	https://paperswithcode.com/paper/utilizing-high-level-visual-feature-for
Repo
Framework

Causal inference for data-driven debugging and decision making in cloud computing


Title	Causal inference for data-driven debugging and decision making in cloud computing
Authors	Philipp Geiger, Lucian Carata, Bernhard Schoelkopf
Abstract	Cloud computing involves complex technical and economical systems and interactions. This brings about various challenges, two of which are: (1) debugging and control to optimize the performance of computing systems, with the help of sandbox experiments, and (2) privacy-preserving prediction of the cost of `spot'' resources for decision making of cloud clients. In this paper, we formalize debugging by counterfactual probabilities and control by post-(soft-)interventional probabilities. We prove that counterfactuals can approximately be calculated from a` stochastic’’ graphical causal model (while they are originally defined only for `deterministic'' functional causal models), and based on this sketch a data-driven approach to address problem (1). To address problem (2), we formalize bidding by post-(soft-)interventional probabilities and present a simple mathematical result on approximate integration of` incomplete’’ conditional probability distributions. We show how this can be used by cloud clients to trade off privacy against predictability of the outcome of their bidding actions in a toy scenario. We report experiments on simulated and real data.
Tasks	Causal Inference, Decision Making
Published	2016-03-04
URL	https://arxiv.org/abs/1603.01581v8
PDF	https://arxiv.org/pdf/1603.01581v8.pdf
PWC	https://paperswithcode.com/paper/causal-inference-for-cloud-computing
Repo
Framework


Title	Optimizing Gaze Direction in a Visual Navigation Task
Authors	Tuomas Välimäki, Risto Ritala
Abstract	Navigation in an unknown environment consists of multiple separable subtasks, such as collecting information about the surroundings and navigating to the current goal. In the case of pure visual navigation, all these subtasks need to utilize the same vision system, and therefore a way to optimally control the direction of focus is needed. We present a case study, where we model the active sensing problem of directing the gaze of a mobile robot with three machine vision cameras as a partially observable Markov decision process (POMDP) using a mutual information (MI) based reward function. The key aspect of the solution is that the cameras are dynamically used either in monocular or stereo configuration. The benefits of using the proposed active sensing implementation are demonstrated with simulations and experiments on a real robot.
Tasks	Visual Navigation
Published	2016-02-16
URL	http://arxiv.org/abs/1602.04981v1
PDF	http://arxiv.org/pdf/1602.04981v1.pdf
PWC	https://paperswithcode.com/paper/optimizing-gaze-direction-in-a-visual
Repo
Framework

Small-Variance Nonparametric Clustering on the Hypersphere


Title	Small-Variance Nonparametric Clustering on the Hypersphere
Authors	Julian Straub, Trevor Campbell, Jonathan P. How, John W. Fisher III
Abstract	Structural regularities in man-made environments reflect in the distribution of their surface normals. Describing these surface normal distributions is important in many computer vision applications, such as scene understanding, plane segmentation, and regularization of 3D reconstructions. Based on the small-variance limit of Bayesian nonparametric von-Mises-Fisher (vMF) mixture distributions, we propose two new flexible and efficient k-means-like clustering algorithms for directional data such as surface normals. The first, DP-vMF-means, is a batch clustering algorithm derived from the Dirichlet process (DP) vMF mixture. Recognizing the sequential nature of data collection in many applications, we extend this algorithm to DDP-vMF-means, which infers temporally evolving cluster structure from streaming data. Both algorithms naturally respect the geometry of directional data, which lies on the unit sphere. We demonstrate their performance on synthetic directional data and real 3D surface normals from RGB-D sensors. While our experiments focus on 3D data, both algorithms generalize to high dimensional directional data such as protein backbone configurations and semantic word vectors.
Tasks	Scene Understanding
Published	2016-07-21
URL	http://arxiv.org/abs/1607.06407v1
PDF	http://arxiv.org/pdf/1607.06407v1.pdf
PWC	https://paperswithcode.com/paper/small-variance-nonparametric-clustering-on
Repo
Framework

Persistent Homology on Grassmann Manifolds for Analysis of Hyperspectral Movies


Title	Persistent Homology on Grassmann Manifolds for Analysis of Hyperspectral Movies
Authors	Sofya Chepushtanova, Michael Kirby, Chris Peterson, Lori Ziegelmeier
Abstract	The existence of characteristic structure, or shape, in complex data sets has been recognized as increasingly important for mathematical data analysis. This realization has motivated the development of new tools such as persistent homology for exploring topological invariants, or features, in large data sets. In this paper we apply persistent homology to the characterization of gas plumes in time dependent sequences of hyperspectral cubes, i.e. the analysis of 4-way arrays. We investigate hyperspectral movies of Long-Wavelength Infrared data monitoring an experimental release of chemical simulant into the air. Our approach models regions of interest within the hyperspectral data cubes as points on the real Grassmann manifold $G(k, n)$ (whose points parameterize the $k$-dimensional subspaces of $\mathbb{R}^n$), contrasting our approach with the more standard framework in Euclidean space. An advantage of this approach is that it allows a sequence of time slices in a hyperspectral movie to be collapsed to a sequence of points in such a way that some of the key structure within and between the slices is encoded by the points on the Grassmann manifold. This motivates the search for topological features, associated with the evolution of the frames of a hyperspectral movie, within the corresponding points on the Grassmann manifold. The proposed mathematical model affords the processing of large data sets while retaining valuable discriminatory information. In this paper, we discuss how embedding our data in the Grassmann manifold, together with topological data analysis, captures dynamical events that occur as the chemical plume is released and evolves.
Tasks	Topological Data Analysis
Published	2016-07-07
URL	http://arxiv.org/abs/1607.02196v2
PDF	http://arxiv.org/pdf/1607.02196v2.pdf
PWC	https://paperswithcode.com/paper/persistent-homology-on-grassmann-manifolds
Repo
Framework

Paper Group ANR 523

Spoofing 2D Face Detection: Machines See People Who Aren’t There

Training Neural Networks Without Gradients: A Scalable ADMM Approach

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement

A Neural Architecture Mimicking Humans End-to-End for Natural Language Inference

Automatic TM Cleaning through MT and POS Tagging: Autodesk’s Submission to the NLP4TM 2016 Shared Task

Optimization of Convolutional Neural Network using Microcanonical Annealing Algorithm

Distributed Estimation of the Operating State of a Single-Bus DC MicroGrid without an External Communication Interface

Robust Structure from Motion in the Presence of Outliers and Missing Data

A New Type-II Fuzzy Logic Based Controller for Non-linear Dynamical Systems with Application to a 3-PSP Parallel Robot

Subspace Clustering Based Tag Sharing for Inductive Tag Matrix Refinement with Complex Errors

Utilizing High-level Visual Feature for Indoor Shopping Mall Navigation

Causal inference for data-driven debugging and decision making in cloud computing

Optimizing Gaze Direction in a Visual Navigation Task

Small-Variance Nonparametric Clustering on the Hypersphere

Persistent Homology on Grassmann Manifolds for Analysis of Hyperspectral Movies

Paper Group ANR 544

Paper Group ANR 558

Paper Group NANR 17