May 5, 2019

2653 words 13 mins read

Paper Group ANR 518

Constrained Cohort Intelligence using Static and Dynamic Penalty Function Approach for Mechanical Components Design. A probabilistic model for the numerical solution of initial value problems. Multichannel Variable-Size Convolution for Sentence Classification. Graphical RNN Models. Hyperspectral Unmixing with Endmember Variability using Partial Mem …

Constrained Cohort Intelligence using Static and Dynamic Penalty Function Approach for Mechanical Components Design


Title	Constrained Cohort Intelligence using Static and Dynamic Penalty Function Approach for Mechanical Components Design
Authors	Omkar Kulkarni, Ninad Kulkarni, Anand J Kulkarni, Ganesh Kakandikar
Abstract	Most of the metaheuristics can efficiently solve unconstrained problems; however, their performance may degenerate if the constraints are involved. This paper proposes two constraint handling approaches for an emerging metaheuristic of Cohort Intelligence (CI). More specifically CI with static penalty function approach (SCI) and CI with dynamic penalty function approach (DCI) are proposed. The approaches have been tested by solving several constrained test problems. The performance of the SCI and DCI have been compared with algorithms like GA, PSO, ABC, d-Ds. In addition, as well as three real world problems from mechanical engineering domain with improved solutions. The results were satisfactory and validated the applicability of CI methodology for solving real world problems.
Tasks
Published	2016-09-26
URL	http://arxiv.org/abs/1610.06009v1
PDF	http://arxiv.org/pdf/1610.06009v1.pdf
PWC	https://paperswithcode.com/paper/constrained-cohort-intelligence-using-static
Repo
Framework

A probabilistic model for the numerical solution of initial value problems


Title	A probabilistic model for the numerical solution of initial value problems
Authors	Michael Schober, Simo Särkkä, Philipp Hennig
Abstract	Like many numerical methods, solvers for initial value problems (IVPs) on ordinary differential equations estimate an analytically intractable quantity, using the results of tractable computations as inputs. This structure is closely connected to the notion of inference on latent variables in statistics. We describe a class of algorithms that formulate the solution to an IVP as inference on a latent path that is a draw from a Gaussian process probability measure (or equivalently, the solution of a linear stochastic differential equation). We then show that certain members of this class are connected precisely to generalized linear methods for ODEs, a number of Runge–Kutta methods, and Nordsieck methods. This probabilistic formulation of classic methods is valuable in two ways: analytically, it highlights implicit prior assumptions favoring certain approximate solutions to the IVP over others, and gives a precise meaning to the old observation that these methods act like filters. Practically, it endows the classic solvers with `docking points’ for notions of uncertainty and prior information about the initial value, the value of the ODE itself, and the solution of the problem. \|
Tasks
Published	2016-10-17
URL	http://arxiv.org/abs/1610.05261v3
PDF	http://arxiv.org/pdf/1610.05261v3.pdf
PWC	https://paperswithcode.com/paper/a-probabilistic-model-for-the-numerical
Repo
Framework

Multichannel Variable-Size Convolution for Sentence Classification


Title	Multichannel Variable-Size Convolution for Sentence Classification
Authors	Wenpeng Yin, Hinrich Schütze
Abstract	We propose MVCNN, a convolution neural network (CNN) architecture for sentence classification. It (i) combines diverse versions of pretrained word embeddings and (ii) extracts features of multigranular phrases with variable-size convolution filters. We also show that pretraining MVCNN is critical for good performance. MVCNN achieves state-of-the-art performance on four tasks: on small-scale binary, small-scale multi-class and largescale Twitter sentiment prediction and on subjectivity classification.
Tasks	Sentence Classification, Word Embeddings
Published	2016-03-15
URL	http://arxiv.org/abs/1603.04513v1
PDF	http://arxiv.org/pdf/1603.04513v1.pdf
PWC	https://paperswithcode.com/paper/multichannel-variable-size-convolution-for
Repo
Framework

Graphical RNN Models


Title	Graphical RNN Models
Authors	Ashish Bora, Sugato Basu, Joydeep Ghosh
Abstract	Many time series are generated by a set of entities that interact with one another over time. This paper introduces a broad, flexible framework to learn from multiple inter-dependent time series generated by such entities. Our framework explicitly models the entities and their interactions through time. It achieves this by building on the capabilities of Recurrent Neural Networks, while also offering several ways to incorporate domain knowledge/constraints into the model architecture. The capabilities of our approach are showcased through an application to weather prediction, which shows gains over strong baselines.
Tasks	Time Series
Published	2016-12-15
URL	http://arxiv.org/abs/1612.05054v1
PDF	http://arxiv.org/pdf/1612.05054v1.pdf
PWC	https://paperswithcode.com/paper/graphical-rnn-models
Repo
Framework

Hyperspectral Unmixing with Endmember Variability using Partial Membership Latent Dirichlet Allocation


Title	Hyperspectral Unmixing with Endmember Variability using Partial Membership Latent Dirichlet Allocation
Authors	Sheng Zou, Alina Zare
Abstract	The application of Partial Membership Latent Dirichlet Allocation(PM-LDA) for hyperspectral endmember estimation and spectral unmixing is presented. PM-LDA provides a model for a hyperspectral image analysis that accounts for spectral variability and incorporates spatial information through the use of superpixel-based ‘documents.’ In our application of PM-LDA, we employ the Normal Compositional Model in which endmembers are represented as Normal distributions to account for spectral variability and proportion vectors are modeled as random variables governed by a Dirichlet distribution. The use of the Dirichlet distribution enforces positivity and sum-to-one constraints on the proportion values. Algorithm results on real hyperspectral data indicate that PM-LDA produces endmember distributions that represent the ground truth classes and their associated variability.
Tasks	Hyperspectral Unmixing
Published	2016-09-12
URL	http://arxiv.org/abs/1609.03500v1
PDF	http://arxiv.org/pdf/1609.03500v1.pdf
PWC	https://paperswithcode.com/paper/hyperspectral-unmixing-with-endmember-1
Repo
Framework


Title	A Real-Time Deep Learning Pedestrian Detector for Robot Navigation
Authors	David Ribeiro, Andre Mateus, Pedro Miraldo, Jacinto C. Nascimento
Abstract	A real-time Deep Learning based method for Pedestrian Detection (PD) is applied to the Human-Aware robot navigation problem. The pedestrian detector combines the Aggregate Channel Features (ACF) detector with a deep Convolutional Neural Network (CNN) in order to obtain fast and accurate performance. Our solution is firstly evaluated using a set of real images taken from onboard and offboard cameras and, then, it is validated in a typical robot navigation environment with pedestrians (two distinct experiments are conducted). The results on both tests show that our pedestrian detector is robust and fast enough to be used on robot navigation applications.
Tasks	Pedestrian Detection, Robot Navigation
Published	2016-07-15
URL	http://arxiv.org/abs/1607.04436v2
PDF	http://arxiv.org/pdf/1607.04436v2.pdf
PWC	https://paperswithcode.com/paper/a-real-time-deep-learning-pedestrian-detector
Repo
Framework

Model Selection for Gaussian Process Regression by Approximation Set Coding


Title	Model Selection for Gaussian Process Regression by Approximation Set Coding
Authors	Benjamin Fischer, Nico Gorbach, Stefan Bauer, Yatao Bian, Joachim M. Buhmann
Abstract	Gaussian processes are powerful, yet analytically tractable models for supervised learning. A Gaussian process is characterized by a mean function and a covariance function (kernel), which are determined by a model selection criterion. The functions to be compared do not just differ in their parametrization but in their fundamental structure. It is often not clear which function structure to choose, for instance to decide between a squared exponential and a rational quadratic kernel. Based on the principle of approximation set coding, we develop a framework for model selection to rank kernels for Gaussian process regression. In our experiments approximation set coding shows promise to become a model selection criterion competitive with maximum evidence (also called marginal likelihood) and leave-one-out cross-validation.
Tasks	Gaussian Processes, Model Selection
Published	2016-10-04
URL	http://arxiv.org/abs/1610.00907v1
PDF	http://arxiv.org/pdf/1610.00907v1.pdf
PWC	https://paperswithcode.com/paper/model-selection-for-gaussian-process
Repo
Framework

Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies


Title	Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies
Authors	Daniel Hein, Alexander Hentschel, Thomas Runkler, Steffen Udluft
Abstract	Fuzzy controllers are efficient and interpretable system controllers for continuous state and action spaces. To date, such controllers have been constructed manually or trained automatically either using expert-generated problem-specific cost functions or incorporating detailed knowledge about the optimal control strategy. Both requirements for automatic training processes are not found in most real-world reinforcement learning (RL) problems. In such applications, online learning is often prohibited for safety reasons because online learning requires exploration of the problem’s dynamics during policy training. We introduce a fuzzy particle swarm reinforcement learning (FPSRL) approach that can construct fuzzy RL policies solely by training parameters on world models that simulate real system dynamics. These world models are created by employing an autonomous machine learning technique that uses previously generated transition samples of a real system. To the best of our knowledge, this approach is the first to relate self-organizing fuzzy controllers to model-based batch RL. Therefore, FPSRL is intended to solve problems in domains where online learning is prohibited, system dynamics are relatively easy to model from previously generated default policy transition samples, and it is expected that a relatively easily interpretable control policy exists. The efficiency of the proposed approach with problems from such domains is demonstrated using three standard RL benchmarks, i.e., mountain car, cart-pole balancing, and cart-pole swing-up. Our experimental results demonstrate high-performing, interpretable fuzzy policies.
Tasks
Published	2016-10-19
URL	http://arxiv.org/abs/1610.05984v5
PDF	http://arxiv.org/pdf/1610.05984v5.pdf
PWC	https://paperswithcode.com/paper/particle-swarm-optimization-for-generating
Repo
Framework

Comparative study and enhancement of Camera Tampering Detection algorithms


Title	Comparative study and enhancement of Camera Tampering Detection algorithms
Authors	Mabrouka Hagui, Mohamed Ali Mahjoub, Ahmed Boukhris
Abstract	Recently the use of video surveillance systems is widely increasing. Different places are equipped by camera surveillances such as hospitals, schools, airports, museums and military places in order to ensure the safety and security of the persons and their property. Therefore it becomes significant to guarantee the proper working of these systems. Intelligent video surveillance systems equipped by sophisticated digital camera can analyze video information s and automatically detect doubtful actions. The camera tampering detection algorithms may indicate that accidental or suspicious activities have occurred and that causes the abnormality works of the video surveillance. Camera Tampering Detection uses several techniques based on image processing and computer vision. In this paper, comparative study of performance of three algorithms that can detect abnormal disturbance for video surveillance is presented.
Tasks
Published	2016-08-08
URL	http://arxiv.org/abs/1608.02385v1
PDF	http://arxiv.org/pdf/1608.02385v1.pdf
PWC	https://paperswithcode.com/paper/comparative-study-and-enhancement-of-camera
Repo
Framework

Learning with Value-Ramp


Title	Learning with Value-Ramp
Authors	Tom J. Ameloot, Jan Van den Bussche
Abstract	We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.
Tasks
Published	2016-08-12
URL	http://arxiv.org/abs/1608.03647v2
PDF	http://arxiv.org/pdf/1608.03647v2.pdf
PWC	https://paperswithcode.com/paper/learning-with-value-ramp
Repo
Framework

Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes


Title	Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes
Authors	Gordon Christie, Ankit Laddha, Aishwarya Agrawal, Stanislaw Antol, Yash Goyal, Kevin Kochersberger, Dhruv Batra
Abstract	We present an approach to simultaneously perform semantic segmentation and prepositional phrase attachment resolution for captioned images. Some ambiguities in language cannot be resolved without simultaneously reasoning about an associated image. If we consider the sentence “I shot an elephant in my pajamas”, looking at language alone (and not using common sense), it is unclear if it is the person or the elephant wearing the pajamas or both. Our approach produces a diverse set of plausible hypotheses for both semantic segmentation and prepositional phrase attachment resolution that are then jointly reranked to select the most consistent pair. We show that our semantic segmentation and prepositional phrase attachment resolution modules have complementary strengths, and that joint reasoning produces more accurate results than any module operating in isolation. Multiple hypotheses are also shown to be crucial to improved multiple-module reasoning. Our vision and language approach significantly outperforms the Stanford Parser (De Marneffe et al., 2006) by 17.91% (28.69% relative) and 12.83% (25.28% relative) in two different experiments. We also make small improvements over DeepLab-CRF (Chen et al., 2015).
Tasks	Common Sense Reasoning, Prepositional Phrase Attachment, Semantic Segmentation
Published	2016-04-07
URL	http://arxiv.org/abs/1604.02125v4
PDF	http://arxiv.org/pdf/1604.02125v4.pdf
PWC	https://paperswithcode.com/paper/resolving-language-and-vision-ambiguities
Repo
Framework

A multinomial probabilistic model for movie genre predictions


Title	A multinomial probabilistic model for movie genre predictions
Authors	Eric Makita, Artem Lenskiy
Abstract	This paper proposes a movie genre-prediction based on multinomial probability model. To the best of our knowledge, this problem has not been addressed yet in the field of recommender system. The prediction of a movie genre has many practical applications including complementing the items categories given by experts and providing a surprise effect in the recommendations given to a user. We employ mulitnomial event model to estimate a likelihood of a movie given genre and the Bayes rule to evaluate the posterior probability of a genre given a movie. Experiments with the MovieLens dataset validate our approach. We achieved 70% prediction rate using only 15% of the whole set for training.
Tasks	Recommendation Systems
Published	2016-03-25
URL	http://arxiv.org/abs/1603.07849v1
PDF	http://arxiv.org/pdf/1603.07849v1.pdf
PWC	https://paperswithcode.com/paper/a-multinomial-probabilistic-model-for-movie
Repo
Framework

Structured adaptive and random spinners for fast machine learning computations


Title	Structured adaptive and random spinners for fast machine learning computations
Authors	Mariusz Bojarski, Anna Choromanska, Krzysztof Choromanski, Francois Fagan, Cedric Gouy-Pailler, Anne Morvan, Nourhan Sakr, Tamas Sarlos, Jamal Atif
Abstract	We consider an efficient computational framework for speeding up several machine learning algorithms with almost no loss of accuracy. The proposed framework relies on projections via structured matrices that we call Structured Spinners, which are formed as products of three structured matrix-blocks that incorporate rotations. The approach is highly generic, i.e. i) structured matrices under consideration can either be fully-randomized or learned, ii) our structured family contains as special cases all previously considered structured schemes, iii) the setting extends to the non-linear case where the projections are followed by non-linear functions, and iv) the method finds numerous applications including kernel approximations via random feature maps, dimensionality reduction algorithms, new fast cross-polytope LSH techniques, deep learning, convex optimization algorithms via Newton sketches, quantization with random projection trees, and more. The proposed framework comes with theoretical guarantees characterizing the capacity of the structured model in reference to its unstructured counterpart and is based on a general theoretical principle that we describe in the paper. As a consequence of our theoretical analysis, we provide the first theoretical guarantees for one of the most efficient existing LSH algorithms based on the HD3HD2HD1 structured matrix [Andoni et al., 2015]. The exhaustive experimental evaluation confirms the accuracy and efficiency of structured spinners for a variety of different applications.
Tasks	Dimensionality Reduction, Quantization
Published	2016-10-19
URL	http://arxiv.org/abs/1610.06209v3
PDF	http://arxiv.org/pdf/1610.06209v3.pdf
PWC	https://paperswithcode.com/paper/structured-adaptive-and-random-spinners-for
Repo
Framework

The Curious Robot: Learning Visual Representations via Physical Interactions


Title	The Curious Robot: Learning Visual Representations via Physical Interactions
Authors	Lerrel Pinto, Dhiraj Gandhi, Yuanfeng Han, Yong-Lae Park, Abhinav Gupta
Abstract	What is the right supervisory signal to train visual representations? Current approaches in computer vision use category labels from datasets such as ImageNet to train ConvNets. However, in case of biological agents, visual representation learning does not require millions of semantic labels. We argue that biological agents use physical interactions with the world to learn visual representations unlike current vision systems which just use passive observations (images and videos downloaded from web). For example, babies push objects, poke them, put them in their mouth and throw them to learn representations. Towards this goal, we build one of the first systems on a Baxter platform that pushes, pokes, grasps and observes objects in a tabletop environment. It uses four different types of physical interactions to collect more than 130K datapoints, with each datapoint providing supervision to a shared ConvNet architecture allowing us to learn visual representations. We show the quality of learned representations by observing neuron activations and performing nearest neighbor retrieval on this learned representation. Quantitatively, we evaluate our learned ConvNet on image classification tasks and show improvements compared to learning without external data. Finally, on the task of instance retrieval, our network outperforms the ImageNet network on recall@1 by 3%
Tasks	Image Classification, Representation Learning
Published	2016-04-05
URL	http://arxiv.org/abs/1604.01360v2
PDF	http://arxiv.org/pdf/1604.01360v2.pdf
PWC	https://paperswithcode.com/paper/the-curious-robot-learning-visual
Repo
Framework

Adobe-MIT submission to the DSTC 4 Spoken Language Understanding pilot task


Title	Adobe-MIT submission to the DSTC 4 Spoken Language Understanding pilot task
Authors	Franck Dernoncourt, Ji Young Lee, Trung H. Bui, Hung H. Bui
Abstract	The Dialog State Tracking Challenge 4 (DSTC 4) proposes several pilot tasks. In this paper, we focus on the spoken language understanding pilot task, which consists of tagging a given utterance with speech acts and semantic slots. We compare different classifiers: the best system obtains 0.52 and 0.67 F1-scores on the test set for speech act recognition for the tourist and the guide respectively, and 0.52 F1-score for semantic tagging for both the guide and the tourist.
Tasks	Spoken Language Understanding
Published	2016-05-07
URL	http://arxiv.org/abs/1605.02129v1
PDF	http://arxiv.org/pdf/1605.02129v1.pdf
PWC	https://paperswithcode.com/paper/adobe-mit-submission-to-the-dstc-4-spoken
Repo
Framework