May 5, 2019

2653 words 13 mins read

Paper Group ANR 518

Paper Group ANR 518

Constrained Cohort Intelligence using Static and Dynamic Penalty Function Approach for Mechanical Components Design. A probabilistic model for the numerical solution of initial value problems. Multichannel Variable-Size Convolution for Sentence Classification. Graphical RNN Models. Hyperspectral Unmixing with Endmember Variability using Partial Mem …

Constrained Cohort Intelligence using Static and Dynamic Penalty Function Approach for Mechanical Components Design

Title Constrained Cohort Intelligence using Static and Dynamic Penalty Function Approach for Mechanical Components Design
Authors Omkar Kulkarni, Ninad Kulkarni, Anand J Kulkarni, Ganesh Kakandikar
Abstract Most of the metaheuristics can efficiently solve unconstrained problems; however, their performance may degenerate if the constraints are involved. This paper proposes two constraint handling approaches for an emerging metaheuristic of Cohort Intelligence (CI). More specifically CI with static penalty function approach (SCI) and CI with dynamic penalty function approach (DCI) are proposed. The approaches have been tested by solving several constrained test problems. The performance of the SCI and DCI have been compared with algorithms like GA, PSO, ABC, d-Ds. In addition, as well as three real world problems from mechanical engineering domain with improved solutions. The results were satisfactory and validated the applicability of CI methodology for solving real world problems.
Tasks
Published 2016-09-26
URL http://arxiv.org/abs/1610.06009v1
PDF http://arxiv.org/pdf/1610.06009v1.pdf
PWC https://paperswithcode.com/paper/constrained-cohort-intelligence-using-static
Repo
Framework

A probabilistic model for the numerical solution of initial value problems

Title A probabilistic model for the numerical solution of initial value problems
Authors Michael Schober, Simo Särkkä, Philipp Hennig
Abstract Like many numerical methods, solvers for initial value problems (IVPs) on ordinary differential equations estimate an analytically intractable quantity, using the results of tractable computations as inputs. This structure is closely connected to the notion of inference on latent variables in statistics. We describe a class of algorithms that formulate the solution to an IVP as inference on a latent path that is a draw from a Gaussian process probability measure (or equivalently, the solution of a linear stochastic differential equation). We then show that certain members of this class are connected precisely to generalized linear methods for ODEs, a number of Runge–Kutta methods, and Nordsieck methods. This probabilistic formulation of classic methods is valuable in two ways: analytically, it highlights implicit prior assumptions favoring certain approximate solutions to the IVP over others, and gives a precise meaning to the old observation that these methods act like filters. Practically, it endows the classic solvers with `docking points’ for notions of uncertainty and prior information about the initial value, the value of the ODE itself, and the solution of the problem. |
Tasks
Published 2016-10-17
URL http://arxiv.org/abs/1610.05261v3
PDF http://arxiv.org/pdf/1610.05261v3.pdf
PWC https://paperswithcode.com/paper/a-probabilistic-model-for-the-numerical
Repo
Framework

Multichannel Variable-Size Convolution for Sentence Classification

Title Multichannel Variable-Size Convolution for Sentence Classification
Authors Wenpeng Yin, Hinrich Schütze
Abstract We propose MVCNN, a convolution neural network (CNN) architecture for sentence classification. It (i) combines diverse versions of pretrained word embeddings and (ii) extracts features of multigranular phrases with variable-size convolution filters. We also show that pretraining MVCNN is critical for good performance. MVCNN achieves state-of-the-art performance on four tasks: on small-scale binary, small-scale multi-class and largescale Twitter sentiment prediction and on subjectivity classification.
Tasks Sentence Classification, Word Embeddings
Published 2016-03-15
URL http://arxiv.org/abs/1603.04513v1
PDF http://arxiv.org/pdf/1603.04513v1.pdf
PWC https://paperswithcode.com/paper/multichannel-variable-size-convolution-for
Repo
Framework

Graphical RNN Models

Title Graphical RNN Models
Authors Ashish Bora, Sugato Basu, Joydeep Ghosh
Abstract Many time series are generated by a set of entities that interact with one another over time. This paper introduces a broad, flexible framework to learn from multiple inter-dependent time series generated by such entities. Our framework explicitly models the entities and their interactions through time. It achieves this by building on the capabilities of Recurrent Neural Networks, while also offering several ways to incorporate domain knowledge/constraints into the model architecture. The capabilities of our approach are showcased through an application to weather prediction, which shows gains over strong baselines.
Tasks Time Series
Published 2016-12-15
URL http://arxiv.org/abs/1612.05054v1
PDF http://arxiv.org/pdf/1612.05054v1.pdf
PWC https://paperswithcode.com/paper/graphical-rnn-models
Repo
Framework

Hyperspectral Unmixing with Endmember Variability using Partial Membership Latent Dirichlet Allocation

Title Hyperspectral Unmixing with Endmember Variability using Partial Membership Latent Dirichlet Allocation
Authors Sheng Zou, Alina Zare
Abstract The application of Partial Membership Latent Dirichlet Allocation(PM-LDA) for hyperspectral endmember estimation and spectral unmixing is presented. PM-LDA provides a model for a hyperspectral image analysis that accounts for spectral variability and incorporates spatial information through the use of superpixel-based ‘documents.’ In our application of PM-LDA, we employ the Normal Compositional Model in which endmembers are represented as Normal distributions to account for spectral variability and proportion vectors are modeled as random variables governed by a Dirichlet distribution. The use of the Dirichlet distribution enforces positivity and sum-to-one constraints on the proportion values. Algorithm results on real hyperspectral data indicate that PM-LDA produces endmember distributions that represent the ground truth classes and their associated variability.
Tasks Hyperspectral Unmixing
Published 2016-09-12
URL http://arxiv.org/abs/1609.03500v1
PDF http://arxiv.org/pdf/1609.03500v1.pdf
PWC https://paperswithcode.com/paper/hyperspectral-unmixing-with-endmember-1
Repo
Framework

A Real-Time Deep Learning Pedestrian Detector for Robot Navigation

Title A Real-Time Deep Learning Pedestrian Detector for Robot Navigation
Authors David Ribeiro, Andre Mateus, Pedro Miraldo, Jacinto C. Nascimento
Abstract A real-time Deep Learning based method for Pedestrian Detection (PD) is applied to the Human-Aware robot navigation problem. The pedestrian detector combines the Aggregate Channel Features (ACF) detector with a deep Convolutional Neural Network (CNN) in order to obtain fast and accurate performance. Our solution is firstly evaluated using a set of real images taken from onboard and offboard cameras and, then, it is validated in a typical robot navigation environment with pedestrians (two distinct experiments are conducted). The results on both tests show that our pedestrian detector is robust and fast enough to be used on robot navigation applications.
Tasks Pedestrian Detection, Robot Navigation
Published 2016-07-15
URL http://arxiv.org/abs/1607.04436v2
PDF http://arxiv.org/pdf/1607.04436v2.pdf
PWC https://paperswithcode.com/paper/a-real-time-deep-learning-pedestrian-detector
Repo
Framework

Model Selection for Gaussian Process Regression by Approximation Set Coding

Title Model Selection for Gaussian Process Regression by Approximation Set Coding
Authors Benjamin Fischer, Nico Gorbach, Stefan Bauer, Yatao Bian, Joachim M. Buhmann
Abstract Gaussian processes are powerful, yet analytically tractable models for supervised learning. A Gaussian process is characterized by a mean function and a covariance function (kernel), which are determined by a model selection criterion. The functions to be compared do not just differ in their parametrization but in their fundamental structure. It is often not clear which function structure to choose, for instance to decide between a squared exponential and a rational quadratic kernel. Based on the principle of approximation set coding, we develop a framework for model selection to rank kernels for Gaussian process regression. In our experiments approximation set coding shows promise to become a model selection criterion competitive with maximum evidence (also called marginal likelihood) and leave-one-out cross-validation.
Tasks Gaussian Processes, Model Selection
Published 2016-10-04
URL http://arxiv.org/abs/1610.00907v1
PDF http://arxiv.org/pdf/1610.00907v1.pdf
PWC https://paperswithcode.com/paper/model-selection-for-gaussian-process
Repo
Framework

Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies

Title Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies
Authors Daniel Hein, Alexander Hentschel, Thomas Runkler, Steffen Udluft
Abstract Fuzzy controllers are efficient and interpretable system controllers for continuous state and action spaces. To date, such controllers have been constructed manually or trained automatically either using expert-generated problem-specific cost functions or incorporating detailed knowledge about the optimal control strategy. Both requirements for automatic training processes are not found in most real-world reinforcement learning (RL) problems. In such applications, online learning is often prohibited for safety reasons because online learning requires exploration of the problem’s dynamics during policy training. We introduce a fuzzy particle swarm reinforcement learning (FPSRL) approach that can construct fuzzy RL policies solely by training parameters on world models that simulate real system dynamics. These world models are created by employing an autonomous machine learning technique that uses previously generated transition samples of a real system. To the best of our knowledge, this approach is the first to relate self-organizing fuzzy controllers to model-based batch RL. Therefore, FPSRL is intended to solve problems in domains where online learning is prohibited, system dynamics are relatively easy to model from previously generated default policy transition samples, and it is expected that a relatively easily interpretable control policy exists. The efficiency of the proposed approach with problems from such domains is demonstrated using three standard RL benchmarks, i.e., mountain car, cart-pole balancing, and cart-pole swing-up. Our experimental results demonstrate high-performing, interpretable fuzzy policies.
Tasks
Published 2016-10-19
URL http://arxiv.org/abs/1610.05984v5
PDF http://arxiv.org/pdf/1610.05984v5.pdf
PWC https://paperswithcode.com/paper/particle-swarm-optimization-for-generating
Repo
Framework

Comparative study and enhancement of Camera Tampering Detection algorithms

Title Comparative study and enhancement of Camera Tampering Detection algorithms
Authors Mabrouka Hagui, Mohamed Ali Mahjoub, Ahmed Boukhris
Abstract Recently the use of video surveillance systems is widely increasing. Different places are equipped by camera surveillances such as hospitals, schools, airports, museums and military places in order to ensure the safety and security of the persons and their property. Therefore it becomes significant to guarantee the proper working of these systems. Intelligent video surveillance systems equipped by sophisticated digital camera can analyze video information s and automatically detect doubtful actions. The camera tampering detection algorithms may indicate that accidental or suspicious activities have occurred and that causes the abnormality works of the video surveillance. Camera Tampering Detection uses several techniques based on image processing and computer vision. In this paper, comparative study of performance of three algorithms that can detect abnormal disturbance for video surveillance is presented.
Tasks
Published 2016-08-08
URL http://arxiv.org/abs/1608.02385v1
PDF http://arxiv.org/pdf/1608.02385v1.pdf
PWC https://paperswithcode.com/paper/comparative-study-and-enhancement-of-camera
Repo
Framework

Learning with Value-Ramp

Title Learning with Value-Ramp
Authors Tom J. Ameloot, Jan Van den Bussche
Abstract We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.
Tasks
Published 2016-08-12
URL http://arxiv.org/abs/1608.03647v2
PDF http://arxiv.org/pdf/1608.03647v2.pdf
PWC https://paperswithcode.com/paper/learning-with-value-ramp
Repo
Framework

Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes

Title Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes
Authors Gordon Christie, Ankit Laddha, Aishwarya Agrawal, Stanislaw Antol, Yash Goyal, Kevin Kochersberger, Dhruv Batra
Abstract We present an approach to simultaneously perform semantic segmentation and prepositional phrase attachment resolution for captioned images. Some ambiguities in language cannot be resolved without simultaneously reasoning about an associated image. If we consider the sentence “I shot an elephant in my pajamas”, looking at language alone (and not using common sense), it is unclear if it is the person or the elephant wearing the pajamas or both. Our approach produces a diverse set of plausible hypotheses for both semantic segmentation and prepositional phrase attachment resolution that are then jointly reranked to select the most consistent pair. We show that our semantic segmentation and prepositional phrase attachment resolution modules have complementary strengths, and that joint reasoning produces more accurate results than any module operating in isolation. Multiple hypotheses are also shown to be crucial to improved multiple-module reasoning. Our vision and language approach significantly outperforms the Stanford Parser (De Marneffe et al., 2006) by 17.91% (28.69% relative) and 12.83% (25.28% relative) in two different experiments. We also make small improvements over DeepLab-CRF (Chen et al., 2015).
Tasks Common Sense Reasoning, Prepositional Phrase Attachment, Semantic Segmentation
Published 2016-04-07
URL http://arxiv.org/abs/1604.02125v4
PDF http://arxiv.org/pdf/1604.02125v4.pdf
PWC https://paperswithcode.com/paper/resolving-language-and-vision-ambiguities
Repo
Framework

A multinomial probabilistic model for movie genre predictions

Title A multinomial probabilistic model for movie genre predictions
Authors Eric Makita, Artem Lenskiy
Abstract This paper proposes a movie genre-prediction based on multinomial probability model. To the best of our knowledge, this problem has not been addressed yet in the field of recommender system. The prediction of a movie genre has many practical applications including complementing the items categories given by experts and providing a surprise effect in the recommendations given to a user. We employ mulitnomial event model to estimate a likelihood of a movie given genre and the Bayes rule to evaluate the posterior probability of a genre given a movie. Experiments with the MovieLens dataset validate our approach. We achieved 70% prediction rate using only 15% of the whole set for training.
Tasks Recommendation Systems
Published 2016-03-25
URL http://arxiv.org/abs/1603.07849v1
PDF http://arxiv.org/pdf/1603.07849v1.pdf
PWC https://paperswithcode.com/paper/a-multinomial-probabilistic-model-for-movie
Repo
Framework

Structured adaptive and random spinners for fast machine learning computations

Title Structured adaptive and random spinners for fast machine learning computations
Authors Mariusz Bojarski, Anna Choromanska, Krzysztof Choromanski, Francois Fagan, Cedric Gouy-Pailler, Anne Morvan, Nourhan Sakr, Tamas Sarlos, Jamal Atif
Abstract We consider an efficient computational framework for speeding up several machine learning algorithms with almost no loss of accuracy. The proposed framework relies on projections via structured matrices that we call Structured Spinners, which are formed as products of three structured matrix-blocks that incorporate rotations. The approach is highly generic, i.e. i) structured matrices under consideration can either be fully-randomized or learned, ii) our structured family contains as special cases all previously considered structured schemes, iii) the setting extends to the non-linear case where the projections are followed by non-linear functions, and iv) the method finds numerous applications including kernel approximations via random feature maps, dimensionality reduction algorithms, new fast cross-polytope LSH techniques, deep learning, convex optimization algorithms via Newton sketches, quantization with random projection trees, and more. The proposed framework comes with theoretical guarantees characterizing the capacity of the structured model in reference to its unstructured counterpart and is based on a general theoretical principle that we describe in the paper. As a consequence of our theoretical analysis, we provide the first theoretical guarantees for one of the most efficient existing LSH algorithms based on the HD3HD2HD1 structured matrix [Andoni et al., 2015]. The exhaustive experimental evaluation confirms the accuracy and efficiency of structured spinners for a variety of different applications.
Tasks Dimensionality Reduction, Quantization
Published 2016-10-19
URL http://arxiv.org/abs/1610.06209v3
PDF http://arxiv.org/pdf/1610.06209v3.pdf
PWC https://paperswithcode.com/paper/structured-adaptive-and-random-spinners-for
Repo
Framework

The Curious Robot: Learning Visual Representations via Physical Interactions

Title The Curious Robot: Learning Visual Representations via Physical Interactions
Authors Lerrel Pinto, Dhiraj Gandhi, Yuanfeng Han, Yong-Lae Park, Abhinav Gupta
Abstract What is the right supervisory signal to train visual representations? Current approaches in computer vision use category labels from datasets such as ImageNet to train ConvNets. However, in case of biological agents, visual representation learning does not require millions of semantic labels. We argue that biological agents use physical interactions with the world to learn visual representations unlike current vision systems which just use passive observations (images and videos downloaded from web). For example, babies push objects, poke them, put them in their mouth and throw them to learn representations. Towards this goal, we build one of the first systems on a Baxter platform that pushes, pokes, grasps and observes objects in a tabletop environment. It uses four different types of physical interactions to collect more than 130K datapoints, with each datapoint providing supervision to a shared ConvNet architecture allowing us to learn visual representations. We show the quality of learned representations by observing neuron activations and performing nearest neighbor retrieval on this learned representation. Quantitatively, we evaluate our learned ConvNet on image classification tasks and show improvements compared to learning without external data. Finally, on the task of instance retrieval, our network outperforms the ImageNet network on recall@1 by 3%
Tasks Image Classification, Representation Learning
Published 2016-04-05
URL http://arxiv.org/abs/1604.01360v2
PDF http://arxiv.org/pdf/1604.01360v2.pdf
PWC https://paperswithcode.com/paper/the-curious-robot-learning-visual
Repo
Framework

Adobe-MIT submission to the DSTC 4 Spoken Language Understanding pilot task

Title Adobe-MIT submission to the DSTC 4 Spoken Language Understanding pilot task
Authors Franck Dernoncourt, Ji Young Lee, Trung H. Bui, Hung H. Bui
Abstract The Dialog State Tracking Challenge 4 (DSTC 4) proposes several pilot tasks. In this paper, we focus on the spoken language understanding pilot task, which consists of tagging a given utterance with speech acts and semantic slots. We compare different classifiers: the best system obtains 0.52 and 0.67 F1-scores on the test set for speech act recognition for the tourist and the guide respectively, and 0.52 F1-score for semantic tagging for both the guide and the tourist.
Tasks Spoken Language Understanding
Published 2016-05-07
URL http://arxiv.org/abs/1605.02129v1
PDF http://arxiv.org/pdf/1605.02129v1.pdf
PWC https://paperswithcode.com/paper/adobe-mit-submission-to-the-dstc-4-spoken
Repo
Framework
comments powered by Disqus