July 28, 2019

2987 words 15 mins read

Paper Group ANR 237

Paper Group ANR 237

On the Universality of Memcomputing Machines. Weight-based Fish School Search algorithm for Many-Objective Optimization. An Unsupervised Game-Theoretic Approach to Saliency Detection. Logic Programming approaches for routing fault-free and maximally-parallel Wavelength Routed Optical Networks on Chip (Application paper). A Deep Learning Model for T …

On the Universality of Memcomputing Machines

Title On the Universality of Memcomputing Machines
Authors Yan Ru Pei, Fabio L. Traversa, Massimiliano Di Ventra
Abstract Universal memcomputing machines (UMMs) [IEEE Trans. Neural Netw. Learn. Syst. 26, 2702 (2015)] represent a novel computational model in which memory (time non-locality) accomplishes both tasks of storing and processing of information. UMMs have been shown to be Turing-complete, namely they can simulate any Turing machine. In this paper, using set theory and cardinality arguments, we compare them with liquid-state machines (or “reservoir computing”) and quantum machines (“quantum computing”). We show that UMMs can simulate both types of machines, hence they are both “liquid-” or “reservoir-complete” and “quantum-complete”. Of course, these statements pertain only to the type of problems these machines can solve, and not to the amount of resources required for such simulations. Nonetheless, the method presented here provides a general framework in which to describe the relation between UMMs and any other type of computational model.
Tasks
Published 2017-12-23
URL https://arxiv.org/abs/1712.08702v2
PDF https://arxiv.org/pdf/1712.08702v2.pdf
PWC https://paperswithcode.com/paper/on-the-universality-of-memcomputing-machines
Repo
Framework

Weight-based Fish School Search algorithm for Many-Objective Optimization

Title Weight-based Fish School Search algorithm for Many-Objective Optimization
Authors F. B. Lima Neto, I. M. C. Albuquerque, J. B. Monteiro Filho
Abstract Optimization problems with more than one objective consist in a very attractive topic for researchers due to its applicability in real-world situations. Over the years, the research effort in the Computational Intelligence field resulted in algorithms able to achieve good results by solving problems with more than one conflicting objective. However, these techniques do not exhibit the same performance as the number of objectives increases and become greater than 3. This paper proposes an adaptation of the metaheuristic Fish School Search to solve optimization problems with many objectives. This adaptation is based on the division of the candidate solutions in clusters that are specialized in solving a single-objective problem generated by the decomposition of the original problem. For this, we used concepts and ideas often employed by state-of-the-art algorithms, namely: (i) reference points and lines in the objectives space; (ii) clustering process; and (iii) the decomposition technique Penalty-based Boundary Intersection. The proposed algorithm was compared with two state-of-the-art bio-inspired algorithms. Moreover, a version of the proposed technique tailored to solve multi-modal problems was also presented. The experiments executed have shown that the performance obtained by both versions is competitive with state-of-the-art results.
Tasks
Published 2017-08-16
URL http://arxiv.org/abs/1708.04745v3
PDF http://arxiv.org/pdf/1708.04745v3.pdf
PWC https://paperswithcode.com/paper/weight-based-fish-school-search-algorithm-for
Repo
Framework

An Unsupervised Game-Theoretic Approach to Saliency Detection

Title An Unsupervised Game-Theoretic Approach to Saliency Detection
Authors Yu Zeng, Huchuan Lu, Ali Borji, Mengyang Feng
Abstract We propose a novel unsupervised game-theoretic salient object detection algorithm that does not require labeled training data. First, saliency detection problem is formulated as a non-cooperative game, hereinafter referred to as Saliency Game, in which image regions are players who choose to be “background” or “foreground” as their pure strategies. A payoff function is constructed by exploiting multiple cues and combining complementary features. Saliency maps are generated according to each region’s strategy in the Nash equilibrium of the proposed Saliency Game. Second, we explore the complementary relationship between color and deep features and propose an Iterative Random Walk algorithm to combine saliency maps produced by the Saliency Game using different features. Iterative random walk allows sharing information across feature spaces, and detecting objects that are otherwise very hard to detect. Extensive experiments over 6 challenging datasets demonstrate the superiority of our proposed unsupervised algorithm compared to several state of the art supervised algorithms.
Tasks Object Detection, Saliency Detection, Salient Object Detection
Published 2017-08-08
URL http://arxiv.org/abs/1708.02476v1
PDF http://arxiv.org/pdf/1708.02476v1.pdf
PWC https://paperswithcode.com/paper/an-unsupervised-game-theoretic-approach-to
Repo
Framework

Logic Programming approaches for routing fault-free and maximally-parallel Wavelength Routed Optical Networks on Chip (Application paper)

Title Logic Programming approaches for routing fault-free and maximally-parallel Wavelength Routed Optical Networks on Chip (Application paper)
Authors Marco Gavanelli, Maddalena Nonato, Andrea Peano, Davide Bertozzi
Abstract One promising trend in digital system integration consists of boosting on-chip communication performance by means of silicon photonics, thus materializing the so-called Optical Networks-on-Chip (ONoCs). Among them, wavelength routing can be used to route a signal to destination by univocally associating a routing path to the wavelength of the optical carrier. Such wavelengths should be chosen so to minimize interferences among optical channels and to avoid routing faults. As a result, physical parameter selection of such networks requires the solution of complex constrained optimization problems. In previous work, published in the proceedings of the International Conference on Computer-Aided Design, we proposed and solved the problem of computing the maximum parallelism obtainable in the communication between any two endpoints while avoiding misrouting of optical signals. The underlying technology, only quickly mentioned in that paper, is Answer Set Programming (ASP). In this work, we detail the ASP approach we used to solve such problem. Another important design issue is to select the wavelengths of optical carriers such that they are spread across the available spectrum, in order to reduce the likelihood that, due to imperfections in the manufacturing process, unintended routing faults arise. We show how to address such problem in Constraint Logic Programming on Finite Domains (CLP(FD)). This paper is under consideration for possible publication on Theory and Practice of Logic Programming.
Tasks
Published 2017-07-18
URL http://arxiv.org/abs/1707.05858v1
PDF http://arxiv.org/pdf/1707.05858v1.pdf
PWC https://paperswithcode.com/paper/logic-programming-approaches-for-routing
Repo
Framework

A Deep Learning Model for Traffic Flow State Classification Based on Smart Phone Sensor Data

Title A Deep Learning Model for Traffic Flow State Classification Based on Smart Phone Sensor Data
Authors Wenwen Tu, Feng Xiao, Liping Fu, Guangyuan Pan
Abstract This study proposes a Deep Belief Network model to classify traffic flow states. The model is capable of processing massive, high-density, and noise-contaminated data sets generated from smartphone sensors. The statistical features of Vehicle acceleration, angular acceleration, and GPS speed data, recorded by smartphone software, are analyzed, and then used as input for traffic flow state classification. Data from a five-day experiment is used to train and test the proposed model. A total of 747,856 sets of data are generated and used for both traffic flow states classification and sensitivity analysis of input variables. The result shows that the proposed Deep Belief Network model is superior to traditional machine learning methods in both classification performance and computational efficiency.
Tasks
Published 2017-09-26
URL http://arxiv.org/abs/1709.08802v1
PDF http://arxiv.org/pdf/1709.08802v1.pdf
PWC https://paperswithcode.com/paper/a-deep-learning-model-for-traffic-flow-state
Repo
Framework

Estimation of interventional effects of features on prediction

Title Estimation of interventional effects of features on prediction
Authors Patrick Blöbaum, Shohei Shimizu
Abstract The interpretability of prediction mechanisms with respect to the underlying prediction problem is often unclear. While several studies have focused on developing prediction models with meaningful parameters, the causal relationships between the predictors and the actual prediction have not been considered. Here, we connect the underlying causal structure of a data generation process and the causal structure of a prediction mechanism. To achieve this, we propose a framework that identifies the feature with the greatest causal influence on the prediction and estimates the necessary causal intervention of a feature such that a desired prediction is obtained. The general concept of the framework has no restrictions regarding data linearity; however, we focus on an implementation for linear data here. The framework applicability is evaluated using artificial data and demonstrated using real-world data.
Tasks
Published 2017-09-03
URL http://arxiv.org/abs/1709.00776v1
PDF http://arxiv.org/pdf/1709.00776v1.pdf
PWC https://paperswithcode.com/paper/estimation-of-interventional-effects-of
Repo
Framework

Strategy Synthesis in POMDPs via Game-Based Abstractions

Title Strategy Synthesis in POMDPs via Game-Based Abstractions
Authors Leonore Winterer, Sebastian Junges, Ralf Wimmer, Nils Jansen, Ufuk Topcu, Joost-Pieter Katoen, Bernd Becker
Abstract We study synthesis problems with constraints in partially observable Markov decision processes (POMDPs), where the objective is to compute a strategy for an agent that is guaranteed to satisfy certain safety and performance specifications. Verification and strategy synthesis for POMDPs are, however, computationally intractable in general. We alleviate this difficulty by focusing on planning applications and exploiting typical structural properties of such scenarios; for instance, we assume that the agent has the ability to observe its own position inside an environment. We propose an abstraction refinement framework which turns such a POMDP model into a (fully observable) probabilistic two-player game (PG). For the obtained PGs, efficient verification and synthesis tools allow to determine strategies with optimal safety and performance measures, which approximate optimal schedulers on the POMDP. If the approximation is too coarse to satisfy the given specifications, an refinement scheme improves the computed strategies. As a running example, we use planning problems where an agent moves inside an environment with randomly moving obstacles and restricted observability. We demonstrate that the proposed method advances the state of the art by solving problems several orders-of-magnitude larger than those that can be handled by existing POMDP solvers. Furthermore, this method gives guarantees on safety constraints, which is not supported by the majority of the existing solvers.
Tasks Motion Planning
Published 2017-08-14
URL https://arxiv.org/abs/1708.04236v2
PDF https://arxiv.org/pdf/1708.04236v2.pdf
PWC https://paperswithcode.com/paper/motion-planning-under-partial-observability
Repo
Framework

Gradient-based Regularization Parameter Selection for Problems with Non-smooth Penalty Functions

Title Gradient-based Regularization Parameter Selection for Problems with Non-smooth Penalty Functions
Authors Jean Feng, Noah Simon
Abstract In high-dimensional and/or non-parametric regression problems, regularization (or penalization) is used to control model complexity and induce desired structure. Each penalty has a weight parameter that indicates how strongly the structure corresponding to that penalty should be enforced. Typically the parameters are chosen to minimize the error on a separate validation set using a simple grid search or a gradient-free optimization method. It is more efficient to tune parameters if the gradient can be determined, but this is often difficult for problems with non-smooth penalty functions. Here we show that for many penalized regression problems, the validation loss is actually smooth almost-everywhere with respect to the penalty parameters. We can therefore apply a modified gradient descent algorithm to tune parameters. Through simulation studies on example regression problems, we find that increasing the number of penalty parameters and tuning them using our method can decrease the generalization error.
Tasks
Published 2017-03-28
URL http://arxiv.org/abs/1703.09813v1
PDF http://arxiv.org/pdf/1703.09813v1.pdf
PWC https://paperswithcode.com/paper/gradient-based-regularization-parameter
Repo
Framework

Deterministic Quantum Annealing Expectation-Maximization Algorithm

Title Deterministic Quantum Annealing Expectation-Maximization Algorithm
Authors Hideyuki Miyahara, Koji Tsumura, Yuki Sughiyama
Abstract Maximum likelihood estimation (MLE) is one of the most important methods in machine learning, and the expectation-maximization (EM) algorithm is often used to obtain maximum likelihood estimates. However, EM heavily depends on initial configurations and fails to find the global optimum. On the other hand, in the field of physics, quantum annealing (QA) was proposed as a novel optimization approach. Motivated by QA, we propose a quantum annealing extension of EM, which we call the deterministic quantum annealing expectation-maximization (DQAEM) algorithm. We also discuss its advantage in terms of the path integral formulation. Furthermore, by employing numerical simulations, we illustrate how it works in MLE and show that DQAEM outperforms EM.
Tasks
Published 2017-04-19
URL http://arxiv.org/abs/1704.05822v1
PDF http://arxiv.org/pdf/1704.05822v1.pdf
PWC https://paperswithcode.com/paper/deterministic-quantum-annealing-expectation
Repo
Framework

Cross-language Learning with Adversarial Neural Networks: Application to Community Question Answering

Title Cross-language Learning with Adversarial Neural Networks: Application to Community Question Answering
Authors Shafiq Joty, Preslav Nakov, Lluís Màrquez, Israa Jaradat
Abstract We address the problem of cross-language adaptation for question-question similarity reranking in community question answering, with the objective to port a system trained on one input language to another input language given labeled training data for the first language and only unlabeled data for the second language. In particular, we propose to use adversarial training of neural networks to learn high-level features that are discriminative for the main learning task, and at the same time are invariant across the input languages. The evaluation results show sizable improvements for our cross-language adversarial neural network (CLANN) model over a strong non-adversarial system.
Tasks Community Question Answering, Question Answering, Question Similarity
Published 2017-06-21
URL http://arxiv.org/abs/1706.06749v1
PDF http://arxiv.org/pdf/1706.06749v1.pdf
PWC https://paperswithcode.com/paper/cross-language-learning-with-adversarial-1
Repo
Framework

Audio-Visual Speech Enhancement based on Multimodal Deep Convolutional Neural Network

Title Audio-Visual Speech Enhancement based on Multimodal Deep Convolutional Neural Network
Authors Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Yu Tsao, Hsiu-Wen Chang, Hsin-Min Wang
Abstract Speech enhancement (SE) aims to reduce noise in speech signals. Most SE techniques focus on addressing audio information only. In this work, inspired by multimodal learning, which utilizes data from different modalities, and the recent success of convolutional neural networks (CNNs) in SE, we propose an audio-visual deep CNN (AVDCNN) SE model, which incorporates audio and visual streams into a unified network model. In the proposed AVDCNN SE model, audio and visual data are first processed using individual CNNs, and then, fused into a joint network to generate enhanced speech at the output layer. The AVDCNN model is trained in an end-to-end manner, and parameters are jointly learned through back-propagation. We evaluate enhanced speech using five objective criteria. Results show that the AVDCNN yields notably better performance, compared with an audio-only CNN-based SE model and two conventional SE approaches, confirming the effectiveness of integrating visual information into the SE process.
Tasks Speech Enhancement
Published 2017-09-01
URL http://arxiv.org/abs/1709.00944v2
PDF http://arxiv.org/pdf/1709.00944v2.pdf
PWC https://paperswithcode.com/paper/audio-visual-speech-enhancement-based-on
Repo
Framework

Accessible Melanoma Detection using Smartphones and Mobile Image Analysis

Title Accessible Melanoma Detection using Smartphones and Mobile Image Analysis
Authors T. -T. Do, T. Hoang, V. Pomponiu, Y. Zhou, Z. Chen, N. -M. Cheung, D. Koh, A. Tan, S. -H. Tan
Abstract We investigate the design of an entire mobile imaging system for early detection of melanoma. Different from previous work, we focus on smartphone-captured visible light images. Our design addresses two major challenges. First, images acquired using a smartphone under loosely-controlled environmental conditions may be subject to various distortions, and this makes melanoma detection more difficult. Second, processing performed on a smartphone is subject to stringent computation and memory constraints. In our work, we propose a detection system that is optimized to run entirely on the resource-constrained smartphone. Our system intends to localize the skin lesion by combining a lightweight method for skin detection with a hierarchical segmentation approach using two fast segmentation methods. Moreover, we study an extensive set of image features and propose new numerical features to characterize a skin lesion. Furthermore, we propose an improved feature selection algorithm to determine a small set of discriminative features used by the final lightweight system. In addition, we study the human-computer interface (HCI) design to understand the usability and acceptance issues of the proposed system.
Tasks Feature Selection
Published 2017-11-27
URL http://arxiv.org/abs/1711.09553v2
PDF http://arxiv.org/pdf/1711.09553v2.pdf
PWC https://paperswithcode.com/paper/accessible-melanoma-detection-using
Repo
Framework

Topic Based Sentiment Analysis Using Deep Learning

Title Topic Based Sentiment Analysis Using Deep Learning
Authors Sharath T. S., Shubhangi Tandon
Abstract In this paper , we tackle Sentiment Analysis conditioned on a Topic in Twitter data using Deep Learning . We propose a 2-tier approach : In the first phase we create our own Word Embeddings and see that they do perform better than state-of-the-art embeddings when used with standard classifiers. We then perform inference on these embeddings to learn more about a word with respect to all the topics being considered, and also the top n-influencing words for each topic. In the second phase we use these embeddings to predict the sentiment of the tweet with respect to a given topic, and all other topics under discussion.
Tasks Sentiment Analysis, Word Embeddings
Published 2017-10-28
URL http://arxiv.org/abs/1710.10498v1
PDF http://arxiv.org/pdf/1710.10498v1.pdf
PWC https://paperswithcode.com/paper/topic-based-sentiment-analysis-using-deep
Repo
Framework

Reweighted Infrared Patch-Tensor Model With Both Non-Local and Local Priors for Single-Frame Small Target Detection

Title Reweighted Infrared Patch-Tensor Model With Both Non-Local and Local Priors for Single-Frame Small Target Detection
Authors Yimian Dai, Yiquan Wu
Abstract Many state-of-the-art methods have been proposed for infrared small target detection. They work well on the images with homogeneous backgrounds and high-contrast targets. However, when facing highly heterogeneous backgrounds, they would not perform very well, mainly due to: 1) the existence of strong edges and other interfering components, 2) not utilizing the priors fully. Inspired by this, we propose a novel method to exploit both local and non-local priors simultaneously. Firstly, we employ a new infrared patch-tensor (IPT) model to represent the image and preserve its spatial correlations. Exploiting the target sparse prior and background non-local self-correlation prior, the target-background separation is modeled as a robust low-rank tensor recovery problem. Moreover, with the help of the structure tensor and reweighted idea, we design an entry-wise local-structure-adaptive and sparsity enhancing weight to replace the globally constant weighting parameter. The decomposition could be achieved via the element-wise reweighted higher-order robust principal component analysis with an additional convergence condition according to the practical situation of target detection. Extensive experiments demonstrate that our model outperforms the other state-of-the-arts, in particular for the images with very dim targets and heavy clutters.
Tasks
Published 2017-03-27
URL http://arxiv.org/abs/1703.09157v1
PDF http://arxiv.org/pdf/1703.09157v1.pdf
PWC https://paperswithcode.com/paper/reweighted-infrared-patch-tensor-model-with
Repo
Framework

Learning the Structure of Generative Models without Labeled Data

Title Learning the Structure of Generative Models without Labeled Data
Authors Stephen H. Bach, Bryan He, Alexander Ratner, Christopher Ré
Abstract Curating labeled training data has become the primary bottleneck in machine learning. Recent frameworks address this bottleneck with generative models to synthesize labels at scale from weak supervision sources. The generative model’s dependency structure directly affects the quality of the estimated labels, but selecting a structure automatically without any labeled data is a distinct challenge. We propose a structure estimation method that maximizes the $\ell_1$-regularized marginal pseudolikelihood of the observed data. Our analysis shows that the amount of unlabeled data required to identify the true structure scales sublinearly in the number of possible dependencies for a broad class of models. Simulations show that our method is 100$\times$ faster than a maximum likelihood approach and selects $1/4$ as many extraneous dependencies. We also show that our method provides an average of 1.5 F1 points of improvement over existing, user-developed information extraction applications on real-world data such as PubMed journal abstracts.
Tasks
Published 2017-03-02
URL http://arxiv.org/abs/1703.00854v2
PDF http://arxiv.org/pdf/1703.00854v2.pdf
PWC https://paperswithcode.com/paper/learning-the-structure-of-generative-models
Repo
Framework
comments powered by Disqus