July 27, 2019

2802 words 14 mins read

Paper Group ANR 637

Doubly Robust Data-Driven Distributionally Robust Optimization. Efficient Regret Minimization in Non-Convex Games. Listening to the World Improves Speech Command Recognition. BetaRun Soccer Simulation League Team: Variety, Complexity, and Learning. A semidiscrete version of the Citti-Petitot-Sarti model as a plausible model for anthropomorphic imag …

Doubly Robust Data-Driven Distributionally Robust Optimization


Title	Doubly Robust Data-Driven Distributionally Robust Optimization
Authors	Jose Blanchet, Yang Kang, Fan Zhang, Fei He, Zhangyi Hu
Abstract	Data-driven Distributionally Robust Optimization (DD-DRO) via optimal transport has been shown to encompass a wide range of popular machine learning algorithms. The distributional uncertainty size is often shown to correspond to the regularization parameter. The type of regularization (e.g. the norm used to regularize) corresponds to the shape of the distributional uncertainty. We propose a data-driven robust optimization methodology to inform the transportation cost underlying the definition of the distributional uncertainty. We show empirically that this additional layer of robustification, which produces a method we called doubly robust data-driven distributionally robust optimization (DD-R-DRO), allows to enhance the generalization properties of regularized estimators while reducing testing error relative to state-of-the-art classifiers in a wide range of data sets.
Tasks
Published	2017-05-19
URL	http://arxiv.org/abs/1705.07168v1
PDF	http://arxiv.org/pdf/1705.07168v1.pdf
PWC	https://paperswithcode.com/paper/doubly-robust-data-driven-distributionally
Repo
Framework

Efficient Regret Minimization in Non-Convex Games


Title	Efficient Regret Minimization in Non-Convex Games
Authors	Elad Hazan, Karan Singh, Cyril Zhang
Abstract	We consider regret minimization in repeated games with non-convex loss functions. Minimizing the standard notion of regret is computationally intractable. Thus, we define a natural notion of regret which permits efficient optimization and generalizes offline guarantees for convergence to an approximate local optimum. We give gradient-based methods that achieve optimal regret, which in turn guarantee convergence to equilibrium in this framework.
Tasks
Published	2017-07-31
URL	http://arxiv.org/abs/1708.00075v1
PDF	http://arxiv.org/pdf/1708.00075v1.pdf
PWC	https://paperswithcode.com/paper/efficient-regret-minimization-in-non-convex
Repo
Framework

Listening to the World Improves Speech Command Recognition


Title	Listening to the World Improves Speech Command Recognition
Authors	Brian McMahan, Delip Rao
Abstract	We study transfer learning in convolutional network architectures applied to the task of recognizing audio, such as environmental sound events and speech commands. Our key finding is that not only is it possible to transfer representations from an unrelated task like environmental sound classification to a voice-focused task like speech command recognition, but also that doing so improves accuracies significantly. We also investigate the effect of increased model capacity for transfer learning audio, by first validating known results from the field of Computer Vision of achieving better accuracies with increasingly deeper networks on two audio datasets: UrbanSound8k and the newly released Google Speech Commands dataset. Then we propose a simple multiscale input representation using dilated convolutions and show that it is able to aggregate larger contexts and increase classification performance. Further, the models trained using a combination of transfer learning and multiscale input representations need only 40% of the training data to achieve similar accuracies as a freshly trained model with 100% of the training data. Finally, we demonstrate a positive interaction effect for the multiscale input and transfer learning, making a case for the joint application of the two techniques.
Tasks	Environmental Sound Classification, Transfer Learning
Published	2017-10-23
URL	http://arxiv.org/abs/1710.08377v1
PDF	http://arxiv.org/pdf/1710.08377v1.pdf
PWC	https://paperswithcode.com/paper/listening-to-the-world-improves-speech
Repo
Framework

BetaRun Soccer Simulation League Team: Variety, Complexity, and Learning


Title	BetaRun Soccer Simulation League Team: Variety, Complexity, and Learning
Authors	Olivia Michael, Oliver Obst
Abstract	RoboCup offers a set of benchmark problems for Artificial Intelligence in form of official world championships since 1997. The most tactical advanced and richest in terms of behavioural complexity of these is the 2D Soccer Simulation League, a simulated robotic soccer competition. BetaRun is a new attempt combining both machine learning and manual programming approaches, with the ultimate goal to arrive at a team that is trained entirely from observing and playing games, and a new development based on agent2D.
Tasks
Published	2017-03-12
URL	http://arxiv.org/abs/1703.04115v2
PDF	http://arxiv.org/pdf/1703.04115v2.pdf
PWC	https://paperswithcode.com/paper/betarun-soccer-simulation-league-team-variety
Repo
Framework

A semidiscrete version of the Citti-Petitot-Sarti model as a plausible model for anthropomorphic image reconstruction and pattern recognition


Title	A semidiscrete version of the Citti-Petitot-Sarti model as a plausible model for anthropomorphic image reconstruction and pattern recognition
Authors	Dario Prandi, Jean-Paul Gauthier
Abstract	In his beautiful book [66], Jean Petitot proposes a sub-Riemannian model for the primary visual cortex of mammals. This model is neurophysiologically justified. Further developments of this theory lead to efficient algorithms for image reconstruction, based upon the consideration of an associated hypoelliptic diffusion. The sub-Riemannian model of Petitot and Citti-Sarti (or certain of its improvements) is a left-invariant structure over the group $SE(2)$ of rototranslations of the plane. Here, we propose a semi-discrete version of this theory, leading to a left-invariant structure over the group $SE(2,N)$, restricting to a finite number of rotations. This apparently very simple group is in fact quite atypical: it is maximally almost periodic, which leads to much simpler harmonic analysis compared to $SE(2).$ Based upon this semi-discrete model, we improve on previous image-reconstruction algorithms and we develop a pattern-recognition theory that leads also to very efficient algorithms in practice.
Tasks	Image Reconstruction
Published	2017-04-10
URL	http://arxiv.org/abs/1704.03069v3
PDF	http://arxiv.org/pdf/1704.03069v3.pdf
PWC	https://paperswithcode.com/paper/a-semidiscrete-version-of-the-citti-petitot
Repo
Framework

Bayesian Semi-nonnegative Tri-matrix Factorization to Identify Pathways Associated with Cancer Types


Title	Bayesian Semi-nonnegative Tri-matrix Factorization to Identify Pathways Associated with Cancer Types
Authors	Sunho Park, Tae Hyun Hwang
Abstract	Identifying altered pathways that are associated with specific cancer types can potentially bring a significant impact on cancer patient treatment. Accurate identification of such key altered pathways information can be used to develop novel therapeutic agents as well as to understand the molecular mechanisms of various types of cancers better. Tri-matrix factorization is an efficient tool to learn associations between two different entities (e.g., cancer types and pathways in our case) from data. To successfully apply tri-matrix factorization methods to biomedical problems, biological prior knowledge such as pathway databases or protein-protein interaction (PPI) networks, should be taken into account in the factorization model. However, it is not straightforward in the Bayesian setting even though Bayesian methods are more appealing than point estimate methods, such as a maximum likelihood or a maximum posterior method, in the sense that they calculate distributions over variables and are robust against overfitting. We propose a Bayesian (semi-)nonnegative matrix factorization model for human cancer genomic data, where the biological prior knowledge represented by a pathway database and a PPI network is taken into account in the factorization model through a finite dependent Beta-Bernoulli prior. We tested our method on The Cancer Genome Atlas (TCGA) dataset and found that the pathways identified by our method can be used as a prognostic biomarkers for patient subgroup identification.
Tasks
Published	2017-12-01
URL	http://arxiv.org/abs/1712.00520v1
PDF	http://arxiv.org/pdf/1712.00520v1.pdf
PWC	https://paperswithcode.com/paper/bayesian-semi-nonnegative-tri-matrix
Repo
Framework

Approximation learning methods of Harmonic Mappings in relation to Hardy Spaces


Title	Approximation learning methods of Harmonic Mappings in relation to Hardy Spaces
Authors	Zhulin Liu, C. L. Philip Chen
Abstract	A new Hardy space Hardy space approach of Dirichlet type problem based on Tikhonov regularization and Reproducing Hilbert kernel space is discussed in this paper, which turns out to be a typical extremal problem located on the upper upper-high complex plane. If considering this in the Hardy space, the optimization operator of this problem will be highly simplified and an efficient algorithm is possible. This is mainly realized by the help of reproducing properties of the functions in the Hardy space of upper-high complex plane, and the detail algorithm is proposed. Moreover, harmonic mappings, which is a significant geometric transformation, are commonly used in many applications such as image processing, since it describes the energy minimization mappings between individual manifolds. Particularly, when we focus on the planer mappings between two Euclid planer regions, the harmonic mappings are exist and unique, which is guaranteed solidly by the existence of harmonic function. This property is attractive and simulation results are shown in this paper to ensure the capability of applications such as planer shape distortion and surface registration.
Tasks
Published	2017-05-24
URL	http://arxiv.org/abs/1705.10596v1
PDF	http://arxiv.org/pdf/1705.10596v1.pdf
PWC	https://paperswithcode.com/paper/approximation-learning-methods-of-harmonic
Repo
Framework

CNNComparator: Comparative Analytics of Convolutional Neural Networks


Title	CNNComparator: Comparative Analytics of Convolutional Neural Networks
Authors	Haipeng Zeng, Hammad Haleem, Xavier Plantaz, Nan Cao, Huamin Qu
Abstract	Convolutional neural networks (CNNs) are widely used in many image recognition tasks due to their extraordinary performance. However, training a good CNN model can still be a challenging task. In a training process, a CNN model typically learns a large number of parameters over time, which usually results in different performance. Often, it is difficult to explore the relationships between the learned parameters and the model performance due to a large number of parameters and different random initializations. In this paper, we present a visual analytics approach to compare two different snapshots of a trained CNN model taken after different numbers of epochs, so as to provide some insight into the design or the training of a better CNN model. Our system compares snapshots by exploring the differences in operation parameters and the corresponding blob data at different levels. A case study has been conducted to demonstrate the effectiveness of our system.
Tasks
Published	2017-10-15
URL	http://arxiv.org/abs/1710.05285v1
PDF	http://arxiv.org/pdf/1710.05285v1.pdf
PWC	https://paperswithcode.com/paper/cnncomparator-comparative-analytics-of
Repo
Framework

Polysemy Detection in Distributed Representation of Word Sense


Title	Polysemy Detection in Distributed Representation of Word Sense
Authors	Kana Oomoto, Haruka Oikawa, Eiko Yamamoto, Mitsuo Yoshida, Masayuki Okabe, Kyoji Umemura
Abstract	In this paper, we propose a statistical test to determine whether a given word is used as a polysemic word or not. The statistic of the word in this test roughly corresponds to the fluctuation in the senses of the neighboring words a nd the word itself. Even though the sense of a word corresponds to a single vector, we discuss how polysemy of the words affects the position of vectors. Finally, we also explain the method to detect this effect.
Tasks
Published	2017-09-26
URL	http://arxiv.org/abs/1709.08858v1
PDF	http://arxiv.org/pdf/1709.08858v1.pdf
PWC	https://paperswithcode.com/paper/polysemy-detection-in-distributed
Repo
Framework

Recovery Guarantees for One-hidden-layer Neural Networks


Title	Recovery Guarantees for One-hidden-layer Neural Networks
Authors	Kai Zhong, Zhao Song, Prateek Jain, Peter L. Bartlett, Inderjit S. Dhillon
Abstract	In this paper, we consider regression problems with one-hidden-layer neural networks (1NNs). We distill some properties of activation functions that lead to $\mathit{local~strong~convexity}$ in the neighborhood of the ground-truth parameters for the 1NN squared-loss objective. Most popular nonlinear activation functions satisfy the distilled properties, including rectified linear units (ReLUs), leaky ReLUs, squared ReLUs and sigmoids. For activation functions that are also smooth, we show $\mathit{local~linear~convergence}$ guarantees of gradient descent under a resampling rule. For homogeneous activations, we show tensor methods are able to initialize the parameters to fall into the local strong convexity region. As a result, tensor initialization followed by gradient descent is guaranteed to recover the ground truth with sample complexity $ d \cdot \log(1/\epsilon) \cdot \mathrm{poly}(k,\lambda )$ and computational complexity $n\cdot d \cdot \mathrm{poly}(k,\lambda) $ for smooth homogeneous activations with high probability, where $d$ is the dimension of the input, $k$ ($k\leq d$) is the number of hidden nodes, $\lambda$ is a conditioning property of the ground-truth parameter matrix between the input layer and the hidden layer, $\epsilon$ is the targeted precision and $n$ is the number of samples. To the best of our knowledge, this is the first work that provides recovery guarantees for 1NNs with both sample complexity and computational complexity $\mathit{linear}$ in the input dimension and $\mathit{logarithmic}$ in the precision.
Tasks
Published	2017-06-10
URL	http://arxiv.org/abs/1706.03175v1
PDF	http://arxiv.org/pdf/1706.03175v1.pdf
PWC	https://paperswithcode.com/paper/recovery-guarantees-for-one-hidden-layer
Repo
Framework

Modulating and attending the source image during encoding improves Multimodal Translation


Title	Modulating and attending the source image during encoding improves Multimodal Translation
Authors	Jean-Benoit Delbrouck, Stéphane Dupont
Abstract	We propose a new and fully end-to-end approach for multimodal translation where the source text encoder modulates the entire visual input processing using conditional batch normalization, in order to compute the most informative image features for our task. Additionally, we propose a new attention mechanism derived from this original idea, where the attention model for the visual input is conditioned on the source text encoder representations. In the paper, we detail our models as well as the image analysis pipeline. Finally, we report experimental results. They are, as far as we know, the new state of the art on three different test sets.
Tasks
Published	2017-12-09
URL	http://arxiv.org/abs/1712.03449v1
PDF	http://arxiv.org/pdf/1712.03449v1.pdf
PWC	https://paperswithcode.com/paper/modulating-and-attending-the-source-image
Repo
Framework

Blue Sky Ideas in Artificial Intelligence Education from the EAAI 2017 New and Future AI Educator Program


Title	Blue Sky Ideas in Artificial Intelligence Education from the EAAI 2017 New and Future AI Educator Program
Authors	Eric Eaton, Sven Koenig, Claudia Schulz, Francesco Maurelli, John Lee, Joshua Eckroth, Mark Crowley, Richard G. Freedman, Rogelio E. Cardona-Rivera, Tiago Machado, Tom Williams
Abstract	The 7th Symposium on Educational Advances in Artificial Intelligence (EAAI’17, co-chaired by Sven Koenig and Eric Eaton) launched the EAAI New and Future AI Educator Program to support the training of early-career university faculty, secondary school faculty, and future educators (PhD candidates or postdocs who intend a career in academia). As part of the program, awardees were asked to address one of the following “blue sky” questions: * How could/should Artificial Intelligence (AI) courses incorporate ethics into the curriculum? * How could we teach AI topics at an early undergraduate or a secondary school level? * AI has the potential for broad impact to numerous disciplines. How could we make AI education more interdisciplinary, specifically to benefit non-engineering fields? This paper is a collection of their responses, intended to help motivate discussion around these issues in AI education.
Tasks
Published	2017-02-01
URL	http://arxiv.org/abs/1702.00137v1
PDF	http://arxiv.org/pdf/1702.00137v1.pdf
PWC	https://paperswithcode.com/paper/blue-sky-ideas-in-artificial-intelligence
Repo
Framework

An Optimal Bayesian Network Based Solution Scheme for the Constrained Stochastic On-line Equi-Partitioning Problem


Title	An Optimal Bayesian Network Based Solution Scheme for the Constrained Stochastic On-line Equi-Partitioning Problem
Authors	Sondre Glimsdal, Ole-Christoffer Granmo
Abstract	A number of intriguing decision scenarios revolve around partitioning a collection of objects to optimize some application specific objective function. This problem is generally referred to as the Object Partitioning Problem (OPP) and is known to be NP-hard. We here consider a particularly challenging version of OPP, namely, the Stochastic On-line Equi-Partitioning Problem (SO-EPP). In SO-EPP, the target partitioning is unknown and has to be inferred purely from observing an on-line sequence of object pairs. The paired objects belong to the same partition with probability $p$ and to different partitions with probability $1-p$, with $p$ also being unknown. As an additional complication, the partitions are required to be of equal cardinality. Previously, only sub-optimal solution strategies have been proposed for SO- EPP. In this paper, we propose the first optimal solution strategy. In brief, the scheme that we propose, BN-EPP, is founded on a Bayesian network representation of SO-EPP problems. Based on probabilistic reasoning, we are not only able to infer the underlying object partitioning with optimal accuracy. We are also able to simultaneously infer $p$, allowing us to accelerate learning as object pairs arrive. Furthermore, our scheme is the first to support arbitrary constraints on the partitioning (Constrained SO-EPP). Being optimal, BN-EPP provides superior performance compared to existing solution schemes. We additionally introduce Walk-BN-EPP, a novel WalkSAT inspired algorithm for solving large scale BN-EPP problems. Finally, we provide a BN-EPP based solution to the problem of order picking, a representative real-life application of BN-EPP.
Tasks
Published	2017-07-11
URL	http://arxiv.org/abs/1707.03098v1
PDF	http://arxiv.org/pdf/1707.03098v1.pdf
PWC	https://paperswithcode.com/paper/an-optimal-bayesian-network-based-solution
Repo
Framework

RelNet: End-to-End Modeling of Entities & Relations


Title	RelNet: End-to-End Modeling of Entities & Relations
Authors	Trapit Bansal, Arvind Neelakantan, Andrew McCallum
Abstract	We introduce RelNet: a new model for relational reasoning. RelNet is a memory augmented neural network which models entities as abstract memory slots and is equipped with an additional relational memory which models relations between all memory pairs. The model thus builds an abstract knowledge graph on the entities and relations present in a document which can then be used to answer questions about the document. It is trained end-to-end: only supervision to the model is in the form of correct answers to the questions. We test the model on the 20 bAbI question-answering tasks with 10k examples per task and find that it solves all the tasks with a mean error of 0.3%, achieving 0% error on 11 of the 20 tasks.
Tasks	Question Answering, Relational Reasoning
Published	2017-06-22
URL	http://arxiv.org/abs/1706.07179v2
PDF	http://arxiv.org/pdf/1706.07179v2.pdf
PWC	https://paperswithcode.com/paper/relnet-end-to-end-modeling-of-entities
Repo
Framework

Realizing Half-Diminished Reality from Video Stream of Manipulating Objects


Title	Realizing Half-Diminished Reality from Video Stream of Manipulating Objects
Authors	Hayato Okumoto, Mitsuo Yoshida, Kyoji Umemura
Abstract	When we watch a video, in which human hands manipulate objects, these hands may obscure some parts of those objects. We are willing to make clear how the objects are manipulated by making the image of hands semi-transparent, and showing the complete images of the hands and the object. By carefully choosing a Half-Diminished Reality method, this paper proposes a method that can process the video in real time and verifies that the proposed method works well.
Tasks
Published	2017-09-25
URL	http://arxiv.org/abs/1709.08340v1
PDF	http://arxiv.org/pdf/1709.08340v1.pdf
PWC	https://paperswithcode.com/paper/realizing-half-diminished-reality-from-video
Repo
Framework