January 27, 2020

2828 words 14 mins read

Paper Group ANR 1267

Active Automata Learning with Adaptive Distinguishing Sequences. A Deep Reinforcement Learning Approach to Multi-component Job Scheduling in Edge Computing. Latent Space Reinforcement Learning for Steering Angle Prediction. On the geometry of Stein variational gradient descent. Forecasting Future Action Sequences with Neural Memory Networks. Cluste …

Active Automata Learning with Adaptive Distinguishing Sequences


Title	Active Automata Learning with Adaptive Distinguishing Sequences
Authors	Markus Theo Frohme
Abstract	This document investigates the integration of adaptive distinguishing sequences into the process of active automata learning (AAL). A novel AAL algorithm “ADT” (adaptive discrimination tree) is developed and presented. Since the submission of the original thesis, the presented algorithm has been integrated into LearnLib - an open-source library for active automata learning - and has been successfully used in related fields of research.
Tasks
Published	2019-02-04
URL	http://arxiv.org/abs/1902.01139v1
PDF	http://arxiv.org/pdf/1902.01139v1.pdf
PWC	https://paperswithcode.com/paper/active-automata-learning-with-adaptive
Repo
Framework

A Deep Reinforcement Learning Approach to Multi-component Job Scheduling in Edge Computing


Title	A Deep Reinforcement Learning Approach to Multi-component Job Scheduling in Edge Computing
Authors	Zhi Cao, Honggang Zhang, Yu Cao, Benyuan Liu
Abstract	We are interested in the optimal scheduling of a collection of multi-component application jobs in an edge computing system that consists of geo-distributed edge computing nodes connected through a wide area network. The scheduling and placement of application jobs in an edge system is challenging due to the interdependence of multiple components of each job, and the communication delays between the geographically distributed data sources and edge nodes and their dynamic availability. In this paper we explore the feasibility of applying Deep Reinforcement Learning (DRL) based design to address these challenges. We introduce a DRL actor-critic algorithm that aims to find an optimal scheduling policy to minimize average job slowdown in the edge system. We have demonstrated through simulations that our design outperforms a few existing algorithms, based on both synthetic data and a Google cloud data trace.
Tasks
Published	2019-08-26
URL	https://arxiv.org/abs/1908.10290v3
PDF	https://arxiv.org/pdf/1908.10290v3.pdf
PWC	https://paperswithcode.com/paper/a-deep-reinforcement-learning-approach-to
Repo
Framework

Latent Space Reinforcement Learning for Steering Angle Prediction


Title	Latent Space Reinforcement Learning for Steering Angle Prediction
Authors	Qadeer Khan, Torsten Schön, Patrick Wenzel
Abstract	Model-free reinforcement learning has recently been shown to successfully learn navigation policies from raw sensor data. In this work, we address the problem of learning driving policies for an autonomous agent in a high-fidelity simulator. Building upon recent research that applies deep reinforcement learning to navigation problems, we present a modular deep reinforcement learning approach to predict the steering angle of the car from raw images. The first module extracts a low-dimensional latent semantic representation of the image. The control module trained with reinforcement learning takes the latent vector as input to predict the correct steering angle. The experimental results have showed that our method is capable of learning to maneuver the car without any human control signals.
Tasks
Published	2019-02-11
URL	http://arxiv.org/abs/1902.03765v1
PDF	http://arxiv.org/pdf/1902.03765v1.pdf
PWC	https://paperswithcode.com/paper/latent-space-reinforcement-learning-for
Repo
Framework

On the geometry of Stein variational gradient descent


Title	On the geometry of Stein variational gradient descent
Authors	A. Duncan, N. Nuesken, L. Szpruch
Abstract	Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely on iterated steepest descent steps with respect to a reproducing kernel Hilbert space norm. This construction leads to interacting particle systems, the mean-field limit of which is a gradient flow on the space of probability distributions equipped with a certain geometrical structure. We leverage this viewpoint to shed some light on the convergence properties of the algorithm, in particular addressing the problem of choosing a suitable positive definite kernel function. Our analysis leads us to considering certain nondifferentiable kernels with adjusted tails. We demonstrate significant performs gains of these in various numerical experiments.
Tasks	Bayesian Inference
Published	2019-12-02
URL	https://arxiv.org/abs/1912.00894v1
PDF	https://arxiv.org/pdf/1912.00894v1.pdf
PWC	https://paperswithcode.com/paper/on-the-geometry-of-stein-variational-gradient
Repo
Framework

Forecasting Future Action Sequences with Neural Memory Networks


Title	Forecasting Future Action Sequences with Neural Memory Networks
Authors	Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes
Abstract	We propose a novel neural memory network based framework for future action sequence forecasting. This is a challenging task where we have to consider short-term, within sequence relationships as well as relationships in between sequences, to understand how sequences of actions evolve over time. To capture these relationships effectively, we introduce neural memory networks to our modelling scheme. We show the significance of using two input streams, the observed frames and the corresponding action labels, which provide different information cues for our prediction task. Furthermore, through the proposed method we effectively map the long-term relationships among individual input sequences through separate memory modules, which enables better fusion of the salient features. Our method outperforms the state-of-the-art approaches by a large margin on two publicly available datasets: Breakfast and 50 Salads.
Tasks
Published	2019-09-20
URL	https://arxiv.org/abs/1909.09278v1
PDF	https://arxiv.org/pdf/1909.09278v1.pdf
PWC	https://paperswithcode.com/paper/forecasting-future-action-sequences-with
Repo
Framework

Clustering via Ant Colonies: Parameter Analysis and Improvement of the Algorithm


Title	Clustering via Ant Colonies: Parameter Analysis and Improvement of the Algorithm
Authors	Jeffry Chavarria-Molina, Juan Jose Fallas-Monge, Javier Trejos-Zelaya
Abstract	An ant colony optimization approach for partitioning a set of objects is proposed. In order to minimize the intra-variance, or within sum-of-squares, of the partitioned classes, we construct ant-like solutions by a constructive approach that selects objects to be put in a class with a probability that depends on the distance between the object and the centroid of the class (visibility) and the pheromone trail; the latter depends on the class memberships that have been defined along the iterations. The procedure is improved with the application of K-means algorithm in some iterations of the ant colony method. We performed a simulation study in order to evaluate the method with a Monte Carlo experiment that controls some sensitive parameters of the clustering problem. After some tuning of the parameters, the method has also been applied to some benchmark real-data sets. Encouraging results were obtained in nearly all cases.
Tasks
Published	2019-12-02
URL	https://arxiv.org/abs/1912.01105v1
PDF	https://arxiv.org/pdf/1912.01105v1.pdf
PWC	https://paperswithcode.com/paper/clustering-via-ant-colonies-parameter
Repo
Framework

Universal Approximation of Input-Output Maps by Temporal Convolutional Nets


Title	Universal Approximation of Input-Output Maps by Temporal Convolutional Nets
Authors	Joshua Hanson, Maxim Raginsky
Abstract	There has been a recent shift in sequence-to-sequence modeling from recurrent network architectures to convolutional network architectures due to computational advantages in training and operation while still achieving competitive performance. For systems having limited long-term temporal dependencies, the approximation capability of recurrent networks is essentially equivalent to that of temporal convolutional nets (TCNs). We prove that TCNs can approximate a large class of input-output maps having approximately finite memory to arbitrary error tolerance. Furthermore, we derive quantitative approximation rates for deep ReLU TCNs in terms of the width and depth of the network and modulus of continuity of the original input-output map, and apply these results to input-output maps of systems that admit finite-dimensional state-space realizations (i.e., recurrent models).
Tasks
Published	2019-06-21
URL	https://arxiv.org/abs/1906.09211v2
PDF	https://arxiv.org/pdf/1906.09211v2.pdf
PWC	https://paperswithcode.com/paper/universal-approximation-of-input-output-maps
Repo
Framework

On Multi-Agent Learning in Team Sports Games


Title	On Multi-Agent Learning in Team Sports Games
Authors	Yunqi Zhao, Igor Borovikov, Jason Rupert, Caedmon Somers, Ahmad Beirami
Abstract	In recent years, reinforcement learning has been successful in solving video games from Atari to Star Craft II. However, the end-to-end model-free reinforcement learning (RL) is not sample efficient and requires a significant amount of computational resources to achieve superhuman level performance. Model-free RL is also unlikely to produce human-like agents for playtesting and gameplaying AI in the development cycle of complex video games. In this paper, we present a hierarchical approach to training agents with the goal of achieving human-like style and high skill level in team sports games. While this is still work in progress, our preliminary results show that the presented approach holds promise for solving the posed multi-agent learning problem.
Tasks
Published	2019-06-25
URL	https://arxiv.org/abs/1906.10124v1
PDF	https://arxiv.org/pdf/1906.10124v1.pdf
PWC	https://paperswithcode.com/paper/on-multi-agent-learning-in-team-sports-games
Repo
Framework

On the Effectiveness of Laser Speckle Contrast Imaging and Deep Neural Networks for Detecting Known and Unknown Fingerprint Presentation Attacks


Title	On the Effectiveness of Laser Speckle Contrast Imaging and Deep Neural Networks for Detecting Known and Unknown Fingerprint Presentation Attacks
Authors	Hengameh Mirzaalian, Mohamed Hussein, Wael Abd-Almageed
Abstract	Fingerprint presentation attack detection (FPAD) is becoming an increasingly challenging problem due to the continuous advancement of attack techniques, which generate `realistic-looking’ fake fingerprint presentations. Recently, laser speckle contrast imaging (LSCI) has been introduced as a new sensing modality for FPAD. LSCI has the interesting characteristic of capturing the blood flow under the skin surface. Toward studying the importance and effectiveness of LSCI for FPAD, we conduct a comprehensive study using different patch-based deep neural network architectures. Our studied architectures include 2D and 3D convolutional networks as well as a recurrent network using long short-term memory (LSTM) units. The study demonstrates that strong FPAD performance can be achieved using LSCI. We evaluate the different models over a new large dataset. The dataset consists of 3743 bona fide samples, collected from 335 unique subjects, and 218 presentation attack samples, including six different types of attacks. To examine the effect of changing the training and testing sets, we conduct a 3-fold cross validation evaluation. To examine the effect of the presence of an unseen attack, we apply a leave-one-attack out strategy. The FPAD classification results of the networks, which are separately optimized and tuned for the temporal and spatial patch-sizes, indicate that the best performance is achieved by LSTM. \|
Tasks
Published	2019-06-06
URL	https://arxiv.org/abs/1906.02595v1
PDF	https://arxiv.org/pdf/1906.02595v1.pdf
PWC	https://paperswithcode.com/paper/on-the-effectiveness-of-laser-speckle
Repo
Framework

Generating Counterfactual and Contrastive Explanations using SHAP


Title	Generating Counterfactual and Contrastive Explanations using SHAP
Authors	Shubham Rathi
Abstract	With the advent of GDPR, the domain of explainable AI and model interpretability has gained added impetus. Methods to extract and communicate visibility into decision-making models have become legal requirement. Two specific types of explanations, contrastive and counterfactual have been identified as suitable for human understanding. In this paper, we propose a model agnostic method and its systemic implementation to generate these explanations using shapely additive explanations (SHAP). We discuss a generative pipeline to create contrastive explanations and use it to further to generate counterfactual datapoints. This pipeline is tested and discussed on the IRIS, Wine Quality & Mobile Features dataset. Analysis of the results obtained follows.
Tasks	Decision Making
Published	2019-06-21
URL	https://arxiv.org/abs/1906.09293v1
PDF	https://arxiv.org/pdf/1906.09293v1.pdf
PWC	https://paperswithcode.com/paper/generating-counterfactual-and-contrastive
Repo
Framework

Interactive Concept Mining on Personal Data – Bootstrapping Semantic Services


Title	Interactive Concept Mining on Personal Data – Bootstrapping Semantic Services
Authors	Markus Schröder, Christian Jilek, Andreas Dengel
Abstract	Semantic services (e.g. Semantic Desktops) are still afflicted by a cold start problem: in the beginning, the user’s personal information sphere, i.e. files, mails, bookmarks, etc., is not represented by the system. Information extraction tools used to kick-start the system typically create 1:1 representations of the different information items. Higher level concepts, for example found in file names, mail subjects or in the content body of these items, are not extracted. Leaving these concepts out may lead to underperformance, having to many of them (e.g. by making every found term a concept) will clutter the arising knowledge graph with non-helpful relations. In this paper, we present an interactive concept mining approach proposing concept candidates gathered by exploiting given schemata of usual personal information management applications and analysing the personal information sphere using various metrics. To heed the subjective view of the user, a graphical user interface allows to easily rank and give feedback on proposed concept candidates, thus keeping only those actually considered relevant. A prototypical implementation demonstrates major steps of our approach.
Tasks
Published	2019-03-14
URL	http://arxiv.org/abs/1903.05872v1
PDF	http://arxiv.org/pdf/1903.05872v1.pdf
PWC	https://paperswithcode.com/paper/interactive-concept-mining-on-personal-data
Repo
Framework

Fair is Better than Sensational:Man is to Doctor as Woman is to Doctor


Title	Fair is Better than Sensational:Man is to Doctor as Woman is to Doctor
Authors	Malvina Nissim, Rik van Noord, Rob van der Goot
Abstract	Analogies such as “man is to king as woman is to X” are often used to illustrate the amazing power of word embeddings. Concurrently, they have also been used to expose how strongly human biases are encoded in vector spaces built on natural language, like “man is to computer programmer as woman is to homemaker”. Recent work has shown that analogies are in fact not such a diagnostic for bias, and other methods have been proven to be more apt to the task. However, beside the intrinsic problems with the analogy task as a bias detection tool, in this paper we show that a series of issues related to how analogies have been implemented and used might have yielded a distorted picture of bias in word embeddings. Human biases are present in word embeddings and need to be addressed. Analogies, though, are probably not the right tool to do so. Also, the way they have been most often used has exacerbated some possibly non-existing biases and perhaps hid others. Because they are still widely popular, and some of them have become classics within and outside the NLP community, we deem it important to provide a series of clarifications that should put well-known, and potentially new cases into the right perspective.
Tasks	Word Embeddings
Published	2019-05-23
URL	https://arxiv.org/abs/1905.09866v2
PDF	https://arxiv.org/pdf/1905.09866v2.pdf
PWC	https://paperswithcode.com/paper/fair-is-better-than-sensationalman-is-to
Repo
Framework

Understanding Player Engagement and In-Game Purchasing Behavior with Ensemble Learning


Title	Understanding Player Engagement and In-Game Purchasing Behavior with Ensemble Learning
Authors	Anna Guitart, Ana Fernández del Río, África Periáñez
Abstract	As video games attract more and more players, the major challenge for game studios is to retain them. We present a deep behavioral analysis of churn (game abandonment) and what we called “purchase churn” (the transition from paying to non-paying user). A series of churning behavior profiles are identified, which allows a classification of churners in terms of whether they eventually return to the game (false churners)–or start purchasing again (false purchase churners)–and their subsequent behavior. The impact of excluding some or all of these churners from the training sample is then explored in several churn and purchase churn prediction models. Our results suggest that discarding certain combinations of “zombies” (players whose activity is extremely sporadic) and false churners has a significant positive impact in all models considered.
Tasks
Published	2019-07-09
URL	https://arxiv.org/abs/1907.03947v1
PDF	https://arxiv.org/pdf/1907.03947v1.pdf
PWC	https://paperswithcode.com/paper/understanding-player-engagement-and-in-game
Repo
Framework

Exploiting Relevance for Online Decision-Making in High-Dimensions


Title	Exploiting Relevance for Online Decision-Making in High-Dimensions
Authors	Eralp Turgay, Cem Bulucu, Cem Tekin
Abstract	Many sequential decision-making tasks require choosing at each decision step the right action out of the vast set of possibilities by extracting actionable intelligence from high-dimensional data streams. Most of the times, the high-dimensionality of actions and data makes learning of the optimal actions by traditional learning methods impracticable. In this work, we investigate how to discover and leverage the low-dimensional structure in actions and data to enable fast learning. As our learning model, we consider a structured contextual multi-armed bandit (CMAB) with high-dimensional arm (action) and context (data) sets, where the rewards depend only on a few relevant dimensions of the joint context-arm set. We depart from the prior work by assuming a high-dimensional and uncountable arm set, and allow relevant context dimensions to vary for each arm. We propose a new online learning algorithm called CMAB with Relevance Learning (CMAB-RL) and prove that its time-averaged regret asymptotically goes to zero. CMAB-RL enjoys a substantially improved regret bound compared to classical CMAB algorithms whose regrets depend on the dimensions $d_x$ and $d_a$ of the context and arm sets. Importantly, we show that if the learner knows upper bounds $\overline{d}_x$ and $\overline{d}_a$ on the number of relevant context and arm dimensions, then CMAB-RL achieves $\tilde{O}(T^{1 - 1 /(2 + 2\overline{d}_x + \overline{d}_a)})$ regret. Finally, we illustrate how CMAB algorithms can be used for optimal personalized blood glucose control in type 1 diabetes mellitus patients, and show that CMAB-RL outperforms other contextual MAB algorithms in this task, where the contexts represent multimodal physiological data streams obtained from sensor readings and the arms represent bolus insulin doses that are appropriate for injection.
Tasks	Decision Making
Published	2019-07-01
URL	https://arxiv.org/abs/1907.00783v1
PDF	https://arxiv.org/pdf/1907.00783v1.pdf
PWC	https://paperswithcode.com/paper/exploiting-relevance-for-online-decision
Repo
Framework

ML-based Fault Injection for Autonomous Vehicles: A Case for Bayesian Fault Injection


Title	ML-based Fault Injection for Autonomous Vehicles: A Case for Bayesian Fault Injection
Authors	Saurabh Jha, Subho S. Banerjee, Timothy Tsai, Siva K. S. Hari, Michael B. Sullivan, Zbigniew T. Kalbarczyk, Stephen W. Keckler, Ravishankar K. Iyer
Abstract	The safety and resilience of fully autonomous vehicles (AVs) are of significant concern, as exemplified by several headline-making accidents. While AV development today involves verification, validation, and testing, end-to-end assessment of AV systems under accidental faults in realistic driving scenarios has been largely unexplored. This paper presents DriveFI, a machine learning-based fault injection engine, which can mine situations and faults that maximally impact AV safety, as demonstrated on two industry-grade AV technology stacks (from NVIDIA and Baidu). For example, DriveFI found 561 safety-critical faults in less than 4 hours. In comparison, random injection experiments executed over several weeks could not find any safety-critical faults
Tasks	Autonomous Vehicles
Published	2019-07-01
URL	https://arxiv.org/abs/1907.01051v1
PDF	https://arxiv.org/pdf/1907.01051v1.pdf
PWC	https://paperswithcode.com/paper/ml-based-fault-injection-for-autonomous
Repo
Framework