April 2, 2020

2903 words 14 mins read

Paper Group ANR 158

Optimistic bounds for multi-output prediction. Learning Object Placements For Relational Instructions by Hallucinating Scene Representations. Partial Gromov-Wasserstein with Applications on Positive-Unlabeled Learning. Deontological Ethics By Monotonicity Shape Constraints. Implementing Dynamic Answer Set Programming. A scheme for automatic differe …

Optimistic bounds for multi-output prediction


Title	Optimistic bounds for multi-output prediction
Authors	Henry WJ Reeve, Ata Kaban
Abstract	We investigate the challenge of multi-output learning, where the goal is to learn a vector-valued function based on a supervised data set. This includes a range of important problems in Machine Learning including multi-target regression, multi-class classification and multi-label classification. We begin our analysis by introducing the self-bounding Lipschitz condition for multi-output loss functions, which interpolates continuously between a classical Lipschitz condition and a multi-dimensional analogue of a smoothness condition. We then show that the self-bounding Lipschitz condition gives rise to optimistic bounds for multi-output learning, which are minimax optimal up to logarithmic factors. The proof exploits local Rademacher complexity combined with a powerful minoration inequality due to Srebro, Sridharan and Tewari. As an application we derive a state-of-the-art generalization bound for multi-class gradient boosting.
Tasks	Multi-Label Classification
Published	2020-02-22
URL	https://arxiv.org/abs/2002.09769v1
PDF	https://arxiv.org/pdf/2002.09769v1.pdf
PWC	https://paperswithcode.com/paper/optimistic-bounds-for-multi-output-prediction
Repo
Framework

Learning Object Placements For Relational Instructions by Hallucinating Scene Representations


Title	Learning Object Placements For Relational Instructions by Hallucinating Scene Representations
Authors	Oier Mees, Alp Emek, Johan Vertens, Wolfram Burgard
Abstract	Robots coexisting with humans in their environment and performing services for them need the ability to interact with them. One particular requirement for such robots is that they are able to understand spatial relations and can place objects in accordance with the spatial relations expressed by their user. In this work, we present a convolutional neural network for estimating pixelwise object placement probabilities for a set of spatial relations from a single input image. During training, our network receives the learning signal by classifying hallucinated high-level scene representations as an auxiliary task. Unlike previous approaches, our method does not require ground truth data for the pixelwise relational probabilities or 3D models of the objects, which significantly expands the applicability in practical applications. Our results obtained using real-world data and human-robot experiments demonstrate the effectiveness of our method in reasoning about the best way to place objects to reproduce a spatial relation. Videos of our experiments can be found at https://youtu.be/zaZkHTWFMKM
Tasks
Published	2020-01-23
URL	https://arxiv.org/abs/2001.08481v2
PDF	https://arxiv.org/pdf/2001.08481v2.pdf
PWC	https://paperswithcode.com/paper/learning-object-placements-for-relational
Repo
Framework

Partial Gromov-Wasserstein with Applications on Positive-Unlabeled Learning


Title	Partial Gromov-Wasserstein with Applications on Positive-Unlabeled Learning
Authors	Laetitia Chapel, Mokhtar Z. Alaya, Gilles Gasso
Abstract	Optimal Transport (OT) framework allows defining similarity between probability distributions and provides metrics such as the Wasserstein and Gromov-Wasserstein discrepancies. Classical OT problem seeks a transportation map that preserves the total mass, requiring the mass of the source and target distributions to be the same. This may be too restrictive in certain applications such as color or shape matching, since the distributions may have arbitrary masses or that only a fraction of the total mass has to be transported. Several algorithms have been devised for computing unbalanced Wasserstein metrics but when it comes with the Gromov-Wasserstein problem, no partial formulation is available yet. This precludes from working with distributions that do not lie in the same metric space or when invariance to rotation or translation is needed. In this paper, we address the partial Gromov-Wasserstein problem and propose an algorithm to solve it. We showcase the new formulation in a positive-unlabeled (PU) learning application. To the best of our knowledge, this is the first application of optimal transport in this context and we first highlight that partial Wasserstein-based metrics prove effective in usual PU learning settings. We then demonstrate that partial Gromov-Wasserstein metrics is efficient in scenario where point clouds come from different domains or have different features.
Tasks
Published	2020-02-19
URL	https://arxiv.org/abs/2002.08276v1
PDF	https://arxiv.org/pdf/2002.08276v1.pdf
PWC	https://paperswithcode.com/paper/partial-gromov-wasserstein-with-applications
Repo
Framework

Deontological Ethics By Monotonicity Shape Constraints


Title	Deontological Ethics By Monotonicity Shape Constraints
Authors	Serena Wang, Maya Gupta
Abstract	We demonstrate how easy it is for modern machine-learned systems to violate common deontological ethical principles and social norms such as “favor the less fortunate,” and “do not penalize good attributes.” We propose that in some cases such ethical principles can be incorporated into a machine-learned model by adding shape constraints that constrain the model to respond only positively to relevant inputs. We analyze the relationship between these deontological constraints that act on individuals and the consequentialist group-based fairness goals of one-sided statistical parity and equal opportunity. This strategy works with sensitive attributes that are Boolean or real-valued such as income and age, and can help produce more responsible and trustworthy AI.
Tasks
Published	2020-01-31
URL	https://arxiv.org/abs/2001.11990v2
PDF	https://arxiv.org/pdf/2001.11990v2.pdf
PWC	https://paperswithcode.com/paper/deontological-ethics-by-monotonicity-shape
Repo
Framework

Implementing Dynamic Answer Set Programming


Title	Implementing Dynamic Answer Set Programming
Authors	Pedro Cabalar, Martín Diéguez, Torsten Schaub, François Laferrière
Abstract	We introduce an implementation of an extension of Answer Set Programming (ASP) with language constructs from dynamic (and temporal) logic that provides an expressive computational framework for modeling dynamic applications. Starting from logical foundations, provided by dynamic and temporal equilibrium logics over finite linear traces, we develop a translation of dynamic formulas into temporal logic programs. This provides us with a normal form result establishing the strong equivalence of formulas in different logics. Our translation relies on the introduction of auxiliary atoms to guarantee polynomial space complexity and to provide an embedding that is doomed to be impossible over the same language. Finally, the reduction of dynamic formulas to temporal logic programs allows us to extend ASP with both approaches in a uniform way and to implement both extensions via temporal ASP solvers such as telingo
Tasks
Published	2020-02-17
URL	https://arxiv.org/abs/2002.06916v2
PDF	https://arxiv.org/pdf/2002.06916v2.pdf
PWC	https://paperswithcode.com/paper/implementing-dynamic-answer-set-programming
Repo
Framework

A scheme for automatic differentiation of complex loss functions


Title	A scheme for automatic differentiation of complex loss functions
Authors	Chu Guo, Dario Poletti
Abstract	For a real function, automatic differentiation is such a standard algorithm used to efficiently compute its gradient, that it is integrated in various neural network frameworks. However, despite the recent advances in using complex functions in machine learning and the well-established usefulness of automatic differentiation, the support of automatic differentiation for complex functions is not as well-established and widespread as for real functions. In this work we propose an efficient and seamless scheme to implement automatic differentiation for complex functions, which is a compatible generalization of the current scheme for real functions. This scheme can significantly simplify the implementation of neural networks which use complex numbers.
Tasks
Published	2020-03-02
URL	https://arxiv.org/abs/2003.04295v1
PDF	https://arxiv.org/pdf/2003.04295v1.pdf
PWC	https://paperswithcode.com/paper/a-scheme-for-automatic-differentiation-of
Repo
Framework

SurvLIME: A method for explaining machine learning survival models


Title	SurvLIME: A method for explaining machine learning survival models
Authors	Maxim S. Kovalev, Lev V. Utkin, Ernest M. Kasimov
Abstract	A new method called SurvLIME for explaining machine learning survival models is proposed. It can be viewed as an extension or modification of the well-known method LIME. The main idea behind the proposed method is to apply the Cox proportional hazards model to approximate the survival model at the local area around a test example. The Cox model is used because it considers a linear combination of the example covariates such that coefficients of the covariates can be regarded as quantitative impacts on the prediction. Another idea is to approximate cumulative hazard functions of the explained model and the Cox model by using a set of perturbed points in a local area around the point of interest. The method is reduced to solving an unconstrained convex optimization problem. A lot of numerical experiments demonstrate the SurvLIME efficiency.
Tasks
Published	2020-03-18
URL	https://arxiv.org/abs/2003.08371v1
PDF	https://arxiv.org/pdf/2003.08371v1.pdf
PWC	https://paperswithcode.com/paper/survlime-a-method-for-explaining-machine
Repo
Framework

Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning


Title	Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning
Authors	Yiyan Chen, Li Tao, Xueting Wang, Toshihiko Yamasaki
Abstract	Conventional video summarization approaches based on reinforcement learning have the problem that the reward can only be received after the whole summary is generated. Such kind of reward is sparse and it makes reinforcement learning hard to converge. Another problem is that labelling each frame is tedious and costly, which usually prohibits the construction of large-scale datasets. To solve these problems, we propose a weakly supervised hierarchical reinforcement learning framework, which decomposes the whole task into several subtasks to enhance the summarization quality. This framework consists of a manager network and a worker network. For each subtask, the manager is trained to set a subgoal only by a task-level binary label, which requires much fewer labels than conventional approaches. With the guide of the subgoal, the worker predicts the importance scores for video frames in the subtask by policy gradient according to both global reward and innovative defined sub-rewards to overcome the sparse problem. Experiments on two benchmark datasets show that our proposal has achieved the best performance, even better than supervised approaches.
Tasks	Hierarchical Reinforcement Learning, Supervised Video Summarization, Video Summarization
Published	2020-01-12
URL	https://arxiv.org/abs/2001.05864v2
PDF	https://arxiv.org/pdf/2001.05864v2.pdf
PWC	https://paperswithcode.com/paper/weakly-supervised-video-summarization-by
Repo
Framework

Communication Optimization Strategies for Distributed Deep Learning: A Survey


Title	Communication Optimization Strategies for Distributed Deep Learning: A Survey
Authors	Shuo Ouyang, Dezun Dong, Yemao Xu, Liquan Xiao
Abstract	Recent trends in high-performance computing and deep learning lead to a proliferation of studies on large-scale deep neural network (DNN) training. However, the frequent communication requirements among computation nodes drastically slow down the overall training speed, which makes the bottleneck in distributed training, particularly in clusters with limited network bandwidth. To mitigate the drawbacks of distributed communication, researchers have proposed various optimization strategies. In this paper, we give a comprehensive survey of communication strategies from both algorithm and computer network perspectives. Algorithm optimizations focus on reducing the amount of communication in distributed training, while network optimizations focus on speeding up the communication between distributed devices. At the algorithm level, we describe how to reduce the number of communication rounds and transmitted bits per round, besides we shed light on how to overlap computation and communication. At the network level, we discuss the effect caused by network infrastructures, including communication schemes, network protocols, and topology. Finally, we extrapolate potential challenges and research directions for communication acceleration in distributed DNN training.
Tasks
Published	2020-03-06
URL	https://arxiv.org/abs/2003.03009v1
PDF	https://arxiv.org/pdf/2003.03009v1.pdf
PWC	https://paperswithcode.com/paper/communication-optimization-strategies-for
Repo
Framework

Finding manoeuvre motifs in vehicle telematics


Title	Finding manoeuvre motifs in vehicle telematics
Authors	Maria Inês Silva, Roberto Henriques
Abstract	Driving behaviour has a great impact on road safety. A popular way of analysing driving behaviour is to move the focus to the manoeuvres as they give useful information about the driver who is performing them. In this paper, we investigate a new way of identifying manoeuvres from vehicle telematics data, through motif detection in time-series. We implement a modified version of the Extended Motif Discovery (EMD) algorithm, a classical variable-length motif detection algorithm for time-series and we applied it to the UAH-DriveSet, a publicly available naturalistic driving dataset. After a systematic exploration of the extracted motifs, we were able to conclude that the EMD algorithm was not only capable of extracting simple manoeuvres such as accelerations, brakes and curves, but also more complex manoeuvres, such as lane changes and overtaking manoeuvres, which validates motif discovery as a worthwhile line for future research.
Tasks	Time Series
Published	2020-02-10
URL	https://arxiv.org/abs/2002.04127v1
PDF	https://arxiv.org/pdf/2002.04127v1.pdf
PWC	https://paperswithcode.com/paper/finding-manoeuvre-motifs-in-vehicle
Repo
Framework

A Prototype of Serverless Lucene


Title	A Prototype of Serverless Lucene
Authors	Jimmy Lin
Abstract	This paper describes a working prototype that adapts Lucene, the world’s most popular and most widely deployed open-source search library, to operate within a serverless environment in the cloud. Although the serverless search concept is not new, this work represents a substantial improvement over a previous implementation in eliminating most custom code and in enabling interactive search. While there remain limitations to the design, it nevertheless challenges conventional thinking about search architectures for particular operating points.
Tasks
Published	2020-02-04
URL	https://arxiv.org/abs/2002.01447v1
PDF	https://arxiv.org/pdf/2002.01447v1.pdf
PWC	https://paperswithcode.com/paper/a-prototype-of-serverless-lucene
Repo
Framework

FMore: An Incentive Scheme of Multi-dimensional Auction for Federated Learning in MEC


Title	FMore: An Incentive Scheme of Multi-dimensional Auction for Federated Learning in MEC
Authors	Rongfei Zeng, Shixun Zhang, Jiaqi Wang, Xiaowen Chu
Abstract	Promising federated learning coupled with Mobile Edge Computing (MEC) is considered as one of the most promising solutions to the AI-driven service provision. Plenty of studies focus on federated learning from the performance and security aspects, but they neglect the incentive mechanism. In MEC, edge nodes would not like to voluntarily participate in learning, and they differ in the provision of multi-dimensional resources, both of which might deteriorate the performance of federated learning. Also, lightweight schemes appeal to edge nodes in MEC. These features require the incentive mechanism to be well designed for MEC. In this paper, we present an incentive mechanism FMore with multi-dimensional procurement auction of K winners. Our proposal FMore not only is lightweight and incentive compatible, but also encourages more high-quality edge nodes with low cost to participate in learning and eventually improve the performance of federated learning. We also present theoretical results of Nash equilibrium strategy to edge nodes and employ the expected utility theory to provide guidance to the aggregator. Both extensive simulations and real-world experiments demonstrate that the proposed scheme can effectively reduce the training rounds and drastically improve the model accuracy for challenging AI tasks.
Tasks
Published	2020-02-22
URL	https://arxiv.org/abs/2002.09699v1
PDF	https://arxiv.org/pdf/2002.09699v1.pdf
PWC	https://paperswithcode.com/paper/fmore-an-incentive-scheme-of-multi
Repo
Framework

DeFraudNet:End2End Fingerprint Spoof Detection using Patch Level Attention


Title	DeFraudNet:End2End Fingerprint Spoof Detection using Patch Level Attention
Authors	B. V. S Anusha, Sayan Banerjee, Subhasis Chaudhuri
Abstract	In recent years, fingerprint recognition systems have made remarkable advancements in the field of biometric security as it plays an important role in personal, national and global security. In spite of all these notable advancements, the fingerprint recognition technology is still susceptible to spoof attacks which can significantly jeopardize the user security. The cross sensor and cross material spoof detection still pose a challenge with a myriad of spoof materials emerging every day, compromising sensor interoperability and robustness. This paper proposes a novel method for fingerprint spoof detection using both global and local fingerprint feature descriptors. These descriptors are extracted using DenseNet which significantly improves cross-sensor, cross-material and cross-dataset performance. A novel patch attention network is used for finding the most discriminative patches and also for network fusion. We evaluate our method on four publicly available datasets:LivDet 2011, 2013, 2015 and 2017. A set of comprehensive experiments are carried out to evaluate cross-sensor, cross-material and cross-dataset performance over these datasets. The proposed approach achieves an average accuracy of 99.52%, 99.16% and 99.72% on LivDet 2017,2015 and 2011 respectively outperforming the current state-of-the-art results by 3% and 4% for LivDet 2015 and 2011 respectively.
Tasks
Published	2020-02-19
URL	https://arxiv.org/abs/2002.08214v1
PDF	https://arxiv.org/pdf/2002.08214v1.pdf
PWC	https://paperswithcode.com/paper/defraudnetend2end-fingerprint-spoof-detection
Repo
Framework

Learning Canonical Shape Space for Category-Level 6D Object Pose and Size Estimation


Title	Learning Canonical Shape Space for Category-Level 6D Object Pose and Size Estimation
Authors	Dengsheng Chen, Jun Li, Zheng Wang, Kai Xu
Abstract	We present a novel approach to category-level 6D object pose and size estimation. To tackle intra-class shape variations, we learn canonical shape space (CASS), a unified representation for a large variety of instances of a certain object category. In particular, CASS is modeled as the latent space of a deep generative model of canonical 3D shapes with normalized pose. We train a variational auto-encoder (VAE) for generating 3D point clouds in the canonical space from an RGBD image. The VAE is trained in a cross-category fashion, exploiting the publicly available large 3D shape repositories. Since the 3D point cloud is generated in normalized pose (with actual size), the encoder of the VAE learns view-factorized RGBD embedding. It maps an RGBD image in arbitrary view into a pose-independent 3D shape representation. Object pose is then estimated via contrasting it with a pose-dependent feature of the input RGBD extracted with a separate deep neural networks. We integrate the learning of CASS and pose and size estimation into an end-to-end trainable network, achieving the state-of-the-art performance.
Tasks	3D Shape Representation, Generating 3D Point Clouds
Published	2020-01-25
URL	https://arxiv.org/abs/2001.09322v2
PDF	https://arxiv.org/pdf/2001.09322v2.pdf
PWC	https://paperswithcode.com/paper/learning-canonical-shape-space-for-category
Repo
Framework

I-ViSE: Interactive Video Surveillance as an Edge Service using Unsupervised Feature Queries


Title	I-ViSE: Interactive Video Surveillance as an Edge Service using Unsupervised Feature Queries
Authors	Seyed Yahya Nikouei, Yu Chen, Alexander Aved, Erik Blasch
Abstract	Situation AWareness (SAW) is essential for many mission critical applications. However, SAW is very challenging when trying to immediately identify objects of interest or zoom in on suspicious activities from thousands of video frames. This work aims at developing a queryable system to instantly select interesting content. While face recognition technology is mature, in many scenarios like public safety monitoring, the features of objects of interest may be much more complicated than face features. In addition, human operators may not be always able to provide a descriptive, simple, and accurate query. Actually, it is more often that there are only rough, general descriptions of certain suspicious objects or accidents. This paper proposes an Interactive Video Surveillance as an Edge service (I-ViSE) based on unsupervised feature queries. Adopting unsupervised methods that do not reveal any private information, the I-ViSE scheme utilizes general features of a human body and color of clothes. An I-ViSE prototype is built following the edge-fog computing paradigm and the experimental results verified the I-ViSE scheme meets the design goal of scene recognition in less than two seconds.
Tasks	Face Recognition, Scene Recognition
Published	2020-03-09
URL	https://arxiv.org/abs/2003.04169v1
PDF	https://arxiv.org/pdf/2003.04169v1.pdf
PWC	https://paperswithcode.com/paper/i-vise-interactive-video-surveillance-as-an
Repo
Framework