January 30, 2020

3198 words 16 mins read

Paper Group ANR 329

A geometric interpretation of stochastic gradient descent using diffusion metrics. Single Image Reflection Removal with Physically-based Rendering. From explanation to synthesis: Compositional program induction for learning from demonstration. Classification As Decoder: Trading Flexibility For Control In Neural Dialogue. Reliable training and estim …

A geometric interpretation of stochastic gradient descent using diffusion metrics


Title	A geometric interpretation of stochastic gradient descent using diffusion metrics
Authors	R. Fioresi, P. Chaudhari, S. Soatto
Abstract	Stochastic gradient descent (SGD) is a key ingredient in the training of deep neural networks and yet its geometrical significance appears elusive. We study a deterministic model in which the trajectories of our dynamical systems are described via geodesics of a family of metrics arising from the diffusion matrix. These metrics encode information about the highly non-isotropic gradient noise in SGD. We establish a parallel with General Relativity models, where the role of the electromagnetic field is played by the gradient of the loss function. We compute an example of a two layer network.
Tasks
Published	2019-10-27
URL	https://arxiv.org/abs/1910.12194v1
PDF	https://arxiv.org/pdf/1910.12194v1.pdf
PWC	https://paperswithcode.com/paper/a-geometric-interpretation-of-stochastic
Repo
Framework

Single Image Reflection Removal with Physically-based Rendering


Title	Single Image Reflection Removal with Physically-based Rendering
Authors	Soomin Kim, Yuchi Huo, Sung-Eui Yoon
Abstract	Recently, deep learning based single image reflection separation methods have been exploited widely. To benefit the learning approach, a large number of training image pairs (i.e., with and without reflections) were synthesized in various ways, yet they are away from a physically-based direction. In this paper, physically based rendering is used for faithfully synthesizing the required training images, and corresponding network structure is proposed. We utilize existing image data to estimate mesh, then physically simulate the depth-dependent light transportation between mesh, glass, and lens with path tracing. For guiding the separation better, we additionally consider a module of removing complicated ghosting and blurring glass-effects, which allows obtaining priori information before having the glass distortion. This module is easily accommodated within our approach, since that prior information can be physically generated by our rendering process. The proposed method considering the priori information as well as the existing posterior information is validated with various real reflection images, and is demonstrated to show visually pleasant and numerically better results compared to the state-of-theart techniques.
Tasks
Published	2019-04-26
URL	http://arxiv.org/abs/1904.11934v1
PDF	http://arxiv.org/pdf/1904.11934v1.pdf
PWC	https://paperswithcode.com/paper/single-image-reflection-removal-with
Repo
Framework

From explanation to synthesis: Compositional program induction for learning from demonstration


Title	From explanation to synthesis: Compositional program induction for learning from demonstration
Authors	Michael Burke, Svetlin Penkov, Subramanian Ramamoorthy
Abstract	Hybrid systems are a compact and natural mechanism with which to address problems in robotics. This work introduces an approach to learning hybrid systems from demonstrations, with an emphasis on extracting models that are explicitly verifiable and easily interpreted by robot operators. We fit a sequence of controllers using sequential importance sampling under a generative switching proportional controller task model. Here, we parameterise controllers using a proportional gain and a visually verifiable joint angle goal. Inference under this model is challenging, but we address this by introducing an attribution prior extracted from a neural end-to-end visuomotor control model. Given the sequence of controllers comprising a task, we simplify the trace using grammar parsing strategies, taking advantage of the sequence compositionality, before grounding the controllers by training perception networks to predict goals given images. Using this approach, we are successfully able to induce a program for a visuomotor reaching task involving loops and conditionals from a single demonstration and a neural end-to-end model. In addition, we are able to discover the program used for a tower building task. We argue that computer program-like control systems are more interpretable than alternative end-to-end learning approaches, and that hybrid systems inherently allow for better generalisation across task configurations.
Tasks
Published	2019-02-27
URL	https://arxiv.org/abs/1902.10657v2
PDF	https://arxiv.org/pdf/1902.10657v2.pdf
PWC	https://paperswithcode.com/paper/from-explanation-to-synthesis-compositional
Repo
Framework

Classification As Decoder: Trading Flexibility For Control In Neural Dialogue


Title	Classification As Decoder: Trading Flexibility For Control In Neural Dialogue
Authors	Sam Shleifer, Manish Chablani, Namit Katariya, Anitha Kannan, Xavier Amatriain
Abstract	Generative seq2seq dialogue systems are trained to predict the next word in dialogues that have already occurred. They can learn from large unlabeled conversation datasets, build a deep understanding of conversational context, and generate a wide variety of responses. This flexibility comes at the cost of control. Undesirable responses in the training data will be reproduced by the model at inference time, and longer generations often don’t make sense. Instead of generating responses one word at a time, we train a classifier to choose from a predefined list of full responses. The classifier is trained on (conversation context, response class) pairs, where each response class is a noisily labeled group of interchangeable responses. At inference, we generate the exemplar response associated with the predicted response class. Experts can edit and improve these exemplar responses over time without retraining the classifier or invalidating old training data. Human evaluation of 775 unseen doctor/patient conversations shows that this tradeoff improves responses. Only 12% of our discriminative approach’s responses are worse than the doctor’s response in the same conversational context, compared to 18% for the generative model. A discriminative model trained without any manual labeling of response classes achieves equal performance to the generative model.
Tasks
Published	2019-10-04
URL	https://arxiv.org/abs/1910.03476v3
PDF	https://arxiv.org/pdf/1910.03476v3.pdf
PWC	https://paperswithcode.com/paper/classification-as-decoder-trading-flexibility
Repo
Framework

Reliable training and estimation of variance networks


Title	Reliable training and estimation of variance networks
Authors	Nicki S. Detlefsen, Martin Jørgensen, Søren Hauberg
Abstract	We propose and investigate new complementary methodologies for estimating predictive variance networks in regression neural networks. We derive a locally aware mini-batching scheme that result in sparse robust gradients, and show how to make unbiased weight updates to a variance network. Further, we formulate a heuristic for robustly fitting both the mean and variance networks post hoc. Finally, we take inspiration from posterior Gaussian processes and propose a network architecture with similar extrapolation properties to Gaussian processes. The proposed methodologies are complementary, and improve upon baseline methods individually. Experimentally, we investigate the impact on predictive uncertainty on multiple datasets and tasks ranging from regression, active learning and generative modeling. Experiments consistently show significant improvements in predictive uncertainty estimation over state-of-the-art methods across tasks and datasets.
Tasks	Active Learning, Gaussian Processes
Published	2019-06-04
URL	https://arxiv.org/abs/1906.03260v2
PDF	https://arxiv.org/pdf/1906.03260v2.pdf
PWC	https://paperswithcode.com/paper/reliable-training-and-estimation-of-variance
Repo
Framework

Fair-by-design explainable models for prediction of recidivism


Title	Fair-by-design explainable models for prediction of recidivism
Authors	Eduardo Soares, Plamen Angelov
Abstract	Recidivism prediction provides decision makers with an assessment of the likelihood that a criminal defendant will reoffend that can be used in pre-trial decision-making. It can also be used for prediction of locations where crimes most occur, profiles that are more likely to commit violent crimes. While such instruments are gaining increasing popularity, their use is controversial as they may present potential discriminatory bias in the risk assessment. In this paper we propose a new fair-by-design approach to predict recidivism. It is prototype-based, learns locally and extracts empirically the data distribution. The results show that the proposed method is able to reduce the bias and provide human interpretable rules to assist specialists in the explanation of the given results.
Tasks	Decision Making
Published	2019-09-18
URL	https://arxiv.org/abs/1910.02043v1
PDF	https://arxiv.org/pdf/1910.02043v1.pdf
PWC	https://paperswithcode.com/paper/fair-by-design-explainable-models-for
Repo
Framework

Large-scale Gastric Cancer Screening and Localization Using Multi-task Deep Neural Network


Title	Large-scale Gastric Cancer Screening and Localization Using Multi-task Deep Neural Network
Authors	Hong Yu, Xiaofan Zhang, Lingjun Song, Liren Jiang, Xiaodi Huang, Wen Chen, Chenbin Zhang, Jiahui Li, Jiji Yang, Zhiqiang Hu, Qi Duan, Wanyuan Chen, Xianglei He, Jinshuang Fan, Weihai Jiang, Li Zhang, Chengmin Qiu, Minmin Gu, Weiwei Sun, Yangqiong Zhang, Guangyin Peng, Weiwei Shen, Guohui Fu
Abstract	Gastric cancer is one of the most common cancers, which ranks third among the leading causes of cancer death. Biopsy of gastric mucosal is a standard procedure in gastric cancer screening test. However, manual pathological inspection is labor-intensive and time-consuming. Besides, it is challenging for an automated algorithm to locate the small lesion regions in the gigapixel whole-slide image and make the decision correctly. To tackle these issues, we collected large-scale whole-slide image dataset with detailed lesion region annotation and designed a whole-slide image analyzing framework consisting of 3 networks which could not only determine the screen result but also present the suspicious areas to the pathologist for reference. Experiments demonstrated that our proposed framework achieves sensitivity of 97.05% and specificity of 92.72% in screening task and Dice coefficient of 0.8331 in segmentation task. Furthermore, we tested our best model in real-world scenario on 10, 316 whole-slide images collected from 4 medical centers.
Tasks
Published	2019-10-09
URL	https://arxiv.org/abs/1910.03729v2
PDF	https://arxiv.org/pdf/1910.03729v2.pdf
PWC	https://paperswithcode.com/paper/large-scale-gastric-cancer-screening-and
Repo
Framework

Pre-trained Contextual Embedding of Source Code


Title	Pre-trained Contextual Embedding of Source Code
Authors	Aditya Kanade, Petros Maniatis, Gogul Balakrishnan, Kensen Shi
Abstract	The source code of a program not only serves as a formal description of an executable task, but it also serves to communicate developer intent in a human-readable form. To facilitate this, developers use meaningful identifier names and natural-language documentation. This makes it possible to successfully apply sequence-modeling approaches, shown to be effective in natural-language processing, to source code. A major advancement in natural-language understanding has been the use of pre-trained token embeddings; BERT and other works have further shown that pre-trained contextual embeddings can be extremely powerful and can be fine-tuned effectively for a variety of downstream supervised tasks. Inspired by these developments, we present the first attempt to replicate this success on source code. We curate a massive corpus of Python programs from GitHub to pre-train a BERT model, which we call Code Understanding BERT (CuBERT). We also pre-train Word2Vec embeddings on the same dataset. We create a benchmark of five classification tasks and compare fine-tuned CuBERT against sequence models trained with and without the Word2Vec embeddings. Our results show that CuBERT outperforms the baseline methods by a margin of 2.9-22%. We also show its superiority when fine-tuned with smaller datasets, and over fewer epochs. We further evaluate CuBERT’s effectiveness on a joint classification, localization and repair task involving prediction of two pointers.
Tasks
Published	2019-12-21
URL	https://arxiv.org/abs/2001.00059v1
PDF	https://arxiv.org/pdf/2001.00059v1.pdf
PWC	https://paperswithcode.com/paper/pre-trained-contextual-embedding-of-source-1
Repo
Framework

Learning Latent Representations of Bank Customers With The Variational Autoencoder


Title	Learning Latent Representations of Bank Customers With The Variational Autoencoder
Authors	Rogelio A Mancisidor, Michael Kampffmeyer, Kjersti Aas, Robert Jenssen
Abstract	Learning data representations that reflect the customers’ creditworthiness can improve marketing campaigns, customer relationship management, data and process management or the credit risk assessment in retail banks. In this research, we adopt the Variational Autoencoder (VAE), which has the ability to learn latent representations that contain useful information. We show that it is possible to steer the latent representations in the latent space of the VAE using the Weight of Evidence and forming a specific grouping of the data that reflects the customers’ creditworthiness. Our proposed method learns a latent representation of the data, which shows a well-defied clustering structure capturing the customers’ creditworthiness. These clusters are well suited for the aforementioned banks’ activities. Further, our methodology generalizes to new customers, captures high-dimensional and complex financial data, and scales to large data sets.
Tasks
Published	2019-03-14
URL	http://arxiv.org/abs/1903.06580v1
PDF	http://arxiv.org/pdf/1903.06580v1.pdf
PWC	https://paperswithcode.com/paper/learning-latent-representations-of-bank
Repo
Framework

Accelerated learning from recommender systems using multi-armed bandit


Title	Accelerated learning from recommender systems using multi-armed bandit
Authors	Meisam Hejazinia, Kyler Eastman, Shuqin Ye, Abbas Amirabadi, Ravi Divvela
Abstract	Recommendation systems are a vital component of many online marketplaces, where there are often millions of items to potentially present to users who have a wide variety of wants or needs. Evaluating recommender system algorithms is a hard task, given all the inherent bias in the data, and successful companies must be able to rapidly iterate on their solution to maintain their competitive advantage. The gold standard for evaluating recommendation algorithms has been the A/B test since it is an unbiased way to estimate how well one or more algorithms compare in the real world. However, there are a number of issues with A/B testing that make it impractical to be the sole method of testing, including long lead time, and high cost of exploration. We argue that multi armed bandit (MAB) testing as a solution to these issues. We showcase how we implemented a MAB solution as an extra step between offline and online A/B testing in a production system. We present the result of our experiment and compare all the offline, MAB, and online A/B tests metrics for our use case.
Tasks	Recommendation Systems
Published	2019-08-16
URL	https://arxiv.org/abs/1908.06158v1
PDF	https://arxiv.org/pdf/1908.06158v1.pdf
PWC	https://paperswithcode.com/paper/accelerated-learning-from-recommender-systems
Repo
Framework

Data-Driven Crowd Simulation with Generative Adversarial Networks


Title	Data-Driven Crowd Simulation with Generative Adversarial Networks
Authors	Javad Amirian, Wouter van Toll, Jean-Bernard Hayet, Julien Pettré
Abstract	This paper presents a novel data-driven crowd simulation method that can mimic the observed traffic of pedestrians in a given environment. Given a set of observed trajectories, we use a recent form of neural networks, Generative Adversarial Networks (GANs), to learn the properties of this set and generate new trajectories with similar properties. We define a way for simulated pedestrians (agents) to follow such a trajectory while handling local collision avoidance. As such, the system can generate a crowd that behaves similarly to observations, while still enabling real-time interactions between agents. Via experiments with real-world data, we show that our simulated trajectories preserve the statistical properties of their input. Our method simulates crowds in real time that resemble existing crowds, while also allowing insertion of extra agents, combination with other simulation methods, and user interaction.
Tasks
Published	2019-05-23
URL	https://arxiv.org/abs/1905.09661v1
PDF	https://arxiv.org/pdf/1905.09661v1.pdf
PWC	https://paperswithcode.com/paper/data-driven-crowd-simulation-with-generative
Repo
Framework

AMS-SFE: Towards an Alignment of Manifold Structures via Semantic Feature Expansion for Zero-shot Learning


Title	AMS-SFE: Towards an Alignment of Manifold Structures via Semantic Feature Expansion for Zero-shot Learning
Authors	Jingcai Guo, Song Guo
Abstract	Zero-shot learning (ZSL) aims at recognizing unseen classes with knowledge transferred from seen classes. This is typically achieved by exploiting a semantic feature space (FS) shared by both seen and unseen classes, i.e., attributes or word vectors, as the bridge. However, due to the mutually disjoint of training (seen) and testing (unseen) data, existing ZSL methods easily and commonly suffer from the domain shift problem. To address this issue, we propose a novel model called AMS-SFE. It considers the Alignment of Manifold Structures by Semantic Feature Expansion. Specifically, we build up an autoencoder based model to expand the semantic features and joint with an alignment to an embedded manifold extracted from the visual FS of data. It is the first attempt to align these two FSs by way of expanding semantic features. Extensive experiments show the remarkable performance improvement of our model compared with other existing methods.
Tasks	Zero-Shot Learning
Published	2019-04-12
URL	http://arxiv.org/abs/1904.06254v1
PDF	http://arxiv.org/pdf/1904.06254v1.pdf
PWC	https://paperswithcode.com/paper/ams-sfe-towards-an-alignment-of-manifold
Repo
Framework


Title	A model of brain morphological changes related to aging and Alzheimer’s disease from cross-sectional assessments
Authors	Raphaël Sivera, Hervé Delingette, Marco Lorenzi, Xavier Pennec, Nicholas Ayache
Abstract	In this study we propose a deformation-based framework to jointly model the influence of aging and Alzheimer’s disease (AD) on the brain morphological evolution. Our approach combines a spatio-temporal description of both processes into a generative model. A reference morphology is deformed along specific trajectories to match subject specific morphologies. It is used to define two imaging progression markers: 1) a morphological age and 2) a disease score. These markers can be computed locally in any brain region. The approach is evaluated on brain structural magnetic resonance images (MRI) from the ADNI database. The generative model is first estimated on a control population, then, for each subject, the markers are computed for each acquisition. The longitudinal evolution of these markers is then studied in relation with the clinical diagnosis of the subjects and used to generate possible morphological evolution. In the model, the morphological changes associated with normal aging are mainly found around the ventricles, while the Alzheimer’s disease specific changes are more located in the temporal lobe and the hippocampal area. The statistical analysis of these markers highlights differences between clinical conditions even though the inter-subject variability is quiet high. In this context, the model can be used to generate plausible morphological trajectories associated with the disease. Our method gives two interpretable scalar imaging biomarkers assessing the effects of aging and disease on brain morphology at the individual and population level. These markers confirm an acceleration of apparent aging for Alzheimer’s subjects and can help discriminate clinical conditions even in prodromal stages. More generally, the joint modeling of normal and pathological evolutions shows promising results to describe age-related brain diseases over long time scales.
Tasks
Published	2019-05-23
URL	https://arxiv.org/abs/1905.09826v1
PDF	https://arxiv.org/pdf/1905.09826v1.pdf
PWC	https://paperswithcode.com/paper/a-model-of-brain-morphological-changes
Repo
Framework

Soft Marginal TransE for Scholarly Knowledge Graph Completion


Title	Soft Marginal TransE for Scholarly Knowledge Graph Completion
Authors	Mojtaba Nayyeri, Sahar Vahdati, Jens Lehmann, Hamed Shariat Yazdi
Abstract	Knowledge graphs (KGs), i.e. representation of information as a semantic graph, provide a significant test bed for many tasks including question answering, recommendation, and link prediction. Various amount of scholarly metadata have been made vailable as knowledge graphs from the diversity of data providers and agents. However, these high-quantities of data remain far from quality criteria in terms of completeness while growing at a rapid pace. Most of the attempts in completing such KGs are following traditional data digitization, harvesting and collaborative curation approaches. Whereas, advanced AI-related approaches such as embedding models - specifically designed for such tasks - are usually evaluated for standard benchmarks such as Freebase and Wordnet. The tailored nature of such datasets prevents those approaches to shed the lights on more accurate discoveries. Application of such models on domain-specific KGs takes advantage of enriched meta-data and provides accurate results where the underlying domain can enormously benefit. In this work, the TransE embedding model is reconciled for a specific link prediction task on scholarly metadata. The results show a significant shift in the accuracy and performance evaluation of the model on a dataset with scholarly metadata. The newly proposed version of TransE obtains 99.9% for link prediction task while original TransE gets 95%. In terms of accuracy and Hit@10, TransE outperforms other embedding models such as ComplEx, TransH and TransR experimented over scholarly knowledge graphs
Tasks	Knowledge Graph Completion, Knowledge Graphs, Link Prediction, Question Answering
Published	2019-04-27
URL	http://arxiv.org/abs/1904.12211v1
PDF	http://arxiv.org/pdf/1904.12211v1.pdf
PWC	https://paperswithcode.com/paper/soft-marginal-transe-for-scholarly-knowledge
Repo
Framework

Lifelong Learning with a Changing Action Set


Title	Lifelong Learning with a Changing Action Set
Authors	Yash Chandak, Georgios Theocharous, Chris Nota, Philip S. Thomas
Abstract	In many real-world sequential decision making problems, the number of available actions (decisions) can vary over time. While problems like catastrophic forgetting, changing transition dynamics, changing rewards functions, etc. have been well-studied in the lifelong learning literature, the setting where the action set changes remains unaddressed. In this paper, we present an algorithm that autonomously adapts to an action set whose size changes over time. To tackle this open problem, we break it into two problems that can be solved iteratively: inferring the underlying, unknown, structure in the space of actions and optimizing a policy that leverages this structure. We demonstrate the efficiency of this approach on large-scale real-world lifelong learning problems.
Tasks	Decision Making
Published	2019-06-05
URL	https://arxiv.org/abs/1906.01770v2
PDF	https://arxiv.org/pdf/1906.01770v2.pdf
PWC	https://paperswithcode.com/paper/lifelong-learning-with-a-changing-action-set
Repo
Framework