January 29, 2020

2869 words 14 mins read

Paper Group ANR 510

Paper Group ANR 510

Deep learning for molecular design - a review of the state of the art. Procedural Generation of Initial States of Sokoban. Scalable Hyperbolic Recommender Systems. How far should self-driving cars see? Effect of observation range on vehicle self-localization. Degeneracy in Self-Calibration Revisited and a Deep Learning Solution for Uncalibrated SLA …

Deep learning for molecular design - a review of the state of the art

Title Deep learning for molecular design - a review of the state of the art
Authors Daniel C. Elton, Zois Boukouvalas, Mark D. Fuge, Peter W. Chung
Abstract In the space of only a few years, deep generative modeling has revolutionized how we think of artificial creativity, yielding autonomous systems which produce original images, music, and text. Inspired by these successes, researchers are now applying deep generative modeling techniques to the generation and optimization of molecules - in our review we found 45 papers on the subject published in the past two years. These works point to a future where such systems will be used to generate lead molecules, greatly reducing resources spent downstream synthesizing and characterizing bad leads in the lab. In this review we survey the increasingly complex landscape of models and representation schemes that have been proposed. The four classes of techniques we describe are recursive neural networks, autoencoders, generative adversarial networks, and reinforcement learning. After first discussing some of the mathematical fundamentals of each technique, we draw high level connections and comparisons with other techniques and expose the pros and cons of each. Several important high level themes emerge as a result of this work, including the shift away from the SMILES string representation of molecules towards more sophisticated representations such as graph grammars and 3D representations, the importance of reward function design, the need for better standards for benchmarking and testing, and the benefits of adversarial training and reinforcement learning over maximum likelihood based training.
Tasks
Published 2019-03-11
URL https://arxiv.org/abs/1903.04388v3
PDF https://arxiv.org/pdf/1903.04388v3.pdf
PWC https://paperswithcode.com/paper/deep-learning-for-molecular-generation-and
Repo
Framework

Procedural Generation of Initial States of Sokoban

Title Procedural Generation of Initial States of Sokoban
Authors Dâmaris S. Bento, André G. Pereira, Levi H. S. Lelis
Abstract Procedural generation of initial states of state-space search problems have applications in human and machine learning as well as in the evaluation of planning systems. In this paper we deal with the task of generating hard and solvable initial states of Sokoban puzzles. We propose hardness metrics based on pattern database heuristics and the use of novelty to improve the exploration of search methods in the task of generating initial states. We then present a system called Beta that uses our hardness metrics and novelty to generate initial states. Experiments show that Beta is able to generate initial states that are harder to solve by a specialized solver than those designed by human experts.
Tasks
Published 2019-07-04
URL https://arxiv.org/abs/1907.02548v1
PDF https://arxiv.org/pdf/1907.02548v1.pdf
PWC https://paperswithcode.com/paper/procedural-generation-of-initial-states-of
Repo
Framework

Scalable Hyperbolic Recommender Systems

Title Scalable Hyperbolic Recommender Systems
Authors Benjamin Paul Chamberlain, Stephen R. Hardwick, David R. Wardrope, Fabon Dzogang, Fabio Daolio, Saúl Vargas
Abstract We present a large scale hyperbolic recommender system. We discuss why hyperbolic geometry is a more suitable underlying geometry for many recommendation systems and cover the fundamental milestones and insights that we have gained from its development. In doing so, we demonstrate the viability of hyperbolic geometry for recommender systems, showing that they significantly outperform Euclidean models on datasets with the properties of complex networks. Key to the success of our approach are the novel choice of underlying hyperbolic model and the use of the Einstein midpoint to define an asymmetric recommender system in hyperbolic space. These choices allow us to scale to millions of users and hundreds of thousands of items.
Tasks Recommendation Systems
Published 2019-02-22
URL http://arxiv.org/abs/1902.08648v1
PDF http://arxiv.org/pdf/1902.08648v1.pdf
PWC https://paperswithcode.com/paper/scalable-hyperbolic-recommender-systems
Repo
Framework

How far should self-driving cars see? Effect of observation range on vehicle self-localization

Title How far should self-driving cars see? Effect of observation range on vehicle self-localization
Authors Mahdi Javanmardi, Ehsan Javanmardi, Shunsuke Kamijo
Abstract Accuracy and time efficiency are two essential requirements for the self-localization of autonomous vehicles. While the observation range considered for simultaneous localization and mapping (SLAM) has a significant effect on both accuracy and computation time, its effect is not well investigated in the literature. In this paper, we will answer the question: How far should a driverless car observe during self-localization? We introduce a framework to dynamically define the observation range for localization to meet the accuracy requirement for autonomous driving, while keeping the computation time low. To model the effect of scanning range on the localization accuracy for every point on the map, several map factors were employed. The capability of the proposed framework was verified using field data, demonstrating that it is able to improve the average matching time from 142.2 ms to 39.3 ms while keeping the localization accuracy around 8.1 cm.
Tasks Autonomous Driving, Autonomous Vehicles, Self-Driving Cars, Simultaneous Localization and Mapping
Published 2019-08-19
URL https://arxiv.org/abs/1908.06588v1
PDF https://arxiv.org/pdf/1908.06588v1.pdf
PWC https://paperswithcode.com/paper/how-far-should-self-driving-cars-see-effect
Repo
Framework

Degeneracy in Self-Calibration Revisited and a Deep Learning Solution for Uncalibrated SLAM

Title Degeneracy in Self-Calibration Revisited and a Deep Learning Solution for Uncalibrated SLAM
Authors Bingbing Zhuang, Quoc-Huy Tran, Pan Ji, Gim Hee Lee, Loong Fah Cheong, Manmohan Chandraker
Abstract Self-calibration of camera intrinsics and radial distortion has a long history of research in the computer vision community. However, it remains rare to see real applications of such techniques to modern Simultaneous Localization And Mapping (SLAM) systems, especially in driving scenarios. In this paper, we revisit the geometric approach to this problem, and provide a theoretical proof that explicitly shows the ambiguity between radial distortion and scene depth when two-view geometry is used to self-calibrate the radial distortion. In view of such geometric degeneracy, we propose a learning approach that trains a convolutional neural network (CNN) on a large amount of synthetic data. We demonstrate the utility of our proposed method by applying it as a checkerboard-free calibration tool for SLAM, achieving comparable or superior performance to previous learning and hand-crafted methods.
Tasks Calibration, Simultaneous Localization and Mapping
Published 2019-07-30
URL https://arxiv.org/abs/1907.13185v1
PDF https://arxiv.org/pdf/1907.13185v1.pdf
PWC https://paperswithcode.com/paper/degeneracy-in-self-calibration-revisited-and
Repo
Framework

Katyusha Acceleration for Convex Finite-Sum Compositional Optimization

Title Katyusha Acceleration for Convex Finite-Sum Compositional Optimization
Authors Yibo Xu, Yangyang Xu
Abstract Structured problems arise in many applications. To solve these problems, it is important to leverage the structure information. This paper focuses on convex problems with a finite-sum compositional structure. Finite-sum problems appear as the sample average approximation of a stochastic optimization problem and also arise in machine learning with a huge amount of training data. One popularly used numerical approach for finite-sum problems is the stochastic gradient method (SGM). However, the additional compositional structure prohibits easy access to unbiased stochastic approximation of the gradient, so directly applying the SGM to a finite-sum compositional optimization problem (COP) is often inefficient. We design new algorithms for solving strongly-convex and also convex two-level finite-sum COPs. Our design incorporates the Katyusha acceleration technique and adopts the mini-batch sampling from both outer-level and inner-level finite-sum. We first analyze the algorithm for strongly-convex finite-sum COPs. Similar to a few existing works, we obtain linear convergence rate in terms of the expected objective error, and from the convergence rate result, we then establish complexity results of the algorithm to produce an $\varepsilon$-solution. Our complexity results have the same dependence on the number of component functions as existing works. However, due to the use of Katyusha acceleration, our results have better dependence on the condition number $\kappa$ and improve to $\kappa^{2.5}$ from the best-known $\kappa^3$. Finally, we analyze the algorithm for convex finite-sum COPs, which uses as a subroutine the algorithm for strongly-convex finite-sum COPs. Again, we obtain better complexity results than existing works in terms of the dependence on $\varepsilon$, improving to $\varepsilon^{-2.5}$ from the best-known $\varepsilon^{-3}$.
Tasks Stochastic Optimization
Published 2019-10-24
URL https://arxiv.org/abs/1910.11217v1
PDF https://arxiv.org/pdf/1910.11217v1.pdf
PWC https://paperswithcode.com/paper/katyusha-acceleration-for-convex-finite-sum
Repo
Framework

Drug Repurposing for Cancer: An NLP Approach to Identify Low-Cost Therapies

Title Drug Repurposing for Cancer: An NLP Approach to Identify Low-Cost Therapies
Authors Shivashankar Subramanian, Ioana Baldini, Sushma Ravichandran, Dmitriy A. Katz-Rogozhnikov, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Kush R. Varshney, Annmarie Wang, Pradeep Mangalath, Laura B. Kleiman
Abstract More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of generic drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs worldwide. Evidence on the efficacy of non-cancer generic drugs being tested for cancer exists in scientific publications, but trying to manually identify and extract such evidence is intractable. In this paper, we introduce a system to automate this evidence extraction from PubMed abstracts. Our primary contribution is to define the natural language processing pipeline required to obtain such evidence, comprising the following modules: querying, filtering, cancer type entity extraction, therapeutic association classification, and study type classification. Using the subject matter expertise on our team, we create our own datasets for these specialized domain-specific tasks. We obtain promising performance in each of the modules by utilizing modern language modeling techniques and plan to treat them as baseline approaches for future improvement of individual components.
Tasks Entity Extraction, Language Modelling
Published 2019-11-18
URL https://arxiv.org/abs/1911.07819v2
PDF https://arxiv.org/pdf/1911.07819v2.pdf
PWC https://paperswithcode.com/paper/drug-repurposing-for-cancer-an-nlp-approach
Repo
Framework

Conductor Galloping Prediction on Imbalanced Datasets: SVM with Smart Sampling

Title Conductor Galloping Prediction on Imbalanced Datasets: SVM with Smart Sampling
Authors Kui Wang, Jian Sun, Chenye Wu, Yang Yu
Abstract Conductor galloping is the high-amplitude, low-frequency oscillation of overhead power lines due to wind. Such movements may lead to severe damages to transmission lines, and hence pose significant risks to the power system operation. In this paper, we target to design a prediction framework for conductor galloping. The difficulty comes from imbalanced dataset as galloping happens rarely. By examining the impacts of data balance and data volume on the prediction performance, we propose to employ proper sample adjustment methods to achieve better performance. Numerical study suggests that using only three features, together with over sampling, the SVM based prediction framework achieves an F_1-score of 98.9%.
Tasks
Published 2019-11-09
URL https://arxiv.org/abs/1911.04467v1
PDF https://arxiv.org/pdf/1911.04467v1.pdf
PWC https://paperswithcode.com/paper/conductor-galloping-prediction-on-imbalanced
Repo
Framework

Transfer Learning in 4D for Breast Cancer Diagnosis using Dynamic Contrast-Enhanced Magnetic Resonance Imaging

Title Transfer Learning in 4D for Breast Cancer Diagnosis using Dynamic Contrast-Enhanced Magnetic Resonance Imaging
Authors Qiyuan Hu, Heather M. Whitney, Maryellen L. Giger
Abstract Deep transfer learning using dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) has shown strong predictive power in characterization of breast lesions. However, pretrained convolutional neural networks (CNNs) require 2D inputs, limiting the ability to exploit the rich 4D (volumetric and temporal) image information inherent in DCE-MRI that is clinically valuable for lesion assessment. Training 3D CNNs from scratch, a common method to utilize high-dimensional information in medical images, is computationally expensive and is not best suited for moderately sized healthcare datasets. Therefore, we propose a novel approach using transfer learning that incorporates the 4D information from DCE-MRI, where volumetric information is collapsed at feature level by max pooling along the projection perpendicular to the transverse slices and the temporal information is contained either in second-post contrast subtraction images. Our methodology yielded an area under the receiver operating characteristic curve of 0.89+/-0.01 on a dataset of 1161 breast lesions, significantly outperforming a previous approach that incorporates the 4D information in DCE-MRI by the use of maximum intensity projection (MIP) images.
Tasks Transfer Learning
Published 2019-11-08
URL https://arxiv.org/abs/1911.03022v1
PDF https://arxiv.org/pdf/1911.03022v1.pdf
PWC https://paperswithcode.com/paper/transfer-learning-in-4d-for-breast-cancer
Repo
Framework

Template co-updating in multi-modal human activity recognition systems

Title Template co-updating in multi-modal human activity recognition systems
Authors Annalisa Franco, Antonio Magnani, Dario Maio
Abstract Multi-modal systems are quite common in the context of human activity recognition; widely used RGB-D sensors (Kinect is the most prominent example) give access to parallel data streams, typically RGB images, depth data, skeleton information. The richness of multimodal information has been largely exploited in many works in the literature, while an analysis of their effectiveness for incremental template updating has not been investigated so far. This paper is aimed at defining a general framework for unsupervised template updating in multi-modal systems, where the different data sources can provide complementary information, increasing the effectiveness of the updating procedure and reducing at the same time the probability of incorrect template modifications.
Tasks Activity Recognition, Human Activity Recognition
Published 2019-12-04
URL https://arxiv.org/abs/1912.02024v1
PDF https://arxiv.org/pdf/1912.02024v1.pdf
PWC https://paperswithcode.com/paper/template-co-updating-in-multi-modal-human
Repo
Framework

Playing Flappy Bird via Asynchronous Advantage Actor Critic Algorithm

Title Playing Flappy Bird via Asynchronous Advantage Actor Critic Algorithm
Authors Elit Cenk Alp, Mehmet Serdar Guzel
Abstract Flappy Bird, which has a very high popularity, has been trained in many algorithms. Some of these studies were trained from raw pixel values of game and some from specific attributes. In this study, the model was trained with raw game images, which had not been seen before. The trained model has learned as reinforcement when to make which decision. As an input to the model, the reward or penalty at the end of each step was returned and the training was completed. Flappy Bird game was trained with the Reinforcement Learning algorithm Deep Q-Network and Asynchronous Advantage Actor Critic (A3C) algorithms.
Tasks
Published 2019-07-06
URL https://arxiv.org/abs/1907.03098v1
PDF https://arxiv.org/pdf/1907.03098v1.pdf
PWC https://paperswithcode.com/paper/playing-flappy-bird-via-asynchronous
Repo
Framework

BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation

Title BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation
Authors Wanhua Li, Jiwen Lu, Jianjiang Feng, Chunjing Xu, Jie Zhou, Qi Tian
Abstract Age estimation is an important yet very challenging problem in computer vision. Existing methods for age estimation usually apply a divide-and-conquer strategy to deal with heterogeneous data caused by the non-stationary aging process. However, the facial aging process is also a continuous process, and the continuity relationship between different components has not been effectively exploited. In this paper, we propose BridgeNet for age estimation, which aims to mine the continuous relation between age labels effectively. The proposed BridgeNet consists of local regressors and gating networks. Local regressors partition the data space into multiple overlapping subspaces to tackle heterogeneous data and gating networks learn continuity aware weights for the results of local regressors by employing the proposed bridge-tree structure, which introduces bridge connections into tree models to enforce the similarity between neighbor nodes. Moreover, these two components of BridgeNet can be jointly learned in an end-to-end way. We show experimental results on the MORPH II, FG-NET and Chalearn LAP 2015 datasets and find that BridgeNet outperforms the state-of-the-art methods.
Tasks Age Estimation
Published 2019-04-06
URL http://arxiv.org/abs/1904.03358v1
PDF http://arxiv.org/pdf/1904.03358v1.pdf
PWC https://paperswithcode.com/paper/bridgenet-a-continuity-aware-probabilistic
Repo
Framework

Automated Surgical Activity Recognition with One Labeled Sequence

Title Automated Surgical Activity Recognition with One Labeled Sequence
Authors Robert DiPietro, Gregory D. Hager
Abstract Prior work has demonstrated the feasibility of automated activity recognition in robot-assisted surgery from motion data. However, these efforts have assumed the availability of a large number of densely-annotated sequences, which must be provided manually by experts. This process is tedious, expensive, and error-prone. In this paper, we present the first analysis under the assumption of scarce annotations, where as little as one annotated sequence is available for training. We demonstrate feasibility of automated recognition in this challenging setting, and we show that learning representations in an unsupervised fashion, before the recognition phase, leads to significant gains in performance. In addition, our paper poses a new challenge to the community: how much further can we push performance in this important yet relatively unexplored regime?
Tasks Activity Recognition
Published 2019-07-20
URL https://arxiv.org/abs/1907.08825v1
PDF https://arxiv.org/pdf/1907.08825v1.pdf
PWC https://paperswithcode.com/paper/automated-surgical-activity-recognition-with
Repo
Framework

Classical and Quantum Algorithms for Tensor Principal Component Analysis

Title Classical and Quantum Algorithms for Tensor Principal Component Analysis
Authors M. B. Hastings
Abstract We present classical and quantum algorithms based on spectral methods for a problem in tensor principal component analysis. The quantum algorithm achieves a quartic speedup while using exponentially smaller space than the fastest classical spectral algorithm, and a super-polynomial speedup over classical algorithms that use only polynomial space. The classical algorithms that we present are related to, but slightly different from those presented recently in Ref. 1. In particular, we have an improved threshold for recovery and the algorithms we present work for both even and odd order tensors. These results suggest that large-scale inference problems are a promising future application for quantum computers.
Tasks
Published 2019-07-30
URL https://arxiv.org/abs/1907.12724v2
PDF https://arxiv.org/pdf/1907.12724v2.pdf
PWC https://paperswithcode.com/paper/classical-and-quantum-algorithms-for-tensor
Repo
Framework

A Recurrent Variational Autoencoder for Speech Enhancement

Title A Recurrent Variational Autoencoder for Speech Enhancement
Authors Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud
Abstract This paper presents a generative approach to speech enhancement based on a recurrent variational autoencoder (RVAE). The deep generative speech model is trained using clean speech signals only, and it is combined with a nonnegative matrix factorization noise model for speech enhancement. We propose a variational expectation-maximization algorithm where the encoder of the RVAE is fine-tuned at test time, to approximate the distribution of the latent variables given the noisy speech observations. Compared with previous approaches based on feed-forward fully-connected architectures, the proposed recurrent deep generative speech model induces a posterior temporal dynamic over the latent variables, which is shown to improve the speech enhancement results.
Tasks Speech Enhancement
Published 2019-10-24
URL https://arxiv.org/abs/1910.10942v2
PDF https://arxiv.org/pdf/1910.10942v2.pdf
PWC https://paperswithcode.com/paper/a-recurrent-variational-autoencoder-for
Repo
Framework
comments powered by Disqus