January 26, 2020

3083 words 15 mins read

Paper Group ANR 1467

Paper Group ANR 1467

Meta-GNN: On Few-shot Node Classification in Graph Meta-learning. Beyond the EM Algorithm: Constrained Optimization Methods for Latent Class Model. Optimal Nonparametric Inference under Quantization. Improved Safe Real-time Heuristic Search. Depthwise Convolution is All You Need for Learning Multiple Visual Domains. Monitoring Achilles tendon heali …

Meta-GNN: On Few-shot Node Classification in Graph Meta-learning

Title Meta-GNN: On Few-shot Node Classification in Graph Meta-learning
Authors Fan Zhou, Chengtai Cao, Kunpeng Zhang, Goce Trajcevski, Ting Zhong, Ji Geng
Abstract Meta-learning has received a tremendous recent attention as a possible approach for mimicking human intelligence, i.e., acquiring new knowledge and skills with little or even no demonstration. Most of the existing meta-learning methods are proposed to tackle few-shot learning problems such as image and text, in rather Euclidean domain. However, there are very few works applying meta-learning to non-Euclidean domains, and the recently proposed graph neural networks (GNNs) models do not perform effectively on graph few-shot learning problems. Towards this, we propose a novel graph meta-learning framework – Meta-GNN – to tackle the few-shot node classification problem in graph meta-learning settings. It obtains the prior knowledge of classifiers by training on many similar few-shot learning tasks and then classifies the nodes from new classes with only few labeled samples. Additionally, Meta-GNN is a general model that can be straightforwardly incorporated into any existing state-of-the-art GNN. Our experiments conducted on three benchmark datasets demonstrate that our proposed approach not only improves the node classification performance by a large margin on few-shot learning problems in meta-learning paradigm, but also learns a more general and flexible model for task adaption.
Tasks Few-Shot Learning, Meta-Learning, Node Classification
Published 2019-05-23
URL https://arxiv.org/abs/1905.09718v1
PDF https://arxiv.org/pdf/1905.09718v1.pdf
PWC https://paperswithcode.com/paper/meta-gnn-on-few-shot-node-classification-in
Repo
Framework

Beyond the EM Algorithm: Constrained Optimization Methods for Latent Class Model

Title Beyond the EM Algorithm: Constrained Optimization Methods for Latent Class Model
Authors Hao Chen, Lanshan Han, Alvin Lim
Abstract Latent class model (LCM), which is a finite mixture of different categorical distributions, is one of the most widely used models in statistics and machine learning fields. Because of its non-continuous nature and the flexibility in shape, researchers in practice areas such as marketing and social sciences also frequently use LCM to gain insights from their data. One likelihood-based method, the Expectation-Maximization (EM) algorithm, is often used to obtain the model estimators. However, the EM algorithm is well-known for its notoriously slow convergence. In this research, we explore alternative likelihood-based methods that can potential remedy the slow convergence of the EM algorithm. More specifically, we regard likelihood-based approach as a constrained nonlinear optimization problem, and apply quasi-Newton type methods to solve them. We examine two different constrained optimization methods to maximize the log likelihood function. We present simulation study results to show that the proposed methods not only converge in less iterations than the EM algorithm but also produce more accurate model estimators.
Tasks
Published 2019-01-09
URL https://arxiv.org/abs/1901.02928v3
PDF https://arxiv.org/pdf/1901.02928v3.pdf
PWC https://paperswithcode.com/paper/beyond-the-em-algorithm-constrained
Repo
Framework

Optimal Nonparametric Inference under Quantization

Title Optimal Nonparametric Inference under Quantization
Authors Ruiqi Liu, Ganggang Xu, Zuofeng Shang
Abstract Statistical inference based on lossy or incomplete samples is of fundamental importance in research areas such as signal/image processing, medical image storage, remote sensing, signal transmission. In this paper, we propose a nonparametric testing procedure based on quantized samples. In contrast to the classic nonparametric approach, our method lives on a coarse grid of sample information and are simple-to-use. Under mild technical conditions, we establish the asymptotic properties of the proposed procedures including asymptotic null distribution of the quantization test statistic as well as its minimax power optimality. Concrete quantizers are constructed for achieving the minimax optimality in practical use. Simulation results and a real data analysis are provided to demonstrate the validity and effectiveness of the proposed test. Our work bridges the classical nonparametric inference to modern lossy data setting.
Tasks Quantization
Published 2019-01-24
URL http://arxiv.org/abs/1901.08571v2
PDF http://arxiv.org/pdf/1901.08571v2.pdf
PWC https://paperswithcode.com/paper/optimal-nonparametric-inference-under
Repo
Framework
Title Improved Safe Real-time Heuristic Search
Authors Bence Cserna, Kevin C. Gall, Wheeler Ruml
Abstract A fundamental concern in real-time planning is the presence of dead-ends in the state space, from which no goal is reachable. Recently, the SafeRTS algorithm was proposed for searching in such spaces. SafeRTS exploits a user-provided predicate to identify safe states, from which a goal is likely reachable, and attempts to maintain a backup plan for reaching a safe state at all times. In this paper, we study the SafeRTS approach, identify certain properties of its behavior, and design an improved framework for safe real-time search. We prove that the new approach performs at least as well as SafeRTS and present experimental results showing that its promise is fulfilled in practice.
Tasks
Published 2019-05-15
URL https://arxiv.org/abs/1905.06402v1
PDF https://arxiv.org/pdf/1905.06402v1.pdf
PWC https://paperswithcode.com/paper/improved-safe-real-time-heuristic-search
Repo
Framework

Depthwise Convolution is All You Need for Learning Multiple Visual Domains

Title Depthwise Convolution is All You Need for Learning Multiple Visual Domains
Authors Yunhui Guo, Yandong Li, Rogerio Feris, Liqiang Wang, Tajana Rosing
Abstract There is a growing interest in designing models that can deal with images from different visual domains. If there exists a universal structure in different visual domains that can be captured via a common parameterization, then we can use a single model for all domains rather than one model per domain. A model aware of the relationships between different domains can also be trained to work on new domains with less resources. However, to identify the reusable structure in a model is not easy. In this paper, we propose a multi-domain learning architecture based on depthwise separable convolution. The proposed approach is based on the assumption that images from different domains share cross-channel correlations but have domain-specific spatial correlations. The proposed model is compact and has minimal overhead when being applied to new domains. Additionally, we introduce a gating mechanism to promote soft sharing between different domains. We evaluate our approach on Visual Decathlon Challenge, a benchmark for testing the ability of multi-domain models. The experiments show that our approach can achieve the highest score while only requiring 50% of the parameters compared with the state-of-the-art approaches.
Tasks
Published 2019-02-03
URL http://arxiv.org/abs/1902.00927v2
PDF http://arxiv.org/pdf/1902.00927v2.pdf
PWC https://paperswithcode.com/paper/depthwise-convolution-is-all-you-need-for
Repo
Framework

Monitoring Achilles tendon healing progress in ultrasound imaging with convolutional neural networks

Title Monitoring Achilles tendon healing progress in ultrasound imaging with convolutional neural networks
Authors Piotr Woznicki, Przemyslaw Przybyszewski, Norbert Kapinski, Jakub Zielinski, Beata Ciszkowska-Lyson, Bartosz A. Borucki, Tomasz Trzcinski, Krzysztof S. Nowinski
Abstract Achilles tendon rupture is a debilitating injury, which is typically treated with surgical repair and long-term rehabilitation. The recovery, however, is protracted and often incomplete. Diagnosis, as well as healing progress assessment, are largely based on ultrasound and magnetic resonance imaging. In this paper, we propose an automatic method based on deep learning for analysis of Achilles tendon condition and estimation of its healing progress on ultrasound images. We develop custom convolutional neural networks for classification and regression on healing score and feature extraction. Our models are trained and validated on an acquired dataset of over 250.000 sagittal and over 450.000 axial ultrasound slices. The obtained estimates show a high correlation with the assessment of expert radiologists, with respect to all key parameters describing healing progress. We also observe that parameters associated with i.a. intratendinous healing processes are better modeled with sagittal slices. We prove that ultrasound imaging is quantitatively useful for clinical assessment of Achilles tendon healing process and should be viewed as complementary to magnetic resonance imaging.
Tasks
Published 2019-09-11
URL https://arxiv.org/abs/1909.04973v1
PDF https://arxiv.org/pdf/1909.04973v1.pdf
PWC https://paperswithcode.com/paper/monitoring-achilles-tendon-healing-progress
Repo
Framework

Face X-ray for More General Face Forgery Detection

Title Face X-ray for More General Face Forgery Detection
Authors Lingzhi Li, Jianmin Bao, Ting Zhang, Hao Yang, Dong Chen, Fang Wen, Baining Guo
Abstract In this paper we propose a novel image representation called face X-ray for detecting forgery in face images. The face X-ray of an input face image is a greyscale image that reveals whether the input image can be decomposed into the blending of two images from different sources. It does so by showing the blending boundary for a forged image and the absence of blending for a real image. We observe that most existing face manipulation methods share a common step: blending the altered face into an existing background image. For this reason, face X-ray provides an effective way for detecting forgery generated by most existing face manipulation algorithms. Face X-ray is general in the sense that it only assumes the existence of a blending step and does not rely on any knowledge of the artifacts associated with a specific face manipulation technique. Indeed, the algorithm for computing face X-ray can be trained without fake images generated by any of the state-of-the-art face manipulation methods. Extensive experiments show that face X-ray remains effective when applied to forgery generated by unseen face manipulation techniques, while most existing face forgery detection algorithms experience a significant performance drop.
Tasks
Published 2019-12-31
URL https://arxiv.org/abs/1912.13458v1
PDF https://arxiv.org/pdf/1912.13458v1.pdf
PWC https://paperswithcode.com/paper/face-x-ray-for-more-general-face-forgery
Repo
Framework

Cultural Endowment as Collective Improvisation: subjectivity and digital infinity

Title Cultural Endowment as Collective Improvisation: subjectivity and digital infinity
Authors Victor Peterson II
Abstract Philosophically, a repertoire of signifying practices as constitutive of a cultural endowment was said to be ambiguous or unworthy of pursuit. Currently, a unique capacity of the mind is considered to be its ability to produce a digital infinity. The infinity produced, here an operation expressing subjectivity, follows a simple principle according to which a limited set of means, here functions, are utilized to produce an infinite range of potentially meaningful expressions. It is from this concept that I propose a theory of subjectivity and the endowment from which it expresses a self in the world(s) it participates. In particular, I make a case for the subjectivity of blackness. I treat subjectivity as an operation in order to address problems with Identity theory, Afro-Pessimism, and to formalize an analysis of blackness despite the onto-epistemological commitments of racialized systems of categorization. In sum, subjectivity will be characterized as poetic computation.
Tasks
Published 2019-07-19
URL https://arxiv.org/abs/1907.08643v1
PDF https://arxiv.org/pdf/1907.08643v1.pdf
PWC https://paperswithcode.com/paper/cultural-endowment-as-collective
Repo
Framework

Stochastic Optimal Control as Approximate Input Inference

Title Stochastic Optimal Control as Approximate Input Inference
Authors Joe Watson, Hany Abdulsamad, Jan Peters
Abstract Optimal control of stochastic nonlinear dynamical systems is a major challenge in the domain of robot learning. Given the intractability of the global control problem, state-of-the-art algorithms focus on approximate sequential optimization techniques, that heavily rely on heuristics for regularization in order to achieve stable convergence. By building upon the duality between inference and control, we develop the view of Optimal Control as Input Estimation, devising a probabilistic stochastic optimal control formulation that iteratively infers the optimal input distributions by minimizing an upper bound of the control cost. Inference is performed through Expectation Maximization and message passing on a probabilistic graphical model of the dynamical system, and time-varying linear Gaussian feedback controllers are extracted from the joint state-action distribution. This perspective incorporates uncertainty quantification, effective initialization through priors, and the principled regularization inherent to the Bayesian treatment. Moreover, it can be shown that for deterministic linearized systems, our framework derives the maximum entropy linear quadratic optimal control law. We provide a complete and detailed derivation of our probabilistic approach and highlight its advantages in comparison to other deterministic and probabilistic solvers.
Tasks
Published 2019-10-07
URL https://arxiv.org/abs/1910.03003v1
PDF https://arxiv.org/pdf/1910.03003v1.pdf
PWC https://paperswithcode.com/paper/stochastic-optimal-control-as-approximate
Repo
Framework

Phase mapping for cardiac unipolar electrograms with neural network instead of phase transformation

Title Phase mapping for cardiac unipolar electrograms with neural network instead of phase transformation
Authors Konstantin Ushenin, Tatyana Nesterova, Dmitry Shmarko, Vladimir Sholokhov
Abstract A phase mapping is an approach to processing signals of electrograms recorded from the surface of cardiac tissue. The main concept of phase mapping is the application of the phase transformation with the aim to obtain signals with useful properties. In our study, we propose to use a simple sawtooth signal instead of a phase signal for processing of electrogram data and building of the phase maps. We denote transformation that can provide this signal as a phase-like transformation (PLT). PLT defined via a convolutional neural network that is trained on a dataset from computer models of cardiac tissue electrophysiology. The proposed approaches were validated on data from the detailed personalized model of the human torso electrophysiology. This paper includes visualization of the phase map based on PLT and shows the robustness of the proposed approaches in the analysis of the complex non-stationary periodic activity of the excitable cardiac tissue.
Tasks
Published 2019-11-21
URL https://arxiv.org/abs/1911.09731v2
PDF https://arxiv.org/pdf/1911.09731v2.pdf
PWC https://paperswithcode.com/paper/phase-mapping-for-cardiac-unipolar
Repo
Framework

A Dynamically Controlled Recurrent Neural Network for Modeling Dynamical Systems

Title A Dynamically Controlled Recurrent Neural Network for Modeling Dynamical Systems
Authors Yiwei Fu, Samer Saab Jr, Asok Ray, Michael Hauser
Abstract This work proposes a novel neural network architecture, called the Dynamically Controlled Recurrent Neural Network (DCRNN), specifically designed to model dynamical systems that are governed by ordinary differential equations (ODEs). The current state vectors of these types of dynamical systems only depend on their state-space models, along with the respective inputs and initial conditions. Long Short-Term Memory (LSTM) networks, which have proven to be very effective for memory-based tasks, may fail to model physical processes as they tend to memorize, rather than learn how to capture the information on the underlying dynamics. The proposed DCRNN includes learnable skip-connections across previously hidden states, and introduces a regularization term in the loss function by relying on Lyapunov stability theory. The regularizer enables the placement of eigenvalues of the transfer function induced by the DCRNN to desired values, thereby acting as an internal controller for the hidden state trajectory. The results show that, for forecasting a chaotic dynamical system, the DCRNN outperforms the LSTM in $100$ out of $100$ randomized experiments by reducing the mean squared error of the LSTM’s forecasting by $80.0% \pm 3.0%$.
Tasks
Published 2019-10-31
URL https://arxiv.org/abs/1911.00089v1
PDF https://arxiv.org/pdf/1911.00089v1.pdf
PWC https://paperswithcode.com/paper/a-dynamically-controlled-recurrent-neural
Repo
Framework

Allowing for equal opportunities for artists in music recommendation

Title Allowing for equal opportunities for artists in music recommendation
Authors Christine Bauer
Abstract Promoting diversity in the music sector is widely discussed on the media. While the major problem may lie deep in our society, music information retrieval contributes to promoting diversity or may create unequal opportunities for artists. For example, considering the known problem of popularity bias in music recommendation, it is important to investigate whether the short head of popular music artists and the long tail of less popular ones show similar patterns of diversity—in terms of, for example, age, gender, or ethnic origin—or the popularity bias amplifies a positive or negative effect. I advocate for reasonable opportunities for artists—for (currently) popular artists and artists in the long-tail alike—in music recommender systems. In this work, I represent the position that we need to develop a deep understanding of the biases and inequalities because it is the essential basis to design approaches for music recommendation that provide reasonable opportunities. Thus, research needs to investigate the various reasons that hinder equal opportunity and diversity in music recommendation.
Tasks Information Retrieval, Music Information Retrieval, Recommendation Systems
Published 2019-11-13
URL https://arxiv.org/abs/1911.05395v1
PDF https://arxiv.org/pdf/1911.05395v1.pdf
PWC https://paperswithcode.com/paper/allowing-for-equal-opportunities-for-artists
Repo
Framework

A quantum active learning algorithm for sampling against adversarial attacks

Title A quantum active learning algorithm for sampling against adversarial attacks
Authors P. A. M. Casares, M. A. Martin-Delgado
Abstract Adversarial attacks represent a serious menace for learning algorithms and may compromise the security of future autonomous systems. A theorem by Khoury and Hadfield-Menell (KH), provides sufficient conditions to guarantee the robustness of active learning algorithms, but comes with a caveat: it is crucial to know the smallest distance among the classes of the corresponding classification problem. We propose a theoretical framework that allows us to think of active learning as sampling the most promising new points to be classified, so that the minimum distance between classes can be found and the theorem KH used. The complexity of the quantum active learning algorithm is polynomial in the variables used, like the dimension of the space $m$ and the size of the initial training data $n$. On the other hand, if one replicates this approach with a classical computer, we expect that it would take exponential time in $m$, an example of the so-called `curse of dimensionality’. |
Tasks Active Learning
Published 2019-12-06
URL https://arxiv.org/abs/1912.03283v1
PDF https://arxiv.org/pdf/1912.03283v1.pdf
PWC https://paperswithcode.com/paper/a-quantum-active-learning-algorithm-for
Repo
Framework

A unified framework of epidemic spreading prediction by empirical mode decomposition based ensemble learning techniques

Title A unified framework of epidemic spreading prediction by empirical mode decomposition based ensemble learning techniques
Authors Yun Feng, Bing-Chuan Wang
Abstract In this paper, a unified susceptible-exposed-infected-susceptible-aware (SEIS-A) framework is proposed to combine epidemic spreading with individuals’ on-line self-consultation behaviors. An epidemic spreading prediction model is established based on the SEIS-A framework. The prediction process contains two phases. In phase I, the time series data of disease density are decomposed through the empirical mode decomposition (EMD) method to obtain the intrinsic mode functions (IMFs). In phase II, the ensemble learning techniques which use the on-line query data as an additional input are applied to these IMFs. Finally, experiments for prediction of weekly consultation rates of Hand-foot-and-mouth disease (HFMD) in Hong Kong are conducted to validate the effectiveness of the proposed method. The main advantage of this method is that it outperforms other methods on fluctuating complex data.
Tasks Time Series
Published 2019-01-03
URL http://arxiv.org/abs/1901.01144v2
PDF http://arxiv.org/pdf/1901.01144v2.pdf
PWC https://paperswithcode.com/paper/a-unified-framework-of-epidemic-spreading
Repo
Framework

Compact Autoregressive Network

Title Compact Autoregressive Network
Authors Di Wang, Feiqing Huang, Jingyu Zhao, Guodong Li, Guangjian Tian
Abstract Autoregressive networks can achieve promising performance in many sequence modeling tasks with short-range dependence. However, when handling high-dimensional inputs and outputs, the huge amount of parameters in the network lead to expensive computational cost and low learning efficiency. The problem can be alleviated slightly by introducing one more narrow hidden layer to the network, but the sample size required to achieve a certain training error is still large. To address this challenge, we rearrange the weight matrices of a linear autoregressive network into a tensor form, and then make use of Tucker decomposition to represent low-rank structures. This leads to a novel compact autoregressive network, called Tucker AutoRegressive (TAR) net. Interestingly, the TAR net can be applied to sequences with long-range dependence since the dimension along the sequential order is reduced. Theoretical studies show that the TAR net improves the learning efficiency, and requires much fewer samples for model training. Experiments on synthetic and real-world datasets demonstrate the promising performance of the proposed compact network.
Tasks
Published 2019-09-06
URL https://arxiv.org/abs/1909.03830v1
PDF https://arxiv.org/pdf/1909.03830v1.pdf
PWC https://paperswithcode.com/paper/compact-autoregressive-network
Repo
Framework
comments powered by Disqus