April 1, 2020

# Paper Group ANR 394

Differentially Private ERM Based on Data Perturbation. Phonetic Feedback for Speech Enhancement With and Without Parallel Speech Data. Vision-Dialog Navigation by Exploring Cross-modal Memory. Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement. Deep learning approach for breast cancer diagnosis. Multifactorial Cellula …

#### Differentially Private ERM Based on Data Perturbation

Title Differentially Private ERM Based on Data Perturbation
Authors Yilin Kang, Yong Liu, Lizhong Ding, Xinwang Liu, Xinyi Tong, Weiping Wang
Abstract In this paper, after observing that different training data instances affect the machine learning model to different extents, we attempt to improve the performance of differentially private empirical risk minimization (DP-ERM) from a new perspective. Specifically, we measure the contributions of various training data instances on the final machine learning model, and select some of them to add random noise. Considering that the key of our method is to measure each data instance separately, we propose a new `Data perturbation’ based (DB) paradigm for DP-ERM: adding random noise to the original training data and achieving ($\epsilon,\delta$)-differential privacy on the final machine learning model, along with the preservation on the original data. By introducing the Influence Function (IF), we quantitatively measure the impact of the training data on the final model. Theoretical and experimental results show that our proposed DBDP-ERM paradigm enhances the model performance significantly. |
Published 2020-02-20
URL https://arxiv.org/abs/2002.08578v1
PDF https://arxiv.org/pdf/2002.08578v1.pdf
PWC https://paperswithcode.com/paper/differentially-private-erm-based-on-data
Repo
Framework

#### Phonetic Feedback for Speech Enhancement With and Without Parallel Speech Data

Title Phonetic Feedback for Speech Enhancement With and Without Parallel Speech Data
Authors Peter Plantinga, Deblin Bagchi, Eric Fosler-Lussier
Abstract While deep learning systems have gained significant ground in speech enhancement research, these systems have yet to make use of the full potential of deep learning systems to provide high-level feedback. In particular, phonetic feedback is rare in speech enhancement research even though it includes valuable top-down information. We use the technique of mimic loss to provide phonetic feedback to an off-the-shelf enhancement system, and find gains in objective intelligibility scores on CHiME-4 data. This technique takes a frozen acoustic model trained on clean speech to provide valuable feedback to the enhancement model, even in the case where no parallel speech data is available. Our work is one of the first to show intelligibility improvement for neural enhancement systems without parallel speech data, and we show phonetic feedback can improve a state-of-the-art neural enhancement system trained with parallel speech data.
Published 2020-03-03
URL https://arxiv.org/abs/2003.01769v1
PDF https://arxiv.org/pdf/2003.01769v1.pdf
PWC https://paperswithcode.com/paper/phonetic-feedback-for-speech-enhancement-with
Repo
Framework

#### Vision-Dialog Navigation by Exploring Cross-modal Memory

Title Vision-Dialog Navigation by Exploring Cross-modal Memory
Authors Yi Zhu, Fengda Zhu, Zhaohuan Zhan, Bingqian Lin, Jianbin Jiao, Xiaojun Chang, Xiaodan Liang
Published 2020-03-15
URL https://arxiv.org/abs/2003.06745v1
PDF https://arxiv.org/pdf/2003.06745v1.pdf
Repo
Framework

#### Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

Title Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement
Authors Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee
Abstract Recent studies have highlighted adversarial examples as ubiquitous threats to the deep neural network (DNN) based speech recognition systems. In this work, we present a U-Net based attention model, U-Net$_{At}$, to enhance adversarial speech signals. Specifically, we evaluate the model performance by interpretable speech recognition metrics and discuss the model performance by the augmented adversarial training. Our experiments show that our proposed U-Net$_{At}$ improves the perceptual evaluation of speech quality (PESQ) from 1.13 to 2.78, speech transmission index (STI) from 0.65 to 0.75, short-term objective intelligibility (STOI) from 0.83 to 0.96 on the task of speech enhancement with adversarial speech examples. We conduct experiments on the automatic speech recognition (ASR) task with adversarial audio attacks. We find that (i) temporal features learned by the attention network are capable of enhancing the robustness of DNN based ASR models; (ii) the generalization power of DNN based ASR model could be enhanced by applying adversarial training with an additive adversarial data augmentation. The ASR metric on word-error-rates (WERs) shows that there is an absolute 2.22 $%$ decrease under gradient-based perturbation, and an absolute 2.03 $%$ decrease, under evolutionary-optimized perturbation, which suggests that our enhancement models with adversarial training can further secure a resilient ASR system.
Tasks Data Augmentation, Speech Enhancement, Speech Recognition
Published 2020-03-31
URL https://arxiv.org/abs/2003.13917v1
PDF https://arxiv.org/pdf/2003.13917v1.pdf
Repo
Framework

#### Deep learning approach for breast cancer diagnosis

Title Deep learning approach for breast cancer diagnosis
Authors Essam A. Rashed, M. Samir Abou El Seoud
Abstract Breast cancer is one of the leading fatal disease worldwide with high risk control if early discovered. Conventional method for breast screening is x-ray mammography, which is known to be challenging for early detection of cancer lesions. The dense breast structure produced due to the compression process during imaging lead to difficulties to recognize small size abnormalities. Also, inter- and intra-variations of breast tissues lead to significant difficulties to achieve high diagnosis accuracy using hand-crafted features. Deep learning is an emerging machine learning technology that requires a relatively high computation power. Yet, it proved to be very effective in several difficult tasks that requires decision making at the level of human intelligence. In this paper, we develop a new network architecture inspired by the U-net structure that can be used for effective and early detection of breast cancer. Results indicate a high rate of sensitivity and specificity that indicate potential usefulness of the proposed approach in clinical use.
Published 2020-03-10
URL https://arxiv.org/abs/2003.04480v1
PDF https://arxiv.org/pdf/2003.04480v1.pdf
PWC https://paperswithcode.com/paper/deep-learning-approach-for-breast-cancer
Repo
Framework

#### Multifactorial Cellular Genetic Algorithm (MFCGA): Algorithmic Design, Performance Comparison and Genetic Transferability Analysis

Title Multifactorial Cellular Genetic Algorithm (MFCGA): Algorithmic Design, Performance Comparison and Genetic Transferability Analysis
Authors Eneko Osaba, Aritz D. Martinez, Jesus L. Lobo, Javier Del Ser, Francisco Herrera
Published 2020-03-24
URL https://arxiv.org/abs/2003.10768v1
PDF https://arxiv.org/pdf/2003.10768v1.pdf
PWC https://paperswithcode.com/paper/multifactorial-cellular-genetic-algorithm
Repo
Framework

#### Flow descriptors of human mobility networks

Title Flow descriptors of human mobility networks
Authors David Pastor-Escuredo, Enrique Frias-Martinez
Abstract Mobile phone data has enabled the timely and fine-grained study human mobility. Call Detail Records, generated at call events, allow building descriptions of mobility at different resolutions and with different spatial, temporal and social granularity. Individual trajectories are the basis for long-term observation of mobility patterns and identify factors of human dynamics. Here we propose a systematic analysis to characterize mobility network flows and topology and assess their impact into individual traces. Discrete flow-based descriptors are used to classify and understand human mobility patterns at multiple scales. This framework is suitable to assess urban planning, optimize transportation, measure the impact of external events and conditions, monitor internal dynamics and profile users according to their movement patterns.
Published 2020-03-16
URL https://arxiv.org/abs/2003.07279v1
PDF https://arxiv.org/pdf/2003.07279v1.pdf
PWC https://paperswithcode.com/paper/flow-descriptors-of-human-mobility-networks
Repo
Framework

#### EvoNet: A Neural Network for Predicting the Evolution of Dynamic Graphs

Title EvoNet: A Neural Network for Predicting the Evolution of Dynamic Graphs
Authors Changmin Wu, Giannis Nikolentzos, Michalis Vazirgiannis
Abstract Neural networks for structured data like graphs have been studied extensively in recent years. To date, the bulk of research activity has focused mainly on static graphs. However, most real-world networks are dynamic since their topology tends to change over time. Predicting the evolution of dynamic graphs is a task of high significance in the area of graph mining. Despite its practical importance, the task has not been explored in depth so far, mainly due to its challenging nature. In this paper, we propose a model that predicts the evolution of dynamic graphs. Specifically, we use a graph neural network along with a recurrent architecture to capture the temporal evolution patterns of dynamic graphs. Then, we employ a generative model which predicts the topology of the graph at the next time step and constructs a graph instance that corresponds to that topology. We evaluate the proposed model on several artificial datasets following common network evolving dynamics, as well as on real-world datasets. Results demonstrate the effectiveness of the proposed model.
Published 2020-03-02
URL https://arxiv.org/abs/2003.00842v1
PDF https://arxiv.org/pdf/2003.00842v1.pdf
PWC https://paperswithcode.com/paper/evonet-a-neural-network-for-predicting-the-1
Repo
Framework

#### Random Matrix Theory Proves that Deep Learning Representations of GAN-data Behave as Gaussian Mixtures

Title Random Matrix Theory Proves that Deep Learning Representations of GAN-data Behave as Gaussian Mixtures
Authors Mohamed El Amine Seddik, Cosme Louart, Mohamed Tamaazousti, Romain Couillet
Abstract This paper shows that deep learning (DL) representations of data produced by generative adversarial nets (GANs) are random vectors which fall within the class of so-called \textit{concentrated} random vectors. Further exploiting the fact that Gram matrices, of the type $G = X^T X$ with $X=[x_1,\ldots,x_n]\in \mathbb{R}^{p\times n}$ and $x_i$ independent concentrated random vectors from a mixture model, behave asymptotically (as $n,p\to \infty$) as if the $x_i$ were drawn from a Gaussian mixture, suggests that DL representations of GAN-data can be fully described by their first two statistical moments for a wide range of standard classifiers. Our theoretical findings are validated by generating images with the BigGAN model and across different popular deep representation networks.
Published 2020-01-21
URL https://arxiv.org/abs/2001.08370v1
PDF https://arxiv.org/pdf/2001.08370v1.pdf
PWC https://paperswithcode.com/paper/random-matrix-theory-proves-that-deep-1
Repo
Framework

#### Tangent-Space Gradient Optimization of Tensor Network for Machine Learning

Title Tangent-Space Gradient Optimization of Tensor Network for Machine Learning
Authors Zheng-zhi Sun, Shi-ju Ran, Gang Su
Abstract The gradient-based optimization method for deep machine learning models suffers from gradient vanishing and exploding problems, particularly when the computational graph becomes deep. In this work, we propose the tangent-space gradient optimization (TSGO) for the probabilistic models to keep the gradients from vanishing or exploding. The central idea is to guarantee the orthogonality between the variational parameters and the gradients. The optimization is then implemented by rotating parameter vector towards the direction of gradient. We explain and testify TSGO in tensor network (TN) machine learning, where the TN describes the joint probability distribution as a normalized state $\left \psi \right\rangle$ in Hilbert space. We show that the gradient can be restricted in the tangent space of $\left\langle \psi \right.\left \psi \right\rangle = 1$ hyper-sphere. Instead of additional adaptive methods to control the learning rate in deep learning, the learning rate of TSGO is naturally determined by the angle $\theta$ as $\eta = \tan \theta$. Our numerical results reveal better convergence of TSGO in comparison to the off-the-shelf Adam.
Published 2020-01-10
URL https://arxiv.org/abs/2001.04029v1
PDF https://arxiv.org/pdf/2001.04029v1.pdf
Repo
Framework

#### Gravitational Wave Detection and Information Extraction via Neural Networks

Title Gravitational Wave Detection and Information Extraction via Neural Networks
Authors Gerson R. Santos, Marcela P. Figueiredo, Antonio de Pádua Santos, Pavlos Protopapas, Tiago A. E. Ferreira
Abstract Laser Interferometer Gravitational-Wave Observatory (LIGO) was the first laboratory to measure the gravitational waves. It was needed an exceptional experimental design to measure distance changes much less than a radius of a proton. In the same way, the data analyses to confirm and extract information is a tremendously hard task. Here, it is shown a computational procedure base on artificial neural networks to detect a gravitation wave event and extract the knowledge of its ring-down time from the LIGO data. With this proposal, it is possible to make a probabilistic thermometer for gravitational wave detection and obtain physical information about the astronomical body system that created the phenomenon. Here, the ring-down time is determined with a direct data measure, without the need to use numerical relativity techniques and high computational power.
Published 2020-03-22
URL https://arxiv.org/abs/2003.09995v1
PDF https://arxiv.org/pdf/2003.09995v1.pdf
PWC https://paperswithcode.com/paper/gravitational-wave-detection-and-information
Repo
Framework

Authors Guanxiong Liu, Issa Khalil, Abdallah Khreishah
Published 2020-02-22
URL https://arxiv.org/abs/2002.09632v2
PDF https://arxiv.org/pdf/2002.09632v2.pdf
Repo
Framework

#### Time-Frequency Analysis based Blind Modulation Classification for Multiple-Antenna Systems

Title Time-Frequency Analysis based Blind Modulation Classification for Multiple-Antenna Systems
Authors Weiheng Jiang, Xiaogang Wu, Bolin Chen, Wenjiang Feng, Yi Jin
Abstract Blind modulation classification is an important step to implement cognitive radio networks. The multiple-input multiple-output (MIMO) technique is widely used in military and civil communication systems. Due to the lack of prior information about channel parameters and the overlapping of signals in the MIMO systems, the traditional likelihood-based and feature-based approaches cannot be applied in these scenarios directly. Hence, in this paper, to resolve the problem of blind modulation classification in MIMO systems, the time-frequency analysis method based on the windowed short-time Fourier transform is used to analyse the time-frequency characteristics of time-domain modulated signals. Then the extracted time-frequency characteristics are converted into RGB spectrogram images, and the convolutional neural network based on transfer learning is applied to classify the modulation types according to the RGB spectrogram images. Finally, a decision fusion module is used to fuse the classification results of all the receive antennas. Through simulations, we analyse the classification performance at different signal-to-noise ratios (SNRs), the results indicate that, for the single-input single-output (SISO) network, our proposed scheme can achieve 92.37% and 99.12% average classification accuracy at SNRs of -4 dB and 10 dB, respectively. For the MIMO network, our scheme achieves 80.42% and 87.92% average classification accuracy at -4 dB and 10 dB, respectively. This outperforms the existing classification methods based on baseband signals.
Published 2020-04-01
URL https://arxiv.org/abs/2004.00378v1
PDF https://arxiv.org/pdf/2004.00378v1.pdf
PWC https://paperswithcode.com/paper/time-frequency-analysis-based-blind
Repo
Framework

#### Diagnosing COVID-19 Pneumonia from X-Ray and CT Images using Deep Learning and Transfer Learning Algorithms

Title Diagnosing COVID-19 Pneumonia from X-Ray and CT Images using Deep Learning and Transfer Learning Algorithms
Abstract COVID-19 (also known as 2019 Novel Coronavirus) first emerged in Wuhan, China and spread across the globe with unprecedented effect and has now become the greatest crisis of the modern era. The COVID-19 has proved much more pervasive demands for diagnosis that has driven researchers to develop more intelligent, highly responsive and efficient detection methods. In this work, we focus on proposing AI tools that can be used by radiologists or healthcare professionals to diagnose COVID-19 cases in a quick and accurate manner. However, the lack of a publicly available dataset of X-ray and CT images makes the design of such AI tools a challenging task. To this end, this study aims to build a comprehensive dataset of X-rays and CT scan images from multiple sources as well as provides a simple but an effective COVID-19 detection technique using deep learning and transfer learning algorithms. In this vein, a simple convolution neural network (CNN) and modified pre-trained AlexNet model are applied on the prepared X-rays and CT scan images dataset. The result of the experiments shows that the utilized models can provide accuracy up to 98 % via pre-trained network and 94.1 % accuracy by using the modified CNN.
Published 2020-03-31
URL https://arxiv.org/abs/2004.00038v1
PDF https://arxiv.org/pdf/2004.00038v1.pdf
PWC https://paperswithcode.com/paper/diagnosing-covid-19-pneumonia-from-x-ray-and
Repo
Framework

#### Transfer Learning of Photometric Phenotypes in Agriculture Using Metadata

Title Transfer Learning of Photometric Phenotypes in Agriculture Using Metadata
Authors Dan Halbersberg, Aharon Bar Hillel, Shon Mendelson, Daniel Koster, Lena Karol, Boaz Lerner
Abstract Estimation of photometric plant phenotypes (e.g., hue, shine, chroma) in field conditions is important for decisions on the expected yield quality, fruit ripeness, and need for further breeding. Estimating these from images is difficult due to large variances in lighting conditions, shadows, and sensor properties. We combine the image and metadata regarding capturing conditions embedded into a network, enabling more accurate estimation and transfer between different conditions. Compared to a state-of-the-art deep CNN and a human expert, metadata embedding improves the estimation of the tomato’s hue and chroma.