October 17, 2019

3258 words 16 mins read

Paper Group ANR 832

Paper Group ANR 832

Joint Demosaicing and Denoising with Perceptual Optimization on a Generative Adversarial Network. Physics-aware Deep Generative Models for Creating Synthetic Microstructures. Privacy-preserving classifiers recognize shared mobility behaviours from WiFi network imperfect data. NLP-assisted software testing: A systematic mapping of the literature. CA …

Joint Demosaicing and Denoising with Perceptual Optimization on a Generative Adversarial Network

Title Joint Demosaicing and Denoising with Perceptual Optimization on a Generative Adversarial Network
Authors Weishong Dong, Ming Yuan, Xin Li, Guangming Shi
Abstract Image demosaicing - one of the most important early stages in digital camera pipelines - addressed the problem of reconstructing a full-resolution image from so-called color-filter-arrays. Despite tremendous progress made in the pase decade, a fundamental issue that remains to be addressed is how to assure the visual quality of reconstructed images especially in the presence of noise corruption. Inspired by recent advances in generative adversarial networks (GAN), we present a novel deep learning approach toward joint demosaicing and denoising (JDD) with perceptual optimization in order to ensure the visual quality of reconstructed images. The key contributions of this work include: 1) we have developed a GAN-based approach toward image demosacing in which a discriminator network with both perceptual and adversarial loss functions are used for quality assurance; 2) we propose to optimize the perceptual quality of reconstructed images by the proposed GAN in an end-to-end manner. Such end-to-end optimization of GAN is particularly effective for jointly exploiting the gain brought by each modular component (e.g., residue learning in the generative network and perceptual loss in the discriminator network). Our extensive experimental results have shown convincingly improved performance over existing state-of-the-art methods in terms of both subjective and objective quality metrics with a comparable computational cost.
Tasks Demosaicking, Denoising
Published 2018-02-13
URL http://arxiv.org/abs/1802.04723v1
PDF http://arxiv.org/pdf/1802.04723v1.pdf
PWC https://paperswithcode.com/paper/joint-demosaicing-and-denoising-with
Repo
Framework

Physics-aware Deep Generative Models for Creating Synthetic Microstructures

Title Physics-aware Deep Generative Models for Creating Synthetic Microstructures
Authors Rahul Singh, Viraj Shah, Balaji Pokuri, Soumik Sarkar, Baskar Ganapathysubramanian, Chinmay Hegde
Abstract A key problem in computational material science deals with understanding the effect of material distribution (i.e., microstructure) on material performance. The challenge is to synthesize microstructures, given a finite number of microstructure images, and/or some physical invariances that the microstructure exhibits. Conventional approaches are based on stochastic optimization and are computationally intensive. We introduce three generative models for the fast synthesis of binary microstructure images. The first model is a WGAN model that uses a finite number of training images to synthesize new microstructures that weakly satisfy the physical invariances respected by the original data. The second model explicitly enforces known physical invariances by replacing the traditional discriminator in a GAN with an invariance checker. Our third model combines the first two models to reconstruct microstructures that respect both explicit physics invariances as well as implicit constraints learned from the image data. We illustrate these models by reconstructing two-phase microstructures that exhibit coarsening behavior. The trained models also exhibit interesting latent variable interpolation behavior, and the results indicate considerable promise for enforcing user-defined physics constraints during microstructure synthesis.
Tasks Stochastic Optimization
Published 2018-11-21
URL http://arxiv.org/abs/1811.09669v1
PDF http://arxiv.org/pdf/1811.09669v1.pdf
PWC https://paperswithcode.com/paper/physics-aware-deep-generative-models-for
Repo
Framework

Privacy-preserving classifiers recognize shared mobility behaviours from WiFi network imperfect data

Title Privacy-preserving classifiers recognize shared mobility behaviours from WiFi network imperfect data
Authors Orestes Manzanilla-Salazar, Brunilde Sansò
Abstract This paper proves the concept that it is feasible to accurately recognize specific human mobility shared patterns, based solely on the connection logs between portable devices and WiFi Access Points (APs), while preserving user’s privacy. We gathered data from the Eduroam WiFi network of Polytechnique Montreal, making omission of device tracking or physical layer data. The behaviors we chose to detect were the movements associated to the end of an academic class, and the patterns related to the small break periods between classes. Stringent conditions were self-imposed in our experiments. The data is known to have errors noise, and be susceptible to information loss. No countermeasures were adopted to mitigate any of these issues. Data pre-processing consists of basic statistics that were used in aggregating the data in time intervals. We obtained accuracy values of 93.7 % and 83.3 % (via Bagged Trees) when recognizing behaviour patterns of breaks between classes and end-of-classes, respectively.
Tasks
Published 2018-07-17
URL http://arxiv.org/abs/1807.06190v2
PDF http://arxiv.org/pdf/1807.06190v2.pdf
PWC https://paperswithcode.com/paper/privacy-preserving-classifiers-recognize
Repo
Framework

NLP-assisted software testing: A systematic mapping of the literature

Title NLP-assisted software testing: A systematic mapping of the literature
Authors Vahid Garousi, Sara Bauer, Michael Felderer
Abstract Context: To reduce manual effort of extracting test cases from natural-language requirements, many approaches based on Natural Language Processing (NLP) have been proposed in the literature. Given the large amount of approaches in this area, and since many practitioners are eager to utilize such techniques, it is important to synthesize and provide an overview of the state-of-the-art in this area. Objective: Our objective is to summarize the state-of-the-art in NLP-assisted software testing which could benefit practitioners to potentially utilize those NLP-based techniques. Moreover, this can benefit researchers in providing an overview of the research landscape. Method: To address the above need, we conducted a survey in the form of a systematic literature mapping (classification). After compiling an initial pool of 95 papers, we conducted a systematic voting, and our final pool included 67 technical papers. Results: This review paper provides an overview of the contribution types presented in the papers, types of NLP approaches used to assist software testing, types of required input requirements, and a review of tool support in this area. Some key results we have detected are: (1) only four of the 38 tools (11%) presented in the papers are available for download; (2) a larger ratio of the papers (30 of 67) provided a shallow exposure to the NLP aspects (almost no details). Conclusion: This paper would benefit both practitioners and researchers by serving as an “index” to the body of knowledge in this area. The results could help practitioners utilizing the existing NLP-based techniques; this in turn reduces the cost of test-case design and decreases the amount of human resources spent on test activities. After sharing this review with some of our industrial collaborators, initial insights show that this review can indeed be useful and beneficial to practitioners.
Tasks
Published 2018-06-02
URL https://arxiv.org/abs/1806.00696v3
PDF https://arxiv.org/pdf/1806.00696v3.pdf
PWC https://paperswithcode.com/paper/nlp-assisted-software-testing-a-systematic
Repo
Framework

CAKE: Compact and Accurate K-dimensional representation of Emotion

Title CAKE: Compact and Accurate K-dimensional representation of Emotion
Authors Corentin Kervadec, Valentin Vielzeuf, Stéphane Pateux, Alexis Lechervy, Frédéric Jurie
Abstract Numerous models describing the human emotional states have been built by the psychology community. Alongside, Deep Neural Networks (DNN) are reaching excellent performances and are becoming interesting features extraction tools in many computer vision tasks.Inspired by works from the psychology community, we first study the link between the compact two-dimensional representation of the emotion known as arousal-valence, and discrete emotion classes (e.g. anger, happiness, sadness, etc.) used in the computer vision community. It enables to assess the benefits – in terms of discrete emotion inference – of adding an extra dimension to arousal-valence (usually named dominance). Building on these observations, we propose CAKE, a 3-dimensional representation of emotion learned in a multi-domain fashion, achieving accurate emotion recognition on several public datasets. Moreover, we visualize how emotions boundaries are organized inside DNN representations and show that DNNs are implicitly learning arousal-valence-like descriptions of emotions. Finally, we use the CAKE representation to compare the quality of the annotations of different public datasets.
Tasks Emotion Recognition
Published 2018-07-30
URL http://arxiv.org/abs/1807.11215v2
PDF http://arxiv.org/pdf/1807.11215v2.pdf
PWC https://paperswithcode.com/paper/cake-compact-and-accurate-k-dimensional
Repo
Framework

A context encoder for audio inpainting

Title A context encoder for audio inpainting
Authors Andrés Marafioti, Nathanaël Perraudin, Nicki Holighaus, Piotr Majdak
Abstract We study the ability of deep neural networks (DNNs) to restore missing audio content based on its context, i.e., inpaint audio gaps. We focus on a condition which has not received much attention yet: gaps in the range of tens of milliseconds. We propose a DNN structure that is provided with the signal surrounding the gap in the form of time-frequency (TF) coefficients. Two DNNs with either complex-valued TF coefficient output or magnitude TF coefficient output were studied by separately training them on inpainting two types of audio signals (music and musical instruments) having 64-ms long gaps. The magnitude DNN outperformed the complex-valued DNN in terms of signal-to-noise ratios and objective difference grades. Although, for instruments, a reference inpainting obtained through linear predictive coding performed better in both metrics, it performed worse than the magnitude DNN for music. This demonstrates the potential of the magnitude DNN, in particular for inpainting signals that are more complex than single instrument sounds.
Tasks
Published 2018-10-29
URL https://arxiv.org/abs/1810.12138v2
PDF https://arxiv.org/pdf/1810.12138v2.pdf
PWC https://paperswithcode.com/paper/a-context-encoder-for-audio-inpainting
Repo
Framework

High Fidelity Semantic Shape Completion for Point Clouds using Latent Optimization

Title High Fidelity Semantic Shape Completion for Point Clouds using Latent Optimization
Authors Swaminathan Gurumurthy, Shubham Agrawal
Abstract Semantic shape completion is a challenging problem in 3D computer vision where the task is to generate a complete 3D shape using a partial 3D shape as input. We propose a learning-based approach to complete incomplete 3D shapes through generative modeling and latent manifold optimization. Our algorithm works directly on point clouds. We use an autoencoder and a GAN to learn a distribution of embeddings for point clouds of object classes. An input point cloud with missing regions is first encoded to a feature vector. The representations learnt by the GAN are then used to find the best latent vector on the manifold using a combined optimization that finds a vector in the manifold of plausible vectors that is close to the original input (both in the feature space and the output space of the decoder). Experiments show that our algorithm is capable of successfully reconstructing point clouds with large missing regions with very high fidelity without having to rely on exemplar based database retrieval.
Tasks
Published 2018-07-09
URL http://arxiv.org/abs/1807.03407v2
PDF http://arxiv.org/pdf/1807.03407v2.pdf
PWC https://paperswithcode.com/paper/high-fidelity-semantic-shape-completion-for
Repo
Framework

Hypergraph based semi-supervised learning algorithms applied to speech recognition problem: a novel approach

Title Hypergraph based semi-supervised learning algorithms applied to speech recognition problem: a novel approach
Authors Loc Hoang Tran, Trang Hoang, Bui Hoang Nam Huynh
Abstract Most network-based speech recognition methods are based on the assumption that the labels of two adjacent speech samples in the network are likely to be the same. However, assuming the pairwise relationship between speech samples is not complete. The information a group of speech samples that show very similar patterns and tend to have similar labels is missed. The natural way overcoming the information loss of the above assumption is to represent the feature data of speech samples as the hypergraph. Thus, in this paper, the three un-normalized, random walk, and symmetric normalized hypergraph Laplacian based semi-supervised learning methods applied to hypergraph constructed from the feature data of speech samples in order to predict the labels of speech samples are introduced. Experiment results show that the sensitivity performance measures of these three hypergraph Laplacian based semi-supervised learning methods are greater than the sensitivity performance measures of the Hidden Markov Model method (the current state of the art method applied to speech recognition problem) and graph based semi-supervised learning methods (i.e. the current state of the art network-based method for classification problems) applied to network created from the feature data of speech samples.
Tasks Speech Recognition
Published 2018-10-28
URL http://arxiv.org/abs/1810.12743v1
PDF http://arxiv.org/pdf/1810.12743v1.pdf
PWC https://paperswithcode.com/paper/hypergraph-based-semi-supervised-learning
Repo
Framework

Particle identification in ground-based gamma-ray astronomy using convolutional neural networks

Title Particle identification in ground-based gamma-ray astronomy using convolutional neural networks
Authors E. B. Postnikov, I. V. Bychkov, J. Y. Dubenskaya, O. L. Fedorov, Y. A. Kazarina, E. E. Korosteleva, A. P. Kryukov, A. A. Mikhailov, M. D. Nguyen, S. P. Polyakov, A. O. Shigarov, D. A. Shipilov, D. P. Zhurov
Abstract Modern detectors of cosmic gamma-rays are a special type of imaging telescopes (air Cherenkov telescopes) supplied with cameras with a relatively large number of photomultiplier-based pixels. For example, the camera of the TAIGA-IACT telescope has 560 pixels of hexagonal structure. Images in such cameras can be analysed by deep learning techniques to extract numerous physical and geometrical parameters and/or for incoming particle identification. The most powerful deep learning technique for image analysis, the so-called convolutional neural network (CNN), was implemented in this study. Two open source libraries for machine learning, PyTorch and TensorFlow, were tested as possible software platforms for particle identification in imaging air Cherenkov telescopes. Monte Carlo simulation was performed to analyse images of gamma-rays and background particles (protons) as well as estimate identification accuracy. Further steps of implementation and improvement of this technique are discussed.
Tasks
Published 2018-12-04
URL http://arxiv.org/abs/1812.01551v1
PDF http://arxiv.org/pdf/1812.01551v1.pdf
PWC https://paperswithcode.com/paper/particle-identification-in-ground-based-gamma
Repo
Framework

Agnostic Sample Compression for Linear Regression

Title Agnostic Sample Compression for Linear Regression
Authors Steve Hanneke, Aryeh Kontorovich, Menachem Sadigurschi
Abstract We obtain the first positive results for bounded sample compression in the agnostic regression setting. We show that for p in {1,infinity}, agnostic linear regression with $\ell_p$ loss admits a bounded sample compression scheme. Specifically, we exhibit efficient sample compression schemes for agnostic linear regression in $R^d$ of size $d+1$ under the $\ell_1$ loss and size $d+2$ under the $\ell_\infty$ loss. We further show that for every other $\ell_p$ loss (1 < p < infinity), there does not exist an agnostic compression scheme of bounded size. This refines and generalizes a negative result of David, Moran, and Yehudayoff (2016) for the $\ell_2$ loss. We close by posing a general open question: for agnostic regression with $\ell_1$ loss, does every function class admit a compression scheme of size equal to its pseudo-dimension? This question generalizes Warmuth’s classic sample compression conjecture for realizable-case classification (Warmuth, 2003).
Tasks
Published 2018-10-03
URL http://arxiv.org/abs/1810.01864v1
PDF http://arxiv.org/pdf/1810.01864v1.pdf
PWC https://paperswithcode.com/paper/agnostic-sample-compression-for-linear
Repo
Framework

Cluster Variational Approximations for Structure Learning of Continuous-Time Bayesian Networks from Incomplete Data

Title Cluster Variational Approximations for Structure Learning of Continuous-Time Bayesian Networks from Incomplete Data
Authors Dominik Linzner, Heinz Koeppl
Abstract Continuous-time Bayesian networks (CTBNs) constitute a general and powerful framework for modeling continuous-time stochastic processes on networks. This makes them particularly attractive for learning the directed structures among interacting entities. However, if the available data is incomplete, one needs to simulate the prohibitively complex CTBN dynamics. Existing approximation techniques, such as sampling and low-order variational methods, either scale unfavorably in system size, or are unsatisfactory in terms of accuracy. Inspired by recent advances in statistical physics, we present a new approximation scheme based on cluster-variational methods significantly improving upon existing variational approximations. We can analytically marginalize the parameters of the approximate CTBN, as these are of secondary importance for structure learning. This recovers a scalable scheme for direct structure learning from incomplete and noisy time-series data. Our approach outperforms existing methods in terms of scalability.
Tasks Time Series
Published 2018-09-12
URL http://arxiv.org/abs/1809.04294v4
PDF http://arxiv.org/pdf/1809.04294v4.pdf
PWC https://paperswithcode.com/paper/cluster-variational-approximations-for
Repo
Framework

Community structure: A comparative evaluation of community detection methods

Title Community structure: A comparative evaluation of community detection methods
Authors Vinh-Loc Dao, Cécile Bothorel, Philippe Lenca
Abstract Discovering community structure in complex networks is a mature field since a tremendous number of community detection methods have been introduced in the literature. Nevertheless, it is still very challenging for practioners to determine which method would be suitable to get insights into the structural information of the networks they study. Many recent efforts have been devoted to investigating various quality scores of the community structure, but the problem of distinguishing between different types of communities is still open. In this paper, we propose a comparative, extensive and empirical study to investigate what types of communities many state-of-the-art and well-known community detection methods are producing. Specifically, we provide comprehensive analyses on computation time, community size distribution, a comparative evaluation of methods according to their optimisation schemes as well as a comparison of their partioning strategy through validation metrics. We process our analyses on a very large corpus of hundreds of networks from five different network categories and propose ways to classify community detection methods, helping a potential user to navigate the complex landscape of community detection.
Tasks Community Detection
Published 2018-12-14
URL https://arxiv.org/abs/1812.06598v4
PDF https://arxiv.org/pdf/1812.06598v4.pdf
PWC https://paperswithcode.com/paper/community-structure-a-comparative-evaluation
Repo
Framework

Non-linear process convolutions for multi-output Gaussian processes

Title Non-linear process convolutions for multi-output Gaussian processes
Authors Mauricio A. Álvarez, Wil O. C. Ward, Cristian Guarnizo
Abstract The paper introduces a non-linear version of the process convolution formalism for building covariance functions for multi-output Gaussian processes. The non-linearity is introduced via Volterra series, one series per each output. We provide closed-form expressions for the mean function and the covariance function of the approximated Gaussian process at the output of the Volterra series. The mean function and covariance function for the joint Gaussian process are derived using formulae for the product moments of Gaussian variables. We compare the performance of the non-linear model against the classical process convolution approach in one synthetic dataset and two real datasets.
Tasks Gaussian Processes
Published 2018-10-10
URL http://arxiv.org/abs/1810.04632v2
PDF http://arxiv.org/pdf/1810.04632v2.pdf
PWC https://paperswithcode.com/paper/non-linear-process-convolutions-for-multi
Repo
Framework

From Word to Sense Embeddings: A Survey on Vector Representations of Meaning

Title From Word to Sense Embeddings: A Survey on Vector Representations of Meaning
Authors Jose Camacho-Collados, Mohammad Taher Pilehvar
Abstract Over the past years, distributed semantic representations have proved to be effective and flexible keepers of prior knowledge to be integrated into downstream applications. This survey focuses on the representation of meaning. We start from the theoretical background behind word vector space models and highlight one of their major limitations: the meaning conflation deficiency, which arises from representing a word with all its possible meanings as a single vector. Then, we explain how this deficiency can be addressed through a transition from the word level to the more fine-grained level of word senses (in its broader acceptation) as a method for modelling unambiguous lexical meaning. We present a comprehensive overview of the wide range of techniques in the two main branches of sense representation, i.e., unsupervised and knowledge-based. Finally, this survey covers the main evaluation procedures and applications for this type of representation, and provides an analysis of four of its important aspects: interpretability, sense granularity, adaptability to different domains and compositionality.
Tasks
Published 2018-05-10
URL http://arxiv.org/abs/1805.04032v3
PDF http://arxiv.org/pdf/1805.04032v3.pdf
PWC https://paperswithcode.com/paper/from-word-to-sense-embeddings-a-survey-on
Repo
Framework

Energy Spatio-Temporal Pattern Prediction for Electric Vehicle Networks

Title Energy Spatio-Temporal Pattern Prediction for Electric Vehicle Networks
Authors Qinglong Wang
Abstract Information about the spatio-temporal pattern of electricity energy carried by EVs, instead of EVs themselves, is crucial for EVs to establish more effective and intelligent interactions with the smart grid. In this paper, we propose a framework for predicting the amount of the electricity energy stored by a large number of EVs aggregated within different city-scale regions, based on spatio-temporal pattern of the electricity energy. The spatial pattern is modeled via using a neural network based spatial predictor, while the temporal pattern is captured via using a linear-chain conditional random field (CRF) based temporal predictor. Two predictors are fed with spatial and temporal features respectively, which are extracted based on real trajectories data recorded in Beijing. Furthermore, we combine both predictors to build the spatio-temporal predictor, by using an optimal combination coefficient which minimizes the normalized mean square error (NMSE) of the predictions. The prediction performance is evaluated based on extensive experiments covering both spatial and temporal predictions, and the improvement achieved by the combined spatio-temporal predictor. The experiment results show that the NMSE of the spatio-temporal predictor is maintained below 0.1 for all investigate regions of Beijing. We further visualize the prediction and discuss the potential benefits can be brought to smart grid scheduling and EV charging by utilizing the proposed framework.
Tasks
Published 2018-02-14
URL http://arxiv.org/abs/1802.04931v1
PDF http://arxiv.org/pdf/1802.04931v1.pdf
PWC https://paperswithcode.com/paper/energy-spatio-temporal-pattern-prediction-for
Repo
Framework
comments powered by Disqus