October 19, 2019

2853 words 14 mins read

Paper Group ANR 226

Pixel Level Data Augmentation for Semantic Image Segmentation using Generative Adversarial Networks. Contextual Parameter Generation for Universal Neural Machine Translation. A Bio-inspired Collision Detecotr for Small Quadcopter. Kalman Filter Modifier for Neural Networks in Non-stationary Environments. Comparing Temporal Graphs Using Dynamic Time …

Pixel Level Data Augmentation for Semantic Image Segmentation using Generative Adversarial Networks


Title	Pixel Level Data Augmentation for Semantic Image Segmentation using Generative Adversarial Networks
Authors	Shuangting Liu, Jiaqi Zhang, Yuxin Chen, Yifan Liu, Zengchang Qin, Tao Wan
Abstract	Semantic segmentation is one of the basic topics in computer vision, it aims to assign semantic labels to every pixel of an image. Unbalanced semantic label distribution could have a negative influence on segmentation accuracy. In this paper, we investigate using data augmentation approach to balance the semantic label distribution in order to improve segmentation performance. We propose using generative adversarial networks (GANs) to generate realistic images for improving the performance of semantic segmentation networks. Experimental results show that the proposed method can not only improve segmentation performance on those classes with low accuracy, but also obtain 1.3% to 2.1% increase in average segmentation accuracy. It shows that this augmentation method can boost accuracy and be easily applicable to any other segmentation models.
Tasks	Data Augmentation, Semantic Segmentation
Published	2018-11-01
URL	https://arxiv.org/abs/1811.00174v4
PDF	https://arxiv.org/pdf/1811.00174v4.pdf
PWC	https://paperswithcode.com/paper/pixel-level-data-augmentation-for-semantic
Repo
Framework

Contextual Parameter Generation for Universal Neural Machine Translation


Title	Contextual Parameter Generation for Universal Neural Machine Translation
Authors	Emmanouil Antonios Platanios, Mrinmaya Sachan, Graham Neubig, Tom Mitchell
Abstract	We propose a simple modification to existing neural machine translation (NMT) models that enables using a single universal model to translate between multiple languages while allowing for language specific parameterization, and that can also be used for domain adaptation. Our approach requires no changes to the model architecture of a standard NMT system, but instead introduces a new component, the contextual parameter generator (CPG), that generates the parameters of the system (e.g., weights in a neural network). This parameter generator accepts source and target language embeddings as input, and generates the parameters for the encoder and the decoder, respectively. The rest of the model remains unchanged and is shared across all languages. We show how this simple modification enables the system to use monolingual data for training and also perform zero-shot translation. We further show it is able to surpass state-of-the-art performance for both the IWSLT-15 and IWSLT-17 datasets and that the learned language embeddings are able to uncover interesting relationships between languages.
Tasks	Domain Adaptation, Machine Translation
Published	2018-08-26
URL	http://arxiv.org/abs/1808.08493v1
PDF	http://arxiv.org/pdf/1808.08493v1.pdf
PWC	https://paperswithcode.com/paper/contextual-parameter-generation-for-universal
Repo
Framework

A Bio-inspired Collision Detecotr for Small Quadcopter


Title	A Bio-inspired Collision Detecotr for Small Quadcopter
Authors	Jiannan Zhao, Cheng Hu, Chun Zhang, Zhihua Wang, Shigang Yue
Abstract	Sense and avoid capability enables insects to fly versatilely and robustly in dynamic complex environment. Their biological principles are so practical and efficient that inspired we human imitating them in our flying machines. In this paper, we studied a novel bio-inspired collision detector and its application on a quadcopter. The detector is inspired from LGMD neurons in the locusts, and modeled into an STM32F407 MCU. Compared to other collision detecting methods applied on quadcopters, we focused on enhancing the collision selectivity in a bio-inspired way that can considerably increase the computing efficiency during an obstacle detecting task even in complex dynamic environment. We designed the quadcopter’s responding operation imminent collisions and tested this bio-inspired system in an indoor arena. The observed results from the experiments demonstrated that the LGMD collision detector is feasible to work as a vision module for the quadcopter’s collision avoidance task.
Tasks
Published	2018-01-14
URL	http://arxiv.org/abs/1801.04530v1
PDF	http://arxiv.org/pdf/1801.04530v1.pdf
PWC	https://paperswithcode.com/paper/a-bio-inspired-collision-detecotr-for-small
Repo
Framework

Kalman Filter Modifier for Neural Networks in Non-stationary Environments


Title	Kalman Filter Modifier for Neural Networks in Non-stationary Environments
Authors	Honglin Li, Frieder Ganz, Shirin Enshaeifar, Payam Barnaghi
Abstract	Learning in a non-stationary environment is an inevitable problem when applying machine learning algorithm to real world environment. Learning new tasks without forgetting the previous knowledge is a challenge issue in machine learning. We propose a Kalman Filter based modifier to maintain the performance of Neural Network models under non-stationary environments. The result shows that our proposed model can preserve the key information and adapts better to the changes. The accuracy of proposed model decreases by 0.4% in our experiments, while the accuracy of conventional model decreases by 90% in the drifts environment.
Tasks
Published	2018-11-06
URL	http://arxiv.org/abs/1811.02361v1
PDF	http://arxiv.org/pdf/1811.02361v1.pdf
PWC	https://paperswithcode.com/paper/kalman-filter-modifier-for-neural-networks-in
Repo
Framework

Comparing Temporal Graphs Using Dynamic Time Warping


Title	Comparing Temporal Graphs Using Dynamic Time Warping
Authors	Vincent Froese, Brijnesh Jain, Rolf Niedermeier, Malte Renken
Abstract	Within many real-world networks the links between pairs of nodes change over time. Thus, there has been a recent boom in studying temporal graphs. Recognizing patterns in temporal graphs requires a similarity (dissimilarity) measure to compare different temporal graphs. To this end, we propose to study dynamic time warping on temporal graphs. We define the dynamic temporal graph warping distance (dtgw) to determine the similarity of two temporal graphs. Our novel measure is flexible and can be applied in various application domains. We show that computing the dtgw-distance is a challenging (in general) NP-hard optimization problem and identify some polynomial-time solvable special cases. Moreover, we develop a quadratic programming formulation and an efficient heuristic. In experiments on real-word data we show that the heuristic performs very well and that our dtgw-distance performs favorably in de-anonymizing networks compared to other approaches.
Tasks	Time Series
Published	2018-10-15
URL	https://arxiv.org/abs/1810.06240v3
PDF	https://arxiv.org/pdf/1810.06240v3.pdf
PWC	https://paperswithcode.com/paper/comparing-temporal-graphs-using-dynamic-time
Repo
Framework

Probabilistic Inference Using Generators - The Statues Algorithm


Title	Probabilistic Inference Using Generators - The Statues Algorithm
Authors	Pierre Denis
Abstract	We present here a new probabilistic inference algorithm that gives exact results in the domain of discrete probability distributions. This algorithm, named the Statues algorithm, calculates the marginal probability distribution on probabilistic models defined as direct acyclic graphs. These models are made up of well-defined primitives that allow to express, in particular, joint probability distributions, Bayesian networks, discrete Markov chains, conditioning and probabilistic arithmetic. The Statues algorithm relies on a variable binding mechanism based on the generator construct, a special form of coroutine; being related to the enumeration algorithm, this new algorithm brings important improvements in terms of efficiency, which makes it valuable in regard to other exact marginalization algorithms. After introduction of several definitions, primitives and compositional rules, we present in details the Statues algorithm. Then, we briefly discuss the interest of this algorithm compared to others and we present possible extensions. Finally, we introduce Lea and MicroLea, two Python libraries implementing the Statues algorithm, along with several use cases. A proof of the correctness of the algorithm is provided in appendix.
Tasks
Published	2018-06-24
URL	http://arxiv.org/abs/1806.09997v2
PDF	http://arxiv.org/pdf/1806.09997v2.pdf
PWC	https://paperswithcode.com/paper/probabilistic-inference-using-generators-the
Repo
Framework

On the Importance of Strong Baselines in Bayesian Deep Learning


Title	On the Importance of Strong Baselines in Bayesian Deep Learning
Authors	Jishnu Mukhoti, Pontus Stenetorp, Yarin Gal
Abstract	Like all sub-fields of machine learning Bayesian Deep Learning is driven by empirical validation of its theoretical proposals. Given the many aspects of an experiment it is always possible that minor or even major experimental flaws can slip by both authors and reviewers. One of the most popular experiments used to evaluate approximate inference techniques is the regression experiment on UCI datasets. However, in this experiment, models which have been trained to convergence have often been compared with baselines trained only for a fixed number of iterations. We find that a well-established baseline, Monte Carlo dropout, when evaluated under the same experimental settings shows significant improvements. In fact, the baseline outperforms or performs competitively with methods that claimed to be superior to the very same baseline method when they were introduced. Hence, by exposing this flaw in experimental procedure, we highlight the importance of using identical experimental setups to evaluate, compare, and benchmark methods in Bayesian Deep Learning.
Tasks
Published	2018-11-23
URL	http://arxiv.org/abs/1811.09385v2
PDF	http://arxiv.org/pdf/1811.09385v2.pdf
PWC	https://paperswithcode.com/paper/on-the-importance-of-strong-baselines-in
Repo
Framework

Niching an Archive-based Gaussian Estimation of Distribution Algorithm via Adaptive Clustering


Title	Niching an Archive-based Gaussian Estimation of Distribution Algorithm via Adaptive Clustering
Authors	Yongsheng Liang, Zhigang Ren, Bei Pang, An Chen
Abstract	As a model-based evolutionary algorithm, estimation of distribution algorithm (EDA) possesses unique characteristics and has been widely applied to global optimization. However, traditional Gaussian EDA (GEDA) may suffer from premature convergence and has a high risk of falling into local optimum when dealing with multimodal problem. In this paper, we first attempts to improve the performance of GEDA by utilizing historical solutions and develops a novel archive-based EDA variant. The use of historical solutions not only enhances the search efficiency of EDA to a large extent, but also significantly reduces the population size so that a faster convergence could be achieved. Then, the archive-based EDA is further integrated with a novel adaptive clustering strategy for solving multimodal optimization problems. Taking the advantage of the clustering strategy in locating different promising areas and the powerful exploitation ability of the archive-based EDA, the resultant algorithm is endowed with strong capability in finding multiple optima. To verify the efficiency of the proposed algorithm, we tested it on a set of well-known niching benchmark problems and compared it with several state-of-the-art niching algorithms. The experimental results indicate that the proposed algorithm is competitive.
Tasks
Published	2018-03-01
URL	http://arxiv.org/abs/1803.00986v1
PDF	http://arxiv.org/pdf/1803.00986v1.pdf
PWC	https://paperswithcode.com/paper/niching-an-archive-based-gaussian-estimation
Repo
Framework

dAIrector: Automatic Story Beat Generation through Knowledge Synthesis


Title	dAIrector: Automatic Story Beat Generation through Knowledge Synthesis
Authors	Markus Eger, Kory W. Mathewson
Abstract	dAIrector is an automated director which collaborates with humans storytellers for live improvisational performances and writing assistance. dAIrector can be used to create short narrative arcs through contextual plot generation. In this work, we present the system architecture, a quantitative evaluation of design choices, and a case-study usage of the system which provides qualitative feedback from a professional improvisational performer. We present relevant metrics for the understudied domain of human-machine creative generation, specifically long-form narrative creation. We include, alongside publication, open-source code so that others may test, evaluate, and run the dAIrector.
Tasks
Published	2018-10-31
URL	https://arxiv.org/abs/1811.03423v1
PDF	https://arxiv.org/pdf/1811.03423v1.pdf
PWC	https://paperswithcode.com/paper/dairector-automatic-story-beat-generation
Repo
Framework

FRAME Revisited: An Interpretation View Based on Particle Evolution


Title	FRAME Revisited: An Interpretation View Based on Particle Evolution
Authors	Xu Cai, Yang Wu, Guanbin Li, Ziliang Chen, Liang Lin
Abstract	FRAME (Filters, Random fields, And Maximum Entropy) is an energy-based descriptive model that synthesizes visual realism by capturing mutual patterns from structural input signals. The maximum likelihood estimation (MLE) is applied by default, yet conventionally causes the unstable training energy that wrecks the generated structures, which remains unexplained. In this paper, we provide a new theoretical insight to analyze FRAME, from a perspective of particle physics ascribing the weird phenomenon to KL-vanishing issue. In order to stabilize the energy dissipation, we propose an alternative Wasserstein distance in discrete time based on the conclusion that the Jordan-Kinderlehrer-Otto (JKO) discrete flow approximates KL discrete flow when the time step size tends to 0. Besides, this metric can still maintain the model’s statistical consistency. Quantitative and qualitative experiments have been respectively conducted on several widely used datasets. The empirical studies have evidenced the effectiveness and superiority of our method.
Tasks
Published	2018-12-04
URL	http://arxiv.org/abs/1812.01186v4
PDF	http://arxiv.org/pdf/1812.01186v4.pdf
PWC	https://paperswithcode.com/paper/frame-revisited-an-interpretation-view-based
Repo
Framework

A Simple Way to Deal with Cherry-picking


Title	A Simple Way to Deal with Cherry-picking
Authors	Junpei Komiyama, Takanori Maehara
Abstract	Statistical hypothesis testing serves as statistical evidence for scientific innovation. However, if the reported results are intentionally biased, hypothesis testing no longer controls the rate of false discovery. In particular, we study such selection bias in machine learning models where the reporter is motivated to promote an algorithmic innovation. When the number of possible configurations (e.g., datasets) is large, we show that the reporter can falsely report an innovation even if there is no improvement at all. We propose a `post-reporting’ solution to this issue where the bias of the reported results is verified by another set of results. The theoretical findings are supported by experimental results with synthetic and real-world datasets. \|
Tasks
Published	2018-10-11
URL	http://arxiv.org/abs/1810.04996v1
PDF	http://arxiv.org/pdf/1810.04996v1.pdf
PWC	https://paperswithcode.com/paper/a-simple-way-to-deal-with-cherry-picking
Repo
Framework

Learning monocular visual odometry with dense 3D mapping from dense 3D flow


Title	Learning monocular visual odometry with dense 3D mapping from dense 3D flow
Authors	Cheng Zhao, Li Sun, Pulak Purkait, Tom Duckett, Rustam Stolkin
Abstract	This paper introduces a fully deep learning approach to monocular SLAM, which can perform simultaneous localization using a neural network for learning visual odometry (L-VO) and dense 3D mapping. Dense 2D flow and a depth image are generated from monocular images by sub-networks, which are then used by a 3D flow associated layer in the L-VO network to generate dense 3D flow. Given this 3D flow, the dual-stream L-VO network can then predict the 6DOF relative pose and furthermore reconstruct the vehicle trajectory. In order to learn the correlation between motion directions, the Bivariate Gaussian modelling is employed in the loss function. The L-VO network achieves an overall performance of 2.68% for average translational error and 0.0143 deg/m for average rotational error on the KITTI odometry benchmark. Moreover, the learned depth is fully leveraged to generate a dense 3D map. As a result, an entire visual SLAM system, that is, learning monocular odometry combined with dense 3D mapping, is achieved.
Tasks	Monocular Visual Odometry, Visual Odometry
Published	2018-03-06
URL	http://arxiv.org/abs/1803.02286v2
PDF	http://arxiv.org/pdf/1803.02286v2.pdf
PWC	https://paperswithcode.com/paper/learning-monocular-visual-odometry-with-dense
Repo
Framework

Your 2 is My 1, Your 3 is My 9: Handling Arbitrary Miscalibrations in Ratings


Title	Your 2 is My 1, Your 3 is My 9: Handling Arbitrary Miscalibrations in Ratings
Authors	Jingyan Wang, Nihar B. Shah
Abstract	Cardinal scores (numeric ratings) collected from people are well known to suffer from miscalibrations. A popular approach to address this issue is to assume simplistic models of miscalibration (such as linear biases) to de-bias the scores. This approach, however, often fares poorly because people’s miscalibrations are typically far more complex and not well understood. In the absence of simplifying assumptions on the miscalibration, it is widely believed by the crowdsourcing community that the only useful information in the cardinal scores is the induced ranking. In this paper, inspired by the framework of Stein’s shrinkage, empirical Bayes, and the classic two-envelope problem, we contest this widespread belief. Specifically, we consider cardinal scores with arbitrary (or even adversarially chosen) miscalibrations which are only required to be consistent with the induced ranking. We design estimators which despite making no assumptions on the miscalibration, strictly and uniformly outperform all possible estimators that rely on only the ranking. Our estimators are flexible in that they can be used as a plug-in for a variety of applications, and we provide a proof-of-concept for A/B testing and ranking. Our results thus provide novel insights in the eternal debate between cardinal and ordinal data.
Tasks
Published	2018-06-13
URL	http://arxiv.org/abs/1806.05085v2
PDF	http://arxiv.org/pdf/1806.05085v2.pdf
PWC	https://paperswithcode.com/paper/your-2-is-my-1-your-3-is-my-9-handling
Repo
Framework

TED: Teaching AI to Explain its Decisions


Title	TED: Teaching AI to Explain its Decisions
Authors	Michael Hind, Dennis Wei, Murray Campbell, Noel C. F. Codella, Amit Dhurandhar, Aleksandra Mojsilović, Karthikeyan Natesan Ramamurthy, Kush R. Varshney
Abstract	Artificial intelligence systems are being increasingly deployed due to their potential to increase the efficiency, scale, consistency, fairness, and accuracy of decisions. However, as many of these systems are opaque in their operation, there is a growing demand for such systems to provide explanations for their decisions. Conventional approaches to this problem attempt to expose or discover the inner workings of a machine learning model with the hope that the resulting explanations will be meaningful to the consumer. In contrast, this paper suggests a new approach to this problem. It introduces a simple, practical framework, called Teaching Explanations for Decisions (TED), that provides meaningful explanations that match the mental model of the consumer. We illustrate the generality and effectiveness of this approach with two different examples, resulting in highly accurate explanations with no loss of prediction accuracy for these two examples.
Tasks
Published	2018-11-12
URL	https://arxiv.org/abs/1811.04896v2
PDF	https://arxiv.org/pdf/1811.04896v2.pdf
PWC	https://paperswithcode.com/paper/ted-teaching-ai-to-explain-its-decisions
Repo
Framework

Back-Translation Sampling by Targeting Difficult Words in Neural Machine Translation


Title	Back-Translation Sampling by Targeting Difficult Words in Neural Machine Translation
Authors	Marzieh Fadaee, Christof Monz
Abstract	Neural Machine Translation has achieved state-of-the-art performance for several language pairs using a combination of parallel and synthetic data. Synthetic data is often generated by back-translating sentences randomly sampled from monolingual data using a reverse translation model. While back-translation has been shown to be very effective in many cases, it is not entirely clear why. In this work, we explore different aspects of back-translation, and show that words with high prediction loss during training benefit most from the addition of synthetic data. We introduce several variations of sampling strategies targeting difficult-to-predict words using prediction losses and frequencies of words. In addition, we also target the contexts of difficult words and sample sentences that are similar in context. Experimental results for the WMT news translation task show that our method improves translation quality by up to 1.7 and 1.2 Bleu points over back-translation using random sampling for German-English and English-German, respectively.
Tasks	Machine Translation
Published	2018-08-27
URL	http://arxiv.org/abs/1808.09006v2
PDF	http://arxiv.org/pdf/1808.09006v2.pdf
PWC	https://paperswithcode.com/paper/back-translation-sampling-by-targeting
Repo
Framework