January 25, 2020

2720 words 13 mins read

Paper Group ANR 1631

Chainer: A Deep Learning Framework for Accelerating the Research Cycle. Multi-sense Definition Modeling using Word Sense Decompositions. Matching Thermal to Visible Face Images Using a Semantic-Guided Generative Adversarial Network. Black-Box Decision based Adversarial Attack with Symmetric $α$-stable Distribution. Domain-Specific Priors and Meta L …

Chainer: A Deep Learning Framework for Accelerating the Research Cycle


Title	Chainer: A Deep Learning Framework for Accelerating the Research Cycle
Authors	Seiya Tokui, Ryosuke Okuta, Takuya Akiba, Yusuke Niitani, Toru Ogawa, Shunta Saito, Shuji Suzuki, Kota Uenishi, Brian Vogel, Hiroyuki Yamazaki Vincent
Abstract	Software frameworks for neural networks play a key role in the development and application of deep learning methods. In this paper, we introduce the Chainer framework, which intends to provide a flexible, intuitive, and high performance means of implementing the full range of deep learning models needed by researchers and practitioners. Chainer provides acceleration using Graphics Processing Units with a familiar NumPy-like API through CuPy, supports general and dynamic models in Python through Define-by-Run, and also provides add-on packages for state-of-the-art computer vision models as well as distributed training.
Tasks
Published	2019-08-01
URL	https://arxiv.org/abs/1908.00213v1
PDF	https://arxiv.org/pdf/1908.00213v1.pdf
PWC	https://paperswithcode.com/paper/chainer-a-deep-learning-framework-for
Repo
Framework

Multi-sense Definition Modeling using Word Sense Decompositions


Title	Multi-sense Definition Modeling using Word Sense Decompositions
Authors	Ruimin Zhu, Thanapon Noraset, Alisa Liu, Wenxin Jiang, Doug Downey
Abstract	Word embeddings capture syntactic and semantic information about words. Definition modeling aims to make the semantic content in each embedding explicit, by outputting a natural language definition based on the embedding. However, existing definition models are limited in their ability to generate accurate definitions for different senses of the same word. In this paper, we introduce a new method that enables definition modeling for multiple senses. We show how a Gumble-Softmax approach outperforms baselines at matching sense-specific embeddings to definitions during training. In experiments, our multi-sense definition model improves recall over a state-of-the-art single-sense definition model by a factor of three, without harming precision.
Tasks	Word Embeddings
Published	2019-09-19
URL	https://arxiv.org/abs/1909.09483v1
PDF	https://arxiv.org/pdf/1909.09483v1.pdf
PWC	https://paperswithcode.com/paper/multi-sense-definition-modeling-using-word
Repo
Framework

Matching Thermal to Visible Face Images Using a Semantic-Guided Generative Adversarial Network


Title	Matching Thermal to Visible Face Images Using a Semantic-Guided Generative Adversarial Network
Authors	Cunjian Chen, Arun Ross
Abstract	Designing face recognition systems that are capable of matching face images obtained in the thermal spectrum with those obtained in the visible spectrum is a challenging problem. In this work, we propose the use of semantic-guided generative adversarial network (SG-GAN) to automatically synthesize visible face images from their thermal counterparts. Specifically, semantic labels, extracted by a face parsing network, are used to compute a semantic loss function to regularize the adversarial network during training. These semantic cues denote high-level facial component information associated with each pixel. Further, an identity extraction network is leveraged to generate multi-scale features to compute an identity loss function. To achieve photo-realistic results, a perceptual loss function is introduced during network training to ensure that the synthesized visible face is perceptually similar to the target visible face image. We extensively evaluate the benefits of individual loss functions, and combine them effectively to learn the mapping from thermal to visible face images. Experiments involving two multispectral face datasets show that the proposed method achieves promising results in both face synthesis and cross-spectral face matching.
Tasks	Face Generation, Face Recognition
Published	2019-03-03
URL	http://arxiv.org/abs/1903.00963v1
PDF	http://arxiv.org/pdf/1903.00963v1.pdf
PWC	https://paperswithcode.com/paper/matching-thermal-to-visible-face-images-using
Repo
Framework

Black-Box Decision based Adversarial Attack with Symmetric $α$-stable Distribution


Title	Black-Box Decision based Adversarial Attack with Symmetric $α$-stable Distribution
Authors	Vignesh Srinivasan, Ercan E. Kuruoglu, Klaus-Robert Müller, Wojciech Samek, Shinichi Nakajima
Abstract	Developing techniques for adversarial attack and defense is an important research field for establishing reliable machine learning and its applications. Many existing methods employ Gaussian random variables for exploring the data space to find the most adversarial (for attacking) or least adversarial (for defense) point. However, the Gaussian distribution is not necessarily the optimal choice when the exploration is required to follow the complicated structure that most real-world data distributions exhibit. In this paper, we investigate how statistics of random variables affect such random walk exploration. Specifically, we generalize the Boundary Attack, a state-of-the-art black-box decision based attacking strategy, and propose the L'evy-Attack, where the random walk is driven by symmetric $\alpha$-stable random variables. Our experiments on MNIST and CIFAR10 datasets show that the L'evy-Attack explores the image data space more efficiently, and significantly improves the performance. Our results also give an insight into the recently found fact in the whitebox attacking scenario that the choice of the norm for measuring the amplitude of the adversarial patterns is essential.
Tasks	Adversarial Attack
Published	2019-04-11
URL	http://arxiv.org/abs/1904.05586v1
PDF	http://arxiv.org/pdf/1904.05586v1.pdf
PWC	https://paperswithcode.com/paper/black-box-decision-based-adversarial-attack
Repo
Framework

Domain-Specific Priors and Meta Learning for Low-shot First-Person Action Recognition


Title	Domain-Specific Priors and Meta Learning for Low-shot First-Person Action Recognition
Authors	Huseyin Coskun, Zeeshan Zia, Bugra Tekin, Federica Bogo, Nassir Navab, Federico Tombari, Harpreet Sawhney
Abstract	The lack of large-scale real datasets with annotationsmakes transfer learning a necessity for video activity under-standing. Within this scope, we aim at developing an effec-tive method for low-shot transfer learning for first-personaction classification. We leverage independently trained lo-cal visual cues to learn representations that can be trans-ferred from a source domain providing primitive action la-bels to a target domain with only a handful of examples.Such visual cues include object-object interactions, handgrasps and motion within regions that are a function of handlocations. We suggest a framework based on meta-learningto appropriately extract the distinctive and domain invari-ant components of the deployed visual cues, so to be able totransfer action classification models across public datasetscaptured with different scene configurations. We thoroughlyevaluate our methodology and report promising results overstate-of-the-art action classification approaches for bothinter-class and inter-dataset transfer.
Tasks	Action Classification, Meta-Learning, Transfer Learning
Published	2019-07-22
URL	https://arxiv.org/abs/1907.09382v1
PDF	https://arxiv.org/pdf/1907.09382v1.pdf
PWC	https://paperswithcode.com/paper/domain-specific-priors-and-meta-learning-for
Repo
Framework

Selection-based Question Answering of an MOOC


Title	Selection-based Question Answering of an MOOC
Authors	Atul Sahay, Smita Gholkar, Kavi Arya
Abstract	e-Yantra Robotics Competition (eYRC) is a unique Robotics Competition hosted by IIT Bombay that is actually an Embedded Systems and Robotics MOOC. Registrations have been growing exponentially in each year from 4500 in 2012 to over 34000 in 2019. In this 5-month long competition students learn complex skills under severe time pressure and have access to a discussion forum to post doubts about the learning material. Responding to questions in real-time is a challenge for project staff. Here, we illustrate the advantage of Deep Learning for real-time question answering in the eYRC discussion forum. We illustrate the advantage of Transformer based contextual embedding mechanisms such as Bidirectional Encoder Representation From Transformer (BERT) over word embedding mechanisms such as Word2Vec. We propose a weighted similarity metric as a measure of matching and find it more reliable than Content-Content or Title-Title similarities alone. The automation of replying to questions has brought the turn around response time(TART) down from a minimum of 21 mins to a minimum of 0.3 secs.
Tasks	Question Answering
Published	2019-11-15
URL	https://arxiv.org/abs/1911.07629v1
PDF	https://arxiv.org/pdf/1911.07629v1.pdf
PWC	https://paperswithcode.com/paper/selection-based-question-answering-of-an-mooc
Repo
Framework

Augmented Memory Networks for Streaming-Based Active One-Shot Learning


Title	Augmented Memory Networks for Streaming-Based Active One-Shot Learning
Authors	Andreas Kvistad, Massimiliano Ruocco, Eliezer de Souza da Silva, Erlend Aune
Abstract	One of the major challenges in training deep architectures for predictive tasks is the scarcity and cost of labeled training data. Active Learning (AL) is one way of addressing this challenge. In stream-based AL, observations are continuously made available to the learner that have to decide whether to request a label or to make a prediction. The goal is to reduce the request rate while at the same time maximize prediction performance. In previous research, reinforcement learning has been used for learning the AL request/prediction strategy. In our work, we propose to equip a reinforcement learning process with memory augmented neural networks, to enhance the one-shot capabilities. Moreover, we introduce Class Margin Sampling (CMS) as an extension of the standard margin sampling to the reinforcement learning setting. This strategy aims to reduce training time and improve sample efficiency in the training process. We evaluate the proposed method on a classification task using empirical accuracy of label predictions and percentage of label requests. The results indicates that the proposed method, by making use of the memory augmented networks and CMS in the training process, outperforms existing baselines.
Tasks	Active Learning, One-Shot Learning
Published	2019-09-04
URL	https://arxiv.org/abs/1909.01757v1
PDF	https://arxiv.org/pdf/1909.01757v1.pdf
PWC	https://paperswithcode.com/paper/augmented-memory-networks-for-streaming-based
Repo
Framework

General Fragment Model for Information Artifacts


Title	General Fragment Model for Information Artifacts
Authors	Sandro Rama Fiorini, Wallas Sousa dos Santos, Rodrigo Costa Mesquita, Guilherme Ferreira Lima, Marcio F. Moreno
Abstract	The use of semantic descriptions in data intensive domains require a systematic model for linking semantic descriptions with their manifestations in fragments of heterogeneous information and data objects. Such information heterogeneity requires a fragment model that is general enough to support the specification of anchors from conceptual models to multiple types of information artifacts. While diverse proposals of anchoring models exist in the literature, they are usually focused in audiovisual information. We propose a generalized fragment model that can be instantiated to different kinds of information artifacts. Our objective is to systematize the way in which fragments and anchors can be described in conceptual models, without committing to a specific vocabulary.
Tasks
Published	2019-09-09
URL	https://arxiv.org/abs/1909.04117v1
PDF	https://arxiv.org/pdf/1909.04117v1.pdf
PWC	https://paperswithcode.com/paper/general-fragment-model-for-information
Repo
Framework

SPM-Tracker: Series-Parallel Matching for Real-Time Visual Object Tracking


Title	SPM-Tracker: Series-Parallel Matching for Real-Time Visual Object Tracking
Authors	Guangting Wang, Chong Luo, Zhiwei Xiong, Wenjun Zeng
Abstract	The greatest challenge facing visual object tracking is the simultaneous requirements on robustness and discrimination power. In this paper, we propose a SiamFC-based tracker, named SPM-Tracker, to tackle this challenge. The basic idea is to address the two requirements in two separate matching stages. Robustness is strengthened in the coarse matching (CM) stage through generalized training while discrimination power is enhanced in the fine matching (FM) stage through a distance learning network. The two stages are connected in series as the input proposals of the FM stage are generated by the CM stage. They are also connected in parallel as the matching scores and box location refinements are fused to generate the final results. This innovative series-parallel structure takes advantage of both stages and results in superior performance. The proposed SPM-Tracker, running at 120fps on GPU, achieves an AUC of 0.687 on OTB-100 and an EAO of 0.434 on VOT-16, exceeding other real-time trackers by a notable margin.
Tasks	Object Tracking, Visual Object Tracking
Published	2019-04-09
URL	http://arxiv.org/abs/1904.04452v1
PDF	http://arxiv.org/pdf/1904.04452v1.pdf
PWC	https://paperswithcode.com/paper/spm-tracker-series-parallel-matching-for-real
Repo
Framework

Unsupervised Ensemble Classification with Dependent Data


Title	Unsupervised Ensemble Classification with Dependent Data
Authors	Panagiotis A. Traganitis, Georgios B. Giannakis
Abstract	Ensemble learning, the machine learning paradigm where multiple algorithms are combined, has exhibited promising perfomance in a variety of tasks. The present work focuses on unsupervised ensemble classification. The term unsupervised refers to the ensemble combiner who has no knowledge of the ground-truth labels that each classifier has been trained on. While most prior works on unsupervised ensemble classification are designed for independent and identically distributed (i.i.d.) data, the present work introduces an unsupervised scheme for learning from ensembles of classifiers in the presence of data dependencies. Two types of data dependencies are considered: sequential data and networked data whose dependencies are captured by a graph. Moment matching and Expectation Maximization algorithms are developed for the aforementioned cases, and their performance is evaluated on synthetic and real datasets.
Tasks
Published	2019-06-22
URL	https://arxiv.org/abs/1906.09356v1
PDF	https://arxiv.org/pdf/1906.09356v1.pdf
PWC	https://paperswithcode.com/paper/unsupervised-ensemble-classification-with
Repo
Framework

Margin-Based Generalization Lower Bounds for Boosted Classifiers


Title	Margin-Based Generalization Lower Bounds for Boosted Classifiers
Authors	Allan Grønlund, Lior Kamma, Kasper Green Larsen, Alexander Mathiasen, Jelani Nelson
Abstract	Boosting is one of the most successful ideas in machine learning. The most well-accepted explanations for the low generalization error of boosting algorithms such as AdaBoost stem from margin theory. The study of margins in the context of boosting algorithms was initiated by Schapire, Freund, Bartlett and Lee (1998) and has inspired numerous boosting algorithms and generalization bounds. To date, the strongest known generalization (upper bound) is the $k$th margin bound of Gao and Zhou (2013). Despite the numerous generalization upper bounds that have been proved over the last two decades, nothing is known about the tightness of these bounds. In this paper, we give the first margin-based lower bounds on the generalization error of boosted classifiers. Our lower bounds nearly match the $k$th margin bound and thus almost settle the generalization performance of boosted classifiers in terms of margins.
Tasks
Published	2019-09-27
URL	https://arxiv.org/abs/1909.12518v2
PDF	https://arxiv.org/pdf/1909.12518v2.pdf
PWC	https://paperswithcode.com/paper/margin-based-generalization-lower-bounds-for
Repo
Framework

Detection of the Group of Traffic Signs with Central Slice Theorem


Title	Detection of the Group of Traffic Signs with Central Slice Theorem
Authors	Koba Natroshvili
Abstract	Our sensor system consists of a combination of Photonic Mixer Device - PMD and Mono optical cameras. Some traffic signs have stripes at 45{deg}. These traffic signs cancel different restrictions on the road. We detect this class of signs with Radon transformation. Here the Radon transformation is calculated using Central Slice Theorem. We approximate the slice of spectrum by the Discrete Cosine Transformation (DCT).
Tasks
Published	2019-08-04
URL	https://arxiv.org/abs/1908.04386v1
PDF	https://arxiv.org/pdf/1908.04386v1.pdf
PWC	https://paperswithcode.com/paper/detection-of-the-group-of-traffic-signs-with
Repo
Framework

On the Robustness of Data-Driven Controllers for Linear Systems


Title	On the Robustness of Data-Driven Controllers for Linear Systems
Authors	Rajasekhar Anguluri, Abed AlRahman Al Makdah, Vaibhav Katewa, Fabio Pasqualetti
Abstract	This paper proposes a new framework and several results to quantify the performance of data-driven state-feedback controllers for linear systems against targeted perturbations of the training data. We focus on the case where subsets of the training data are randomly corrupted by an adversary, and derive lower and upper bounds for the stability of the closed-loop system with compromised controller as a function of the perturbation statistics, size of the training data, sensitivity of the data-driven algorithm to perturbation of the training data, and properties of the nominal closed-loop system. Our stability and convergence bounds are probabilistic in nature, and rely on a first-order approximation of the data-driven procedure that designs the state-feedback controller, which can be computed directly using the training data. We illustrate our findings via multiple numerical studies.
Tasks
Published	2019-12-21
URL	https://arxiv.org/abs/1912.10231v1
PDF	https://arxiv.org/pdf/1912.10231v1.pdf
PWC	https://paperswithcode.com/paper/on-the-robustness-of-data-driven-controllers
Repo
Framework

A Concept-Value Network as a Brain Model


Title	A Concept-Value Network as a Brain Model
Authors	Kieran Greer
Abstract	This paper suggests a statistical framework for describing the relations between the physical and conceptual entities of a brain-like model. In particular, features and concept instances are put into context. This may help with understanding or implementing a similar model. The paper suggests that features are in fact the wiring. With this idea, the actual length of the connection is important, because it is related to firing rates and neuron synchronization. The paper then suggests that concepts are neuron groups that link features and concept instances are the signals from those groups. Therefore, features become the static framework of the interconnected neural system and concepts are combinations of these, as determined by the external stimulus and the neural synaptic strengths. Along with this statistical model, it is possible to propose a simplified design for a neuron, based on an action potential and variable output signal. A strong comparison with Hebbian theory is then proposed, with some test results to support the theory.
Tasks
Published	2019-04-09
URL	https://arxiv.org/abs/1904.04579v2
PDF	https://arxiv.org/pdf/1904.04579v2.pdf
PWC	https://paperswithcode.com/paper/a-feature-value-network-as-a-brain-model
Repo
Framework

Quant GANs: Deep Generation of Financial Time Series


Title	Quant GANs: Deep Generation of Financial Time Series
Authors	Magnus Wiese, Robert Knobloch, Ralf Korn, Peter Kretschmer
Abstract	Modeling financial time series by stochastic processes is a challenging task and a central area of research in financial mathematics. As an alternative, we introduce Quant GANs, a data-driven model which is inspired by the recent success of generative adversarial networks (GANs). Quant GANs consist of a generator and discriminator function, which utilize temporal convolutional networks (TCNs) and thereby achieve to capture long-range dependencies such as the presence of volatility clusters. The generator function is explicitly constructed such that the induced stochastic process allows a transition to its risk-neutral distribution. Our numerical results highlight that distributional properties for small and large lags are in an excellent agreement and dependence properties such as volatility clusters, leverage effects, and serial autocorrelations can be generated by the generator function of Quant GANs, demonstrably in high fidelity.
Tasks	Time Series
Published	2019-07-15
URL	https://arxiv.org/abs/1907.06673v2
PDF	https://arxiv.org/pdf/1907.06673v2.pdf
PWC	https://paperswithcode.com/paper/quant-gans-deep-generation-of-financial-time
Repo
Framework