January 25, 2020

Paper Group ANR 1687

A Framework for Building Closed-Domain Chat Dialogue Systems. A Bulirsch-Stoer algorithm using Gaussian processes. Object detection on aerial imagery using CenterNet. Robust Synthesis of Adversarial Visual Examples Using a Deep Image Prior. Facial Micro-Expression Spotting and Recognition using Time Contrasted Feature with Visual Memory. Interval t …

A Framework for Building Closed-Domain Chat Dialogue Systems

Title A Framework for Building Closed-Domain Chat Dialogue Systems
Authors Mikio Nakano, Kazunori Komatani
Abstract This paper presents HRIChat, a framework for developing closed-domain chat dialogue systems. Being able to engage in chat dialogues has been found effective for improving communication between humans and dialogue systems. This paper focuses on closed-domain systems because they would be useful when combined with task-oriented dialogue systems in the same domain. HRIChat enables domain-dependent language understanding so that it can deal well with domain-specific utterances. In addition, HRIChat makes it possible to integrate state transition network-based dialogue management and reaction-based dialogue management. FoodChatbot, which is an application in the food and restaurant domain, has been developed and evaluated through a user study. Its results suggest that reasonably good systems can be developed with HRIChat. This paper also reports lessons learned from the development and evaluation of FoodChatbot.
Tasks Dialogue Management, Task-Oriented Dialogue Systems
Published 2019-10-30
URL https://arxiv.org/abs/1910.13826v2
PDF https://arxiv.org/pdf/1910.13826v2.pdf
PWC https://paperswithcode.com/paper/a-framework-for-building-closed-domain-chat
Repo
Framework

A Bulirsch-Stoer algorithm using Gaussian processes

Title A Bulirsch-Stoer algorithm using Gaussian processes
Authors Philip G. Breen, Christopher N. Foley
Abstract In this paper, we treat the problem of evaluating the asymptotic error in a numerical integration scheme as one with inherent uncertainty. Adding to the growing field of probabilistic numerics, we show that Gaussian process regression (GPR) can be embedded into a numerical integration scheme to allow for (i) robust selection of the adaptive step-size parameter; and (ii) uncertainty quantification in predictions of putatively converged numerical solutions. We present two examples of our approach using Richardson’s extrapolation technique and the Bulirsch-Stoer algorithm. In scenarios where the error surface is smooth and bounded, our proposed approach can match the results of the traditional polynomial (parametric) extrapolation methods. In scenarios where the error surface is not well approximated by a finite-order polynomial, e.g. in the vicinity of a pole or in the assessment of a chaotic system, traditional methods can fail; however, the non-parametric GPR approach demonstrates the potential to continue to furnish reasonable solutions in these situations.
Tasks Gaussian Processes
Published 2019-05-23
URL https://arxiv.org/abs/1905.09892v1
PDF https://arxiv.org/pdf/1905.09892v1.pdf
PWC https://paperswithcode.com/paper/a-bulirsch-stoer-algorithm-using-gaussian
Repo
Framework
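
As a rough illustration of the classical (polynomial) Richardson extrapolation that the abstract above contrasts with GPR, the sketch below combines two explicit Euler solutions at step sizes h and h/2 to cancel the leading O(h) error term. This is not the authors' GPR-based method; the function names `euler` and `richardson_euler` and the test problem y' = y are assumptions chosen for illustration only.

```python
import math


def euler(f, t0, y0, t1, n):
    """Integrate y' = f(t, y) from t0 to t1 with n explicit Euler steps."""
    h = (t1 - t0) / n
    t, y = t0, y0
    for _ in range(n):
        y += h * f(t, y)
        t += h
    return y


def richardson_euler(f, t0, y0, t1, n):
    """One Richardson step for a first-order method.

    Euler's global error behaves like C*h, so 2*y(h/2) - y(h)
    cancels the O(h) term and leaves an O(h**2) error.
    """
    coarse = euler(f, t0, y0, t1, n)
    fine = euler(f, t0, y0, t1, 2 * n)
    return 2.0 * fine - coarse


if __name__ == "__main__":
    rhs = lambda t, y: y  # hypothetical test problem: y' = y, y(0) = 1, so y(1) = e
    print("Euler error:     ", abs(euler(rhs, 0.0, 1.0, 1.0, 100) - math.e))
    print("Richardson error:", abs(richardson_euler(rhs, 0.0, 1.0, 1.0, 100) - math.e))
```

The extrapolation assumes the error is a smooth polynomial in h, which is the assumption the abstract says can break down near a pole or for chaotic systems.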

Object detection on aerial imagery using CenterNet

Title Object detection on aerial imagery using CenterNet
Authors Dheeraj Reddy Pailla, Varghese Kollerathu, Sai Saketh Chennamsetty
Abstract Detection and classification of objects in aerial imagery have several applications like urban planning, crop surveillance, and traffic surveillance. However, due to the lower resolution of the objects and the effect of noise in aerial images, extracting distinguishing features for the objects is a challenge. We evaluate CenterNet, a state-of-the-art method for real-time 2D object detection, on the VisDrone2019 dataset. We evaluate the performance of the model with different backbone networks in conjunction with varying resolutions during training and testing.
Tasks Object Detection
Published 2019-08-22
URL https://arxiv.org/abs/1908.08244v1
PDF https://arxiv.org/pdf/1908.08244v1.pdf
PWC https://paperswithcode.com/paper/object-detection-on-aerial-imagery-using
Repo
Framework

Robust Synthesis of Adversarial Visual Examples Using a Deep Image Prior

Title Robust Synthesis of Adversarial Visual Examples Using a Deep Image Prior
Authors Thomas Gittings, Steve Schneider, John Collomosse
Abstract We present a novel method for generating robust adversarial image examples building upon the recent ‘deep image prior’ (DIP) that exploits convolutional network architectures to enforce plausible texture in image synthesis. Adversarial images are commonly generated by perturbing images to introduce high frequency noise that induces image misclassification, but that is fragile to subsequent digital manipulation of the image. We show that using DIP to reconstruct an image under adversarial constraint induces perturbations that are more robust to affine deformation, whilst remaining visually imperceptible. Furthermore we show that our DIP approach can also be adapted to produce local adversarial patches (‘adversarial stickers’). We demonstrate robust adversarial examples over a broad gamut of images and object classes drawn from the ImageNet dataset.
Tasks Image Generation
Published 2019-07-03
URL https://arxiv.org/abs/1907.01996v1
PDF https://arxiv.org/pdf/1907.01996v1.pdf
PWC https://paperswithcode.com/paper/robust-synthesis-of-adversarial-visual
Repo
Framework

Facial Micro-Expression Spotting and Recognition using Time Contrasted Feature with Visual Memory

Title Facial Micro-Expression Spotting and Recognition using Time Contrasted Feature with Visual Memory
Authors Sauradip Nag, Ayan Kumar Bhunia, Aishik Konwer, Partha Pratim Roy
Abstract Facial micro-expressions are sudden involuntary minute muscle movements which reveal true emotions that people try to conceal. Spotting a micro-expression and recognizing it is a major challenge owing to its short duration and intensity. Many works pursued traditional and deep learning based approaches to solve this issue but compromised on learning low-level features and higher accuracy due to unavailability of datasets. This motivated us to propose a novel joint architecture of spatial and temporal network which extracts time-contrasted features from the feature maps to contrast out micro-expression from rapid muscle movements. The usage of time contrasted features greatly improved the spotting of micro-expression from inconspicuous facial movements. Also, we include a memory module to predict the class and intensity of the micro-expression across the temporal frames of the micro-expression clip. Our method achieves superior performance in comparison to other conventional approaches on CASMEII dataset.
Tasks
Published 2019-02-09
URL http://arxiv.org/abs/1902.03514v2
PDF http://arxiv.org/pdf/1902.03514v2.pdf
PWC https://paperswithcode.com/paper/facial-micro-expression-spotting-and
Repo
Framework

Interval timing in deep reinforcement learning agents

Title Interval timing in deep reinforcement learning agents
Authors Ben Deverett, Ryan Faulkner, Meire Fortunato, Greg Wayne, Joel Z. Leibo
Abstract The measurement of time is central to intelligent behavior. We know that both animals and artificial agents can successfully use temporal dependencies to select actions. In artificial agents, little work has directly addressed (1) which architectural components are necessary for successful development of this ability, (2) how this timing ability comes to be represented in the units and actions of the agent, and (3) whether the resulting behavior of the system converges on solutions similar to those of biology. Here we studied interval timing abilities in deep reinforcement learning agents trained end-to-end on an interval reproduction paradigm inspired by experimental literature on mechanisms of timing. We characterize the strategies developed by recurrent and feedforward agents, which both succeed at temporal reproduction using distinct mechanisms, some of which bear specific and intriguing similarities to biological systems. These findings advance our understanding of how agents come to represent time, and they highlight the value of experimentally inspired approaches to characterizing agent abilities.
Tasks
Published 2019-05-31
URL https://arxiv.org/abs/1905.13469v2
PDF https://arxiv.org/pdf/1905.13469v2.pdf
PWC https://paperswithcode.com/paper/interval-timing-in-deep-reinforcement
Repo
Framework

Graph Convolutional Gaussian Processes

Title Graph Convolutional Gaussian Processes
Authors Ian Walker, Ben Glocker
Abstract We propose a novel Bayesian nonparametric method to learn translation-invariant relationships on non-Euclidean domains. The resulting graph convolutional Gaussian processes can be applied to problems in machine learning for which the input observations are functions with domains on general graphs. The structure of these models allows for high dimensional inputs while retaining expressibility, as is the case with convolutional neural networks. We present applications of graph convolutional Gaussian processes to images and triangular meshes, demonstrating their versatility and effectiveness, comparing favorably to existing methods, despite being relatively simple models.
Tasks Gaussian Processes
Published 2019-05-14
URL https://arxiv.org/abs/1905.05739v1
PDF https://arxiv.org/pdf/1905.05739v1.pdf
PWC https://paperswithcode.com/paper/graph-convolutional-gaussian-processes
Repo
Framework

Suppressing Model Overfitting for Image Super-Resolution Networks

Title Suppressing Model Overfitting for Image Super-Resolution Networks
Authors Ruicheng Feng, Jinjin Gu, Yu Qiao, Chao Dong
Abstract Large deep networks have demonstrated competitive performance in single image super-resolution (SISR), with a huge volume of data involved. However, in real-world scenarios, due to the limited accessible training pairs, large models exhibit undesirable behaviors such as overfitting and memorization. To suppress model overfitting and further enjoy the merits of large model capacity, we thoroughly investigate generic approaches for supplying additional training data pairs. In particular, we introduce a simple learning principle, MixUp, to train networks on interpolations of sample pairs, which encourages networks to support linear behavior in-between training samples. In addition, we propose a data synthesis method with learned degradation, enabling models to use extra high-quality images with higher content diversity. This strategy proves to be successful in reducing biases of data. By combining these components, MixUp and synthetic training data, large models can be trained without overfitting under very limited data samples and achieve satisfactory generalization performance. Our method won second place in the NTIRE2019 Real SR Challenge.
Tasks Image Super-Resolution, Super-Resolution
Published 2019-06-11
URL https://arxiv.org/abs/1906.04809v1
PDF https://arxiv.org/pdf/1906.04809v1.pdf
PWC https://paperswithcode.com/paper/190604809
Repo
Framework
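
The MixUp idea described in the abstract above (training on interpolations of sample pairs) can be sketched as follows. This is a minimal illustration, not the authors' training code: images are represented as flat lists of pixel values, and the names `mixup_pair` and `sample_lambda`, as well as the Beta(alpha, alpha) sampling with `alpha=1.0`, are assumptions taken from the generic MixUp recipe rather than from this paper.

```python
import random


def mixup_pair(lr_a, hr_a, lr_b, hr_b, lam):
    """Blend two (low-res, high-res) training pairs with the same weight lam.

    Using one weight for both the input and the target keeps the
    blended pair consistent: mixed input maps to mixed target.
    """
    def mix(u, v):
        return [lam * x + (1.0 - lam) * y for x, y in zip(u, v)]

    return mix(lr_a, lr_b), mix(hr_a, hr_b)


def sample_lambda(alpha=1.0, rng=random):
    """Draw the mixing weight from Beta(alpha, alpha); alpha is a tunable assumption."""
    return rng.betavariate(alpha, alpha)


if __name__ == "__main__":
    lam = sample_lambda()
    lr_mix, hr_mix = mixup_pair([0.0, 2.0], [0.0, 4.0], [2.0, 0.0], [4.0, 0.0], lam)
    print(lam, lr_mix, hr_mix)
```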

Fine-Grained Neural Architecture Search

Title Fine-Grained Neural Architecture Search
Authors Heewon Kim, Seokil Hong, Bohyung Han, Heesoo Myeong, Kyoung Mu Lee
Abstract We present an elegant framework of fine-grained neural architecture search (FGNAS), which allows multiple heterogeneous operations to be employed within a single layer and can even generate compositional feature maps using several different base operations. FGNAS runs efficiently in spite of a significantly larger search space than other methods because it trains networks end-to-end by a stochastic gradient descent method. Moreover, the proposed framework allows the network to be optimized under predefined resource constraints in terms of number of parameters, FLOPs and latency. FGNAS has been applied to two crucial applications in resource demanding computer vision tasks (large-scale image classification and image super-resolution) and demonstrates state-of-the-art performance through flexible operation search and channel pruning.
Tasks Image Classification, Image Super-Resolution, Neural Architecture Search, Super-Resolution
Published 2019-11-18
URL https://arxiv.org/abs/1911.07478v1
PDF https://arxiv.org/pdf/1911.07478v1.pdf
PWC https://paperswithcode.com/paper/fine-grained-neural-architecture-search
Repo
Framework

A Comprehensive Analysis of Twitter Trending Topics

Title A Comprehensive Analysis of Twitter Trending Topics
Authors Issa Annamoradnejad, Jafar Habibi
Abstract Twitter is among the most used microblogging and online social networking services. In Twitter, a name, phrase, or topic that is mentioned at a greater rate than others is called a “trending topic” or simply “trend”. Twitter trends have shown their powerful ability in many public events, elections and market changes. Nevertheless, there have been very few works focusing on understanding the dynamics of these trending topics. In this article, we thoroughly examined Twitter’s trending topics of 2018. To this end, we accessed Twitter’s trends API for the full year of 2018 and devised six criteria to analyze our dataset. These six criteria are: lexical analysis, time to reach, trend reoccurrence, trending time, tweets count, and language analysis. In addition to providing general statistics and top trending topics regarding each criterion, we computed several distributions that explain this bulk of data.
Tasks Lexical Analysis
Published 2019-07-21
URL https://arxiv.org/abs/1907.09007v1
PDF https://arxiv.org/pdf/1907.09007v1.pdf
PWC https://paperswithcode.com/paper/a-comprehensive-analysis-of-twitter-trending
Repo
Framework

Learning Causality: Synthesis of Large-Scale Causal Networks from High-Dimensional Time Series Data

Title Learning Causality: Synthesis of Large-Scale Causal Networks from High-Dimensional Time Series Data
Authors Mark-Oliver Stehr, Peter Avar, Andrew R. Korte, Lida Parvin, Ziad J. Sahab, Deborah I. Bunin, Merrill Knapp, Denise Nishita, Andrew Poggio, Carolyn L. Talcott, Brian M. Davis, Christine A. Morton, Christopher J. Sevinsky, Maria I. Zavodszky, Akos Vertes
Abstract There is an abundance of complex dynamic systems that are critical to our daily lives and our society but that are hardly understood, and even with today’s possibilities to sense and collect large amounts of experimental data, they are so complex and continuously evolving that it is unlikely that their dynamics will ever be understood in full detail. Nevertheless, through computational tools we can try to make the best possible use of the current technologies and available data. We believe that the most useful models will have to take into account the imbalance between system complexity and available data in the context of limited knowledge or multiple hypotheses. The complex system of biological cells is a prime example of such a system that is studied in systems biology and has motivated the methods presented in this paper. They were developed as part of the DARPA Rapid Threat Assessment (RTA) program, which is concerned with understanding of the mechanism of action (MoA) of toxins or drugs affecting human cells. Using a combination of Gaussian processes and abstract network modeling, we present three fundamentally different machine-learning-based approaches to learn causal relations and synthesize causal networks from high-dimensional time series data. While other types of data are available and have been analyzed and integrated in our RTA work, we focus on transcriptomics (that is gene expression) data obtained from high-throughput microarray experiments in this paper to illustrate capabilities and limitations of our algorithms. Our algorithms make different but overall relatively few biological assumptions, so that they are applicable to other types of biological data and potentially even to other complex systems that exhibit high dimensionality but are not of biological nature.
Tasks Gaussian Processes, Time Series
Published 2019-05-06
URL https://arxiv.org/abs/1905.02291v1
PDF https://arxiv.org/pdf/1905.02291v1.pdf
PWC https://paperswithcode.com/paper/learning-causality-synthesis-of-large-scale
Repo
Framework

Weakly Supervised Open-set Domain Adaptation by Dual-domain Collaboration

Title Weakly Supervised Open-set Domain Adaptation by Dual-domain Collaboration
Authors Shuhan Tan, Jiening Jiao, Wei-Shi Zheng
Abstract In conventional domain adaptation, a critical assumption is that there exists a fully labeled domain (source) that contains the same label space as another unlabeled or scarcely labeled domain (target). However, in the real world, there often exist application scenarios in which both domains are partially labeled and not all classes are shared between these two domains. Thus, it is meaningful to let partially labeled domains learn from each other to classify all the unlabeled samples in each domain under an open-set setting. We consider this problem as weakly supervised open-set domain adaptation. To address this practical setting, we propose the Collaborative Distribution Alignment (CDA) method, which performs knowledge transfer bilaterally and works collaboratively to classify unlabeled data and identify outlier samples. Extensive experiments on the Office benchmark and an application on person reidentification show that our method achieves state-of-the-art performance.
Tasks Domain Adaptation, Transfer Learning
Published 2019-04-30
URL http://arxiv.org/abs/1904.13179v1
PDF http://arxiv.org/pdf/1904.13179v1.pdf
PWC https://paperswithcode.com/paper/weakly-supervised-open-set-domain-adaptation
Repo
Framework

Multitask Text-to-Visual Embedding with Titles and Clickthrough Data

Title Multitask Text-to-Visual Embedding with Titles and Clickthrough Data
Authors Pranav Aggarwal, Zhe Lin, Baldo Faieta, Saeid Motiian
Abstract Text-visual (also called semantic-visual) embedding is a central problem in vision-language research. It typically involves mapping of an image and a text description to a common feature space through a CNN image encoder and an RNN language encoder. In this paper, we propose a new method for learning text-visual embedding using both image titles and click-through data from an image search engine. We also propose a new triplet loss function by modeling positive awareness of the embedding, and introduce a novel mini-batch-based hard negative sampling approach for better data efficiency in the learning process. Experimental results show that our proposed method outperforms existing methods, and is also effective for real-world text-to-visual retrieval.
Tasks Image Retrieval
Published 2019-05-30
URL https://arxiv.org/abs/1905.13339v1
PDF https://arxiv.org/pdf/1905.13339v1.pdf
PWC https://paperswithcode.com/paper/multitask-text-to-visual-embedding-with
Repo
Framework
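
To make the triplet-loss and mini-batch hard-negative ideas in the abstract above concrete, here is a generic sketch. It is not the paper's positive-aware loss, whose exact form the abstract does not give; it is the standard hinge-style triplet loss with the hardest in-batch negative. The function names, the dot-product similarity, and `margin=0.2` are all assumptions for illustration.

```python
def dot(u, v):
    """Dot-product similarity between two embedding vectors."""
    return sum(a * b for a, b in zip(u, v))


def hardest_negative_triplet_loss(text_emb, image_emb, margin=0.2):
    """Mean hinge loss over a mini-batch of matched (text, image) pairs.

    Row i of text_emb matches row i of image_emb. For each text, the
    negative is the highest-scoring non-matching image in the batch.
    Requires a batch of at least two pairs.
    """
    n = len(text_emb)
    total = 0.0
    for i in range(n):
        pos = dot(text_emb[i], image_emb[i])
        hardest = max(dot(text_emb[i], image_emb[j]) for j in range(n) if j != i)
        total += max(0.0, margin + hardest - pos)
    return total / n


if __name__ == "__main__":
    texts = [[1.0, 0.0], [0.0, 1.0]]
    images = [[1.0, 0.0], [0.0, 1.0]]
    print(hardest_negative_triplet_loss(texts, images))  # well-separated batch
```

Picking the hardest negative inside the batch avoids a separate mining pass over the dataset, which is one common reading of "mini-batch-based hard negative sampling".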

Making the Last Iterate of SGD Information Theoretically Optimal

Title Making the Last Iterate of SGD Information Theoretically Optimal
Authors Prateek Jain, Dheeraj Nagaraj, Praneeth Netrapalli
Abstract Stochastic gradient descent (SGD) is one of the most widely used algorithms for large scale optimization problems. While classical theoretical analysis of SGD for convex problems studies (suffix) averages of iterates and obtains information theoretically optimal bounds on suboptimality, the last point of SGD is, by far, the most preferred choice in practice. The best known results for the last point of SGD (Shamir and Zhang, 2013), however, are suboptimal compared to information theoretic lower bounds by a $\log T$ factor, where $T$ is the number of iterations. Harvey et al. (2018) show that, in fact, this additional $\log T$ factor is tight for standard step size sequences of $\Theta\left(\frac{1}{\sqrt{t}}\right)$ and $\Theta\left(\frac{1}{t}\right)$ for non-strongly convex and strongly convex settings, respectively. Similarly, even for subgradient descent (GD) when applied to non-smooth, convex functions, the best known step-size sequences still lead to $O(\log T)$-suboptimal convergence rates (on the final iterate). The main contribution of this work is to design new step size sequences that enjoy information theoretically optimal bounds on the suboptimality of the last point of SGD as well as GD. We achieve this by designing a modification scheme that converts one sequence of step sizes to another so that the last point of SGD/GD with the modified sequence has the same suboptimality guarantees as the average of SGD/GD with the original sequence. We also show that our result holds with high probability. We validate our results through simulations which demonstrate that the new step size sequence indeed improves the final iterate significantly compared to the standard step size sequences.
Tasks
Published 2019-04-29
URL https://arxiv.org/abs/1904.12443v2
PDF https://arxiv.org/pdf/1904.12443v2.pdf
PWC https://paperswithcode.com/paper/making-the-last-iterate-of-sgd-information
Repo
Framework
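
The two quantities the abstract above compares, the last iterate and a suffix average under a standard step size sequence, can be sketched on a toy problem. The code below runs subgradient descent on f(x) = |x| with steps c/sqrt(t); it does not implement the paper's modified step size sequence, which the abstract does not specify. The function names, the toy objective, and the default `fraction=0.5` are assumptions for illustration.

```python
import math


def subgradient_descent_abs(x0, num_steps, c=1.0):
    """Run x_{t+1} = x_t - (c / sqrt(t)) * g_t on f(x) = |x|.

    g_t is a subgradient of |x| at x_t (sign(x_t), taken as 0 at x_t == 0).
    Returns the list of iterates after each of the num_steps updates.
    """
    x = x0
    iterates = []
    for t in range(1, num_steps + 1):
        g = (x > 0) - (x < 0)
        x -= (c / math.sqrt(t)) * g
        iterates.append(x)
    return iterates


def suffix_average(iterates, fraction=0.5):
    """Average of the last `fraction` of the iterates (at least one)."""
    k = max(1, int(len(iterates) * fraction))
    tail = iterates[-k:]
    return sum(tail) / len(tail)


if __name__ == "__main__":
    xs = subgradient_descent_abs(0.75, 1000)
    print("f(last iterate)   =", abs(xs[-1]))
    print("f(suffix average) =", abs(suffix_average(xs)))
```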

Multispectral snapshot demosaicing via non-convex matrix completion

Title Multispectral snapshot demosaicing via non-convex matrix completion
Authors Giancarlo A. Antonucci, Simon Vary, David Humphreys, Robert A. Lamb, Jonathan Piper, Jared Tanner
Abstract Snapshot mosaic multispectral imagery acquires an undersampled data cube by acquiring a single spectral measurement per spatial pixel. Sensors which acquire $p$ frequencies, therefore, suffer from severe $1/p$ undersampling of the full data cube. We show that the missing entries can be accurately imputed using non-convex techniques from sparse approximation and matrix completion initialised with traditional demosaicing algorithms. In particular, we observe the peak signal-to-noise ratio can typically be improved by 2 to 5 dB over current state-of-the-art methods when simulating a $p=16$ mosaic sensor measuring both high and low altitude urban and rural scenes as well as ground-based scenes.
Tasks Demosaicking, Matrix Completion
Published 2019-02-28
URL http://arxiv.org/abs/1902.11032v2
PDF http://arxiv.org/pdf/1902.11032v2.pdf
PWC https://paperswithcode.com/paper/multilspectral-snapshot-demosaicing-via-non
Repo
Framework
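
The $1/p$ undersampling mentioned in the abstract above can be illustrated with a sampling mask. The sketch assumes a 4×4 repeating filter pattern with bands assigned in row-major order for $p=16$; real snapshot sensors may arrange their bands differently, and the function names are hypothetical.

```python
def mosaic_band_index(row, col, pattern=4):
    """Band observed at pixel (row, col) for a pattern x pattern repeating mosaic.

    Assumes row-major band numbering inside each tile (an illustrative choice).
    """
    return (row % pattern) * pattern + (col % pattern)


def sampling_mask(height, width, band, pattern=4):
    """Boolean mask marking which pixels of one spectral band are measured."""
    return [
        [mosaic_band_index(r, c, pattern) == band for c in range(width)]
        for r in range(height)
    ]


if __name__ == "__main__":
    mask = sampling_mask(8, 8, band=5)
    observed = sum(sum(row) for row in mask)
    print("observed fraction for one band:", observed / (8 * 8))
```

Each band is seen at only one pixel per tile, so every slice of the data cube has a fraction 1/p of its entries observed; the remaining entries are what a completion method has to impute.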