January 28, 2020

2961 words 14 mins read

Paper Group ANR 1042

Paper Group ANR 1042

Video Captioning with Text-based Dynamic Attention and Step-by-Step Learning. An adaptive stochastic optimization algorithm for resource allocation. Safe Sample Screening for Robust Support Vector Machine. The Bach Doodle: Approachable music composition with machine learning at scale. Semantics of higher-order probabilistic programs with conditioni …

Video Captioning with Text-based Dynamic Attention and Step-by-Step Learning

Title Video Captioning with Text-based Dynamic Attention and Step-by-Step Learning
Authors Huanhou Xiao, Jinglun Shi
Abstract Automatically describing video content with natural language has been attracting much attention in CV and NLP communities. Most existing methods predict one word at a time, and by feeding the last generated word back as input at the next time, while the other generated words are not fully exploited. Furthermore, traditional methods optimize the model using all the training samples in each epoch without considering their learning situations, which leads to a lot of unnecessary training and can not target the difficult samples. To address these issues, we propose a text-based dynamic attention model named TDAM, which imposes a dynamic attention mechanism on all the generated words with the motivation to improve the context semantic information and enhance the overall control of the whole sentence. Moreover, the text-based dynamic attention mechanism and the visual attention mechanism are linked together to focus on the important words. They can benefit from each other during training. Accordingly, the model is trained through two steps: “starting from scratch” and “checking for gaps”. The former uses all the samples to optimize the model, while the latter only trains for samples with poor control. Experimental results on the popular datasets MSVD and MSR-VTT demonstrate that our non-ensemble model outperforms the state-of-the-art video captioning benchmarks.
Tasks Video Captioning
Published 2019-11-05
URL https://arxiv.org/abs/1911.01857v1
PDF https://arxiv.org/pdf/1911.01857v1.pdf
PWC https://paperswithcode.com/paper/video-captioning-with-text-based-dynamic
Repo
Framework

An adaptive stochastic optimization algorithm for resource allocation

Title An adaptive stochastic optimization algorithm for resource allocation
Authors Xavier Fontaine, Shie Mannor, Vianney Perchet
Abstract We consider the classical problem of sequential resource allocation where a decision maker must repeatedly divide a budget between several resources, each with diminishing returns. This can be recast as a specific stochastic optimization problem where the objective is to maximize the cumulative reward, or equivalently to minimize the regret. We construct an algorithm that is {\em adaptive} to the complexity of the problem, expressed in term of the regularity of the returns of the resources, measured by the exponent in the {\L}ojasiewicz inequality (or by their universal concavity parameter). Our parameter-independent algorithm recovers the optimal rates for strongly-concave functions and the classical fast rates of multi-armed bandit (for linear reward functions). Moreover, the algorithm improves existing results on stochastic optimization in this regret minimization setting for intermediate cases.
Tasks Stochastic Optimization
Published 2019-02-12
URL https://arxiv.org/abs/1902.04376v3
PDF https://arxiv.org/pdf/1902.04376v3.pdf
PWC https://paperswithcode.com/paper/a-problem-adaptive-algorithm-for-resource
Repo
Framework

Safe Sample Screening for Robust Support Vector Machine

Title Safe Sample Screening for Robust Support Vector Machine
Authors Zhou Zhai, Bin Gu, Xiang Li, Heng Huang
Abstract Robust support vector machine (RSVM) has been shown to perform remarkably well to improve the generalization performance of support vector machine under the noisy environment. Unfortunately, in order to handle the non-convexity induced by ramp loss in RSVM, existing RSVM solvers often adopt the DC programming framework which is computationally inefficient for running multiple outer loops. This hinders the application of RSVM to large-scale problems. Safe sample screening that allows for the exclusion of training samples prior to or early in the training process is an effective method to greatly reduce computational time. However, existing safe sample screening algorithms are limited to convex optimization problems while RSVM is a non-convex problem. To address this challenge, in this paper, we propose two safe sample screening rules for RSVM based on the framework of concave-convex procedure (CCCP). Specifically, we provide screening rule for the inner solver of CCCP and another rule for propagating screened samples between two successive solvers of CCCP. To the best of our knowledge, this is the first work of safe sample screening to a non-convex optimization problem. More importantly, we provide the security guarantee to our sample screening rules to RSVM. Experimental results on a variety of benchmark datasets verify that our safe sample screening rules can significantly reduce the computational time.
Tasks
Published 2019-12-24
URL https://arxiv.org/abs/1912.11217v1
PDF https://arxiv.org/pdf/1912.11217v1.pdf
PWC https://paperswithcode.com/paper/safe-sample-screening-for-robust-support
Repo
Framework

The Bach Doodle: Approachable music composition with machine learning at scale

Title The Bach Doodle: Approachable music composition with machine learning at scale
Authors Cheng-Zhi Anna Huang, Curtis Hawthorne, Adam Roberts, Monica Dinculescu, James Wexler, Leon Hong, Jacob Howcroft
Abstract To make music composition more approachable, we designed the first AI-powered Google Doodle, the Bach Doodle, where users can create their own melody and have it harmonized by a machine learning model Coconet (Huang et al., 2017) in the style of Bach. For users to input melodies, we designed a simplified sheet-music based interface. To support an interactive experience at scale, we re-implemented Coconet in TensorFlow.js (Smilkov et al., 2019) to run in the browser and reduced its runtime from 40s to 2s by adopting dilated depth-wise separable convolutions and fusing operations. We also reduced the model download size to approximately 400KB through post-training weight quantization. We calibrated a speed test based on partial model evaluation time to determine if the harmonization request should be performed locally or sent to remote TPU servers. In three days, people spent 350 years worth of time playing with the Bach Doodle, and Coconet received more than 55 million queries. Users could choose to rate their compositions and contribute them to a public dataset, which we are releasing with this paper. We hope that the community finds this dataset useful for applications ranging from ethnomusicological studies, to music education, to improving machine learning models.
Tasks Quantization
Published 2019-07-14
URL https://arxiv.org/abs/1907.06637v1
PDF https://arxiv.org/pdf/1907.06637v1.pdf
PWC https://paperswithcode.com/paper/the-bach-doodle-approachable-music
Repo
Framework

Semantics of higher-order probabilistic programs with conditioning

Title Semantics of higher-order probabilistic programs with conditioning
Authors Fredrik Dahlqvist, Dexter Kozen
Abstract We present a denotational semantics for higher-order probabilistic programs in terms of linear operators between Banach spaces. Our semantics is rooted in the classical theory of Banach spaces and their tensor products, but bears similarities with the well-known Scott semantics of higher-order programs through the use ordered Banach spaces which allow definitions in terms of fixed points. Being based on a monoidal rather than cartesian closed structure, our semantics effectively treats randomness as a resource.
Tasks
Published 2019-02-28
URL http://arxiv.org/abs/1902.11189v1
PDF http://arxiv.org/pdf/1902.11189v1.pdf
PWC https://paperswithcode.com/paper/semantics-of-higher-order-probabilistic
Repo
Framework

Identifying Malicious Players in GWAP-based Disaster Monitoring Crowdsourcing System

Title Identifying Malicious Players in GWAP-based Disaster Monitoring Crowdsourcing System
Authors Changkun Ou, Yifei Zhan, Yaxi Chen
Abstract Disaster monitoring is challenging due to the lake of infrastructures in monitoring areas. Based on the theory of Game-With-A-Purpose (GWAP), this paper contributes to a novel large-scale crowdsourcing disaster monitoring system. The system analyzes tagged satellite pictures from anonymous players, and then reports aggregated and evaluated monitoring results to its stakeholders. An algorithm based on directed graph centralities is presented to address the core issues of malicious user detection and disaster level calculation. Our method can be easily applied in other human computation systems. In the end, some issues with possible solutions are discussed for our future work.
Tasks
Published 2019-09-14
URL https://arxiv.org/abs/1910.01459v1
PDF https://arxiv.org/pdf/1910.01459v1.pdf
PWC https://paperswithcode.com/paper/identifying-malicious-players-in-gwap-based
Repo
Framework

DeepMimic: Mentor-Student Unlabeled Data Based Training

Title DeepMimic: Mentor-Student Unlabeled Data Based Training
Authors Itay Mosafi, Eli David, Nathan S. Netanyahu
Abstract In this paper, we present a deep neural network (DNN) training approach called the “DeepMimic” training method. Enormous amounts of data are available nowadays for training usage. Yet, only a tiny portion of these data is manually labeled, whereas almost all of the data are unlabeled. The training approach presented utilizes, in a most simplified manner, the unlabeled data to the fullest, in order to achieve remarkable (classification) results. Our DeepMimic method uses a small portion of labeled data and a large amount of unlabeled data for the training process, as expected in a real-world scenario. It consists of a mentor model and a student model. Employing a mentor model trained on a small portion of the labeled data and then feeding it only with unlabeled data, we show how to obtain a (simplified) student model that reaches the same accuracy and loss as the mentor model, on the same test set, without using any of the original data labels in the training of the student model. Our experiments demonstrate that even on challenging classification tasks the student network architecture can be simplified significantly with a minor influence on the performance, i.e., we need not even know the original network architecture of the mentor. In addition, the time required for training the student model to reach the mentor’s performance level is shorter, as a result of a simplified architecture and more available data. The proposed method highlights the disadvantages of regular supervised training and demonstrates the benefits of a less traditional training approach.
Tasks
Published 2019-11-24
URL https://arxiv.org/abs/1912.00079v1
PDF https://arxiv.org/pdf/1912.00079v1.pdf
PWC https://paperswithcode.com/paper/deepmimic-mentor-student-unlabeled-data-based
Repo
Framework

Classification of chaotic time series with deep learning

Title Classification of chaotic time series with deep learning
Authors Nicolas Boullé, Vassilios Dallas, Yuji Nakatsukasa, D. Samaddar
Abstract We use standard deep neural networks to classify univariate time series generated by discrete and continuous dynamical systems based on their chaotic or non-chaotic behaviour. Our approach to circumvent the lack of precise models for some of the most challenging real-life applications is to train different neural networks on a data set from a dynamical system with a basic or low-dimensional phase space and then use these networks to classify univariate time series of a dynamical system with more intricate or high-dimensional phase space. We illustrate this generalisation approach using the logistic map, the sine-circle map, the Lorenz system, and the Kuramoto–Sivashinsky equation. We observe that a convolutional neural network without batch normalization layers outperforms state-of-the-art neural networks for time series classification and is able to generalise and classify time series as chaotic or not with high accuracy.
Tasks Time Series, Time Series Classification
Published 2019-07-26
URL https://arxiv.org/abs/1908.06848v3
PDF https://arxiv.org/pdf/1908.06848v3.pdf
PWC https://paperswithcode.com/paper/classification-of-chaotic-time-series-with
Repo
Framework

Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based Model

Title Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based Model
Authors Giuseppe Castellucci, Valentina Bellomaria, Andrea Favalli, Raniero Romagnoli
Abstract Intent Detection and Slot Filling are two pillar tasks in Spoken Natural Language Understanding. Common approaches adopt joint Deep Learning architectures in attention-based recurrent frameworks. In this work, we aim at exploiting the success of “recurrence-less” models for these tasks. We introduce Bert-Joint, i.e., a multi-lingual joint text classification and sequence labeling framework. The experimental evaluation over two well-known English benchmarks demonstrates the strong performances that can be obtained with this model, even when few annotated data is available. Moreover, we annotated a new dataset for the Italian language, and we observed similar performances without the need for changing the model.
Tasks Intent Detection, Slot Filling, Text Classification
Published 2019-07-05
URL https://arxiv.org/abs/1907.02884v1
PDF https://arxiv.org/pdf/1907.02884v1.pdf
PWC https://paperswithcode.com/paper/multi-lingual-intent-detection-and-slot
Repo
Framework

Automated Search for Configurations of Deep Neural Network Architectures

Title Automated Search for Configurations of Deep Neural Network Architectures
Authors Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon
Abstract Deep Neural Networks (DNNs) are intensively used to solve a wide variety of complex problems. Although powerful, such systems require manual configuration and tuning. To this end, we view DNNs as configurable systems and propose an end-to-end framework that allows the configuration, evaluation and automated search for DNN architectures. Therefore, our contribution is threefold. First, we model the variability of DNN architectures with a Feature Model (FM) that generalizes over existing architectures. Each valid configuration of the FM corresponds to a valid DNN model that can be built and trained. Second, we implement, on top of Tensorflow, an automated procedure to deploy, train and evaluate the performance of a configured model. Third, we propose a method to search for configurations and demonstrate that it leads to good DNN models. We evaluate our method by applying it on image classification tasks (MNIST, CIFAR-10) and show that, with limited amount of computation and training, our method can identify high-performing architectures (with high accuracy). We also demonstrate that we outperform existing state-of-the-art architectures handcrafted by ML researchers. Our FM and framework have been released %and are publicly available to support replication and future research.
Tasks Image Classification
Published 2019-04-09
URL http://arxiv.org/abs/1904.04612v1
PDF http://arxiv.org/pdf/1904.04612v1.pdf
PWC https://paperswithcode.com/paper/automated-search-for-configurations-of-deep
Repo
Framework

Formal Language Constraints for Markov Decision Processes

Title Formal Language Constraints for Markov Decision Processes
Authors Eleanor Quint, Dong Xu, Haluk Dogan, Zeynep Hakguder, Stephen Scott, Matthew Dwyer
Abstract In order to satisfy safety conditions, a reinforcement learned (RL) agent maybe constrained from acting freely, e.g., to prevent trajectories that might cause unwanted behavior or physical damage in a robot. We propose a general framework for augmenting a Markov decision process (MDP) with constraints that are described in formal languages over sequences of MDP states and agent actions. Constraint enforcement is implemented by filtering the allowed action set or by applying potential-based reward shaping to implement hard and soft constraint enforcement, respectively. We instantiate this framework using deterministic finite automata to encode constraints and propose methods of augmenting MDP observations with the state of the constraint automaton for learning. We empirically evaluate these methods with a variety of constraints by training Deep Q-Networks in Atari games as well as Proximal Policy Optimization in MuJoCo environments. We experimentally find that our approaches are effective in significantly reducing or eliminating constraint violations with either minimal negative or, depending on the constraint, a clear positive impact on final performance.
Tasks Atari Games
Published 2019-10-02
URL https://arxiv.org/abs/1910.01074v1
PDF https://arxiv.org/pdf/1910.01074v1.pdf
PWC https://paperswithcode.com/paper/formal-language-constraints-for-markov
Repo
Framework

CorrGAN: Sampling Realistic Financial Correlation Matrices Using Generative Adversarial Networks

Title CorrGAN: Sampling Realistic Financial Correlation Matrices Using Generative Adversarial Networks
Authors Gautier Marti
Abstract We propose a novel approach for sampling realistic financial correlation matrices. This approach is based on generative adversarial networks. Experiments demonstrate that generative adversarial networks are able to recover most of the known stylized facts about empirical correlation matrices estimated on asset returns. This is the first time such results are documented in the literature. Practical financial applications range from trading strategies enhancement to risk and portfolio stress testing. Such generative models can also help ground empirical finance deeper into science by allowing for falsifiability of statements and more objective comparison of empirical methods.
Tasks
Published 2019-10-21
URL https://arxiv.org/abs/1910.09504v2
PDF https://arxiv.org/pdf/1910.09504v2.pdf
PWC https://paperswithcode.com/paper/corrgan-sampling-realistic-financial
Repo
Framework

PSDNet and DPDNet: Efficient channel expansion, Depthwise-Pointwise-Depthwise Inverted Bottleneck Block

Title PSDNet and DPDNet: Efficient channel expansion, Depthwise-Pointwise-Depthwise Inverted Bottleneck Block
Authors Guoqing Li, Meng Zhang, Qianru Zhang, Ziyang Chen, Wenzhao Liu, Jiaojie Li, Xuzhao Shen, Jianjun Li, Zhenyu Zhu, Chau Yuen
Abstract In many real-time applications, the deployment of deep neural networks is constrained by high computational cost and efficient lightweight neural networks are widely concerned. In this paper, we propose that depthwise convolution (DWC) is used to expand the number of channels in a bottleneck block, which is more efficient than 1 x 1 convolution. The proposed Pointwise-Standard-Depthwise network (PSDNet) based on channel expansion with DWC has fewer number of parameters, less computational cost and higher accuracy than corresponding ResNet on CIFAR datasets. To design more efficient lightweight concolutional neural netwok, Depthwise-Pointwise-Depthwise inverted bottleneck block (DPD block) is proposed and DPDNet is designed by stacking DPD block. Meanwhile, the number of parameters of DPDNet is only about 60% of that of MobileNetV2 for networks with the same number of layers, but can achieve approximated accuracy. Additionally, two hyperparameters of DPDNet can make the trade-off between accuracy and computational cost, which makes DPDNet suitable for diverse tasks. Furthermore, we find the networks with more DWC layers outperform the networks with more 1x1 convolution layers, which indicates that extracting spatial information is more important than combining channel information.
Tasks
Published 2019-09-03
URL https://arxiv.org/abs/1909.01026v2
PDF https://arxiv.org/pdf/1909.01026v2.pdf
PWC https://paperswithcode.com/paper/psdnet-and-dpdnet-efficient-channel-expansion
Repo
Framework

DeepWait: Pedestrian Wait Time Estimation in Mixed Traffic Conditions Using Deep Survival Analysis

Title DeepWait: Pedestrian Wait Time Estimation in Mixed Traffic Conditions Using Deep Survival Analysis
Authors Arash Kalatian, Bilal Farooq
Abstract Pedestrian’s road crossing behaviour is one of the important aspects of urban dynamics that will be affected by the introduction of autonomous vehicles. In this study we introduce DeepSurvival, a novel framework for estimating pedestrian’s waiting time at unsignalized mid-block crosswalks in mixed traffic conditions. We exploit the strengths of deep learning in capturing the nonlinearities in the data and develop a cox proportional hazard model with a deep neural network as the log-risk function. An embedded feature selection algorithm for reducing data dimensionality and enhancing the interpretability of the network is also developed. We test our framework on a dataset collected from 160 participants using an immersive virtual reality environment. Validation results showed that with a C-index of 0.64 our proposed framework outperformed the standard cox proportional hazard-based model with a C-index of 0.58.
Tasks Autonomous Vehicles, Feature Selection, Survival Analysis
Published 2019-04-16
URL https://arxiv.org/abs/1904.11008v3
PDF https://arxiv.org/pdf/1904.11008v3.pdf
PWC https://paperswithcode.com/paper/190411008
Repo
Framework

An Information-Theoretical Approach to the Information Capacity and Cost-Effectiveness Evaluation of Color Palettes

Title An Information-Theoretical Approach to the Information Capacity and Cost-Effectiveness Evaluation of Color Palettes
Authors R. Tanju Sirmen, B. Burak Ustundag
Abstract Colors are used as effective tools of representing and transferring information. Number of colors in a palette is the direct arbiter of the information conveying capacity. Yet it should be well elaborated, since increasing the entropy by adding colors comes with its cost on decoding. Despite the possible effects upon diverse applications, a methodology for cost-effectiveness evaluation of palettes seems deficient. In this work, this need is being addressed from an information-theoretical perspective, via the articulated metrics and formulae. Besides, the proposed metrics are computed for some developed and known palettes, and observed results are evaluated.
Tasks
Published 2019-06-03
URL https://arxiv.org/abs/1906.02567v1
PDF https://arxiv.org/pdf/1906.02567v1.pdf
PWC https://paperswithcode.com/paper/190602567
Repo
Framework
comments powered by Disqus