July 27, 2019

2793 words 14 mins read

Paper Group ANR 552

Paper Group ANR 552

Multi-Stage Feature Selection Based Intelligent Classifier for Classification of Incipient Stage Fire in Building. Detecting the Moment of Completion: Temporal Models for Localising Action Completion. Counterfactual Causality from First Principles?. Picasso, Matisse, or a Fake? Automated Analysis of Drawings at the Stroke Level for Attribution and …

Multi-Stage Feature Selection Based Intelligent Classifier for Classification of Incipient Stage Fire in Building

Title Multi-Stage Feature Selection Based Intelligent Classifier for Classification of Incipient Stage Fire in Building
Authors Allan Melvin Andrew, Ammar Zakaria, Shaharil Mad Saad, Ali Yeon Md Shakaff
Abstract In this study, an early fire detection algorithm has been proposed based on low cost array sensing system, utilizing gas sensors, dust particles and ambient sensors such as temperature and humidity sensor. The odor or smell-print emanated from various fire sources and building construction materials at early stage are measured. For this purpose, odor profile data from five common fire sources and three common building construction materials were used to develop the classification model. Normalized feature extractions of the smell print data were performed before subjected to prediction classifier. These features represent the odor signals in the time domain. The obtained features undergo the proposed multi-stage feature selection technique and lastly, further reduced by Principal Component Analysis (PCA), a dimension reduction technique. The hybrid PCA-PNN based approach has been applied on different datasets from in-house developed system and the portable electronic nose unit. Experimental classification results show that the dimension reduction process performed by PCA has improved the classification accuracy and provided high reliability, regardless of ambient temperature and humidity variation, baseline sensor drift, the different gas concentration level and exposure towards different heating temperature range.
Tasks Dimensionality Reduction, Feature Selection
Published 2017-08-12
URL http://arxiv.org/abs/1708.08750v1
PDF http://arxiv.org/pdf/1708.08750v1.pdf
PWC https://paperswithcode.com/paper/multi-stage-feature-selection-based
Repo
Framework

Detecting the Moment of Completion: Temporal Models for Localising Action Completion

Title Detecting the Moment of Completion: Temporal Models for Localising Action Completion
Authors Farnoosh Heidarivincheh, Majid Mirmehdi, Dima Damen
Abstract Action completion detection is the problem of modelling the action’s progression towards localising the moment of completion - when the action’s goal is confidently considered achieved. In this work, we assess the ability of two temporal models, namely Hidden Markov Models (HMM) and Long-Short Term Memory (LSTM), to localise completion for six object interactions: switch, plug, open, pull, pick and drink. We use a supervised approach, where annotations of pre-completion and post-completion frames are available per action, and fine-tuned CNN features are used to train temporal models. Tested on the Action-Completion-2016 dataset, we detect completion within 10 frames of annotations for ~75% of completed action sequences using both temporal models. Results show that fine-tuned CNN features outperform hand-crafted features for localisation, and that observing incomplete instances is necessary when incomplete sequences are also present in the test set.
Tasks
Published 2017-10-06
URL http://arxiv.org/abs/1710.02310v1
PDF http://arxiv.org/pdf/1710.02310v1.pdf
PWC https://paperswithcode.com/paper/detecting-the-moment-of-completion-temporal
Repo
Framework

Counterfactual Causality from First Principles?

Title Counterfactual Causality from First Principles?
Authors Gregor Gössler, Oleg Sokolsky, Jean-Bernard Stefani
Abstract In this position paper we discuss three main shortcomings of existing approaches to counterfactual causality from the computer science perspective, and sketch lines of work to try and overcome these issues: (1) causality definitions should be driven by a set of precisely specified requirements rather than specific examples; (2) causality frameworks should support system dynamics; (3) causality analysis should have a well-understood behavior in presence of abstraction.
Tasks
Published 2017-10-10
URL http://arxiv.org/abs/1710.03393v1
PDF http://arxiv.org/pdf/1710.03393v1.pdf
PWC https://paperswithcode.com/paper/counterfactual-causality-from-first
Repo
Framework

Picasso, Matisse, or a Fake? Automated Analysis of Drawings at the Stroke Level for Attribution and Authentication

Title Picasso, Matisse, or a Fake? Automated Analysis of Drawings at the Stroke Level for Attribution and Authentication
Authors Ahmed Elgammal, Yan Kang, Milko Den Leeuw
Abstract This paper proposes a computational approach for analysis of strokes in line drawings by artists. We aim at developing an AI methodology that facilitates attribution of drawings of unknown authors in a way that is not easy to be deceived by forged art. The methodology used is based on quantifying the characteristics of individual strokes in drawings. We propose a novel algorithm for segmenting individual strokes. We designed and compared different hand-crafted and learned features for the task of quantifying stroke characteristics. We also propose and compare different classification methods at the drawing level. We experimented with a dataset of 300 digitized drawings with over 80 thousands strokes. The collection mainly consisted of drawings of Pablo Picasso, Henry Matisse, and Egon Schiele, besides a small number of representative works of other artists. The experiments shows that the proposed methodology can classify individual strokes with accuracy 70%-90%, and aggregate over drawings with accuracy above 80%, while being robust to be deceived by fakes (with accuracy 100% for detecting fakes in most settings).
Tasks
Published 2017-11-08
URL http://arxiv.org/abs/1711.03536v1
PDF http://arxiv.org/pdf/1711.03536v1.pdf
PWC https://paperswithcode.com/paper/picasso-matisse-or-a-fake-automated-analysis
Repo
Framework

Automated Diagnosis of Epilepsy Employing Multifractal Detrended Fluctuation Analysis Based Features

Title Automated Diagnosis of Epilepsy Employing Multifractal Detrended Fluctuation Analysis Based Features
Authors S Pratiher, S Chatterjee, R Bose
Abstract This contribution reports an application of MultiFractal Detrended Fluctuation Analysis, MFDFA based novel feature extraction technique for automated detection of epilepsy. In fractal geometry, Multifractal Detrended Fluctuation Analysis MFDFA is a popular technique to examine the self-similarity of a nonlinear, chaotic and noisy time series. In the present research work, EEG signals representing healthy, interictal (seizure free) and ictal activities (seizure) are acquired from an existing available database. The acquired EEG signals of different states are at first analyzed using MFDFA. To requisite the time series singularity quantification at local and global scales, a novel set of fourteen different features. Suitable feature ranking employing students t-test has been done to select the most statistically significant features which are henceforth being used as inputs to a support vector machines (SVM) classifier for the classification of different EEG signals. Eight different classification problems have been presented in this paper and it has been observed that the overall classification accuracy using MFDFA based features are reasonably satisfactory for all classification problems. The performance of the proposed method are also found to be quite commensurable and in some cases even better when compared with the results published in existing literature studied on the similar data set.
Tasks EEG, Time Series
Published 2017-04-05
URL http://arxiv.org/abs/1704.01297v1
PDF http://arxiv.org/pdf/1704.01297v1.pdf
PWC https://paperswithcode.com/paper/automated-diagnosis-of-epilepsy-employing
Repo
Framework

A Proof of Orthogonal Double Machine Learning with $Z$-Estimators

Title A Proof of Orthogonal Double Machine Learning with $Z$-Estimators
Authors Vasilis Syrgkanis
Abstract We consider two stage estimation with a non-parametric first stage and a generalized method of moments second stage, in a simpler setting than (Chernozhukov et al. 2016). We give an alternative proof of the theorem given in (Chernozhukov et al. 2016) that orthogonal second stage moments, sample splitting and $n^{1/4}$-consistency of the first stage, imply $\sqrt{n}$-consistency and asymptotic normality of second stage estimates. Our proof is for a variant of their estimator, which is based on the empirical version of the moment condition (Z-estimator), rather than a minimization of a norm of the empirical vector of moments (M-estimator). This note is meant primarily for expository purposes, rather than as a new technical contribution.
Tasks
Published 2017-04-12
URL http://arxiv.org/abs/1704.03754v2
PDF http://arxiv.org/pdf/1704.03754v2.pdf
PWC https://paperswithcode.com/paper/a-proof-of-orthogonal-double-machine-learning
Repo
Framework

A Unified Optimization View on Generalized Matching Pursuit and Frank-Wolfe

Title A Unified Optimization View on Generalized Matching Pursuit and Frank-Wolfe
Authors Francesco Locatello, Rajiv Khanna, Michael Tschannen, Martin Jaggi
Abstract Two of the most fundamental prototypes of greedy optimization are the matching pursuit and Frank-Wolfe algorithms. In this paper, we take a unified view on both classes of methods, leading to the first explicit convergence rates of matching pursuit methods in an optimization sense, for general sets of atoms. We derive sublinear ($1/t$) convergence for both classes on general smooth objectives, and linear convergence on strongly convex objectives, as well as a clear correspondence of algorithm variants. Our presented algorithms and rates are affine invariant, and do not need any incoherence or sparsity assumptions.
Tasks
Published 2017-02-21
URL http://arxiv.org/abs/1702.06457v2
PDF http://arxiv.org/pdf/1702.06457v2.pdf
PWC https://paperswithcode.com/paper/a-unified-optimization-view-on-generalized
Repo
Framework

Learning Where to Look: Data-Driven Viewpoint Set Selection for 3D Scenes

Title Learning Where to Look: Data-Driven Viewpoint Set Selection for 3D Scenes
Authors Kyle Genova, Manolis Savva, Angel X. Chang, Thomas Funkhouser
Abstract The use of rendered images, whether from completely synthetic datasets or from 3D reconstructions, is increasingly prevalent in vision tasks. However, little attention has been given to how the selection of viewpoints affects the performance of rendered training sets. In this paper, we propose a data-driven approach to view set selection. Given a set of example images, we extract statistics describing their contents and generate a set of views matching the distribution of those statistics. Motivated by semantic segmentation tasks, we model the spatial distribution of each semantic object category within an image view volume. We provide a search algorithm that generates a sampling of likely candidate views according to the example distribution, and a set selection algorithm that chooses a subset of the candidates that jointly cover the example distribution. Results of experiments with these algorithms on SUNCG indicate that they are indeed able to produce view distributions similar to an example set from NYUDv2 according to the earth mover’s distance. Furthermore, the selected views improve performance on semantic segmentation compared to alternative view selection algorithms.
Tasks Semantic Segmentation
Published 2017-04-07
URL http://arxiv.org/abs/1704.02393v1
PDF http://arxiv.org/pdf/1704.02393v1.pdf
PWC https://paperswithcode.com/paper/learning-where-to-look-data-driven-viewpoint
Repo
Framework

The Zero Resource Speech Challenge 2017

Title The Zero Resource Speech Challenge 2017
Authors Ewan Dunbar, Xuan Nga Cao, Juan Benjumea, Julien Karadayi, Mathieu Bernard, Laurent Besacier, Xavier Anguera, Emmanuel Dupoux
Abstract We describe a new challenge aimed at discovering subword and word units from raw speech. This challenge is the followup to the Zero Resource Speech Challenge 2015. It aims at constructing systems that generalize across languages and adapt to new speakers. The design features and evaluation metrics of the challenge are presented and the results of seventeen models are discussed.
Tasks
Published 2017-12-12
URL http://arxiv.org/abs/1712.04313v1
PDF http://arxiv.org/pdf/1712.04313v1.pdf
PWC https://paperswithcode.com/paper/the-zero-resource-speech-challenge-2017
Repo
Framework

k-FFNN: A priori knowledge infused Feed-forward Neural Networks

Title k-FFNN: A priori knowledge infused Feed-forward Neural Networks
Authors Sri Harsha Dumpala, Rupayan Chakraborty, Sunil Kumar Kopparapu
Abstract Recurrent neural network (RNN) are being extensively used over feed-forward neural networks (FFNN) because of their inherent capability to capture temporal relationships that exist in the sequential data such as speech. This aspect of RNN is advantageous especially when there is no a priori knowledge about the temporal correlations within the data. However, RNNs require large amount of data to learn these temporal correlations, limiting their advantage in low resource scenarios. It is not immediately clear (a) how a priori temporal knowledge can be used in a FFNN architecture (b) how a FFNN performs when provided with this knowledge about temporal correlations (assuming available) during training. The objective of this paper is to explore k-FFNN, namely a FFNN architecture that can incorporate the a priori knowledge of the temporal relationships within the data sequence during training and compare k-FFNN performance with RNN in a low resource scenario. We evaluate the performance of k-FFNN and RNN by extensive experimentation on MediaEval 2016 audio data (“Emotional Impact of Movies” task). Experimental results show that the performance of k-FFNN is comparable to RNN, and in some scenarios k-FFNN performs better than RNN when temporal knowledge is injected into FFNN architecture. The main contributions of this paper are (a) fusing a priori knowledge into FFNN architecture to construct a k-FFNN and (b) analyzing the performance of k-FFNN with respect to RNN for different size of training data.
Tasks
Published 2017-04-24
URL http://arxiv.org/abs/1704.07055v1
PDF http://arxiv.org/pdf/1704.07055v1.pdf
PWC https://paperswithcode.com/paper/k-ffnn-a-priori-knowledge-infused-feed
Repo
Framework

Specifying Non-Markovian Rewards in MDPs Using LDL on Finite Traces (Preliminary Version)

Title Specifying Non-Markovian Rewards in MDPs Using LDL on Finite Traces (Preliminary Version)
Authors Ronen Brafman, Giuseppe De Giacomo, Fabio Patrizi
Abstract In Markov Decision Processes (MDPs), the reward obtained in a state depends on the properties of the last state and action. This state dependency makes it difficult to reward more interesting long-term behaviors, such as always closing a door after it has been opened, or providing coffee only following a request. Extending MDPs to handle such non-Markovian reward function was the subject of two previous lines of work, both using variants of LTL to specify the reward function and then compiling the new model back into a Markovian model. Building upon recent progress in the theories of temporal logics over finite traces, we adopt LDLf for specifying non-Markovian rewards and provide an elegant automata construction for building a Markovian model, which extends that of previous work and offers strong minimality and compositionality guarantees.
Tasks
Published 2017-06-25
URL http://arxiv.org/abs/1706.08100v1
PDF http://arxiv.org/pdf/1706.08100v1.pdf
PWC https://paperswithcode.com/paper/specifying-non-markovian-rewards-in-mdps
Repo
Framework

Deep Automated Multi-task Learning

Title Deep Automated Multi-task Learning
Authors Davis Liang, Yan Shu
Abstract Multi-task learning (MTL) has recently contributed to learning better representations in service of various NLP tasks. MTL aims at improving the performance of a primary task, by jointly training on a secondary task. This paper introduces automated tasks, which exploit the sequential nature of the input data, as secondary tasks in an MTL model. We explore next word prediction, next character prediction, and missing word completion as potential automated tasks. Our results show that training on a primary task in parallel with a secondary automated task improves both the convergence speed and accuracy for the primary task. We suggest two methods for augmenting an existing network with automated tasks and establish better performance in topic prediction, sentiment analysis, and hashtag recommendation. Finally, we show that the MTL models can perform well on datasets that are small and colloquial by nature.
Tasks Multi-Task Learning, Sentiment Analysis
Published 2017-09-16
URL http://arxiv.org/abs/1709.05554v2
PDF http://arxiv.org/pdf/1709.05554v2.pdf
PWC https://paperswithcode.com/paper/deep-automated-multi-task-learning
Repo
Framework

Beyond Forward Shortcuts: Fully Convolutional Master-Slave Networks (MSNets) with Backward Skip Connections for Semantic Segmentation

Title Beyond Forward Shortcuts: Fully Convolutional Master-Slave Networks (MSNets) with Backward Skip Connections for Semantic Segmentation
Authors Abrar H. Abdulnabi, Stefan Winkler, Gang Wang
Abstract Recent deep CNNs contain forward shortcut connections; i.e. skip connections from low to high layers. Reusing features from lower layers that have higher resolution (location information) benefit higher layers to recover lost details and mitigate information degradation. However, during inference the lower layers do not know about high layer features, although they contain contextual high semantics that benefit low layers to adaptively extract informative features for later layers. In this paper, we study the influence of backward skip connections which are in the opposite direction to forward shortcuts, i.e. paths from high layers to low layers. To achieve this – which indeed runs counter to the nature of feed-forward networks – we propose a new fully convolutional model that consists of a pair of networks. A Slave' network is dedicated to provide the backward connections from its top layers to the Master’ network’s bottom layers. The Master network is used to produce the final label predictions. In our experiments we validate the proposed FCN model on ADE20K (ImageNet scene parsing), PASCAL-Context, and PASCAL VOC 2011 datasets.
Tasks Scene Parsing, Semantic Segmentation
Published 2017-07-18
URL http://arxiv.org/abs/1707.05537v1
PDF http://arxiv.org/pdf/1707.05537v1.pdf
PWC https://paperswithcode.com/paper/beyond-forward-shortcuts-fully-convolutional
Repo
Framework

Audio-replay attack detection countermeasures

Title Audio-replay attack detection countermeasures
Authors Galina Lavrentyeva, Sergey Novoselov, Egor Malykh, Alexander Kozlov, Oleg Kudashev, Vadim Shchemelinin
Abstract This paper presents the Speech Technology Center (STC) replay attack detection systems proposed for Automatic Speaker Verification Spoofing and Countermeasures Challenge 2017. In this study we focused on comparison of different spoofing detection approaches. These were GMM based methods, high level features extraction with simple classifier and deep learning frameworks. Experiments performed on the development and evaluation parts of the challenge dataset demonstrated stable efficiency of deep learning approaches in case of changing acoustic conditions. At the same time SVM classifier with high level features provided a substantial input in the efficiency of the resulting STC systems according to the fusion systems results.
Tasks Speaker Verification
Published 2017-05-24
URL http://arxiv.org/abs/1705.08858v1
PDF http://arxiv.org/pdf/1705.08858v1.pdf
PWC https://paperswithcode.com/paper/audio-replay-attack-detection-countermeasures
Repo
Framework
Title Neural Machine Translation Leveraging Phrase-based Models in a Hybrid Search
Authors Leonard Dahlmann, Evgeny Matusov, Pavel Petrushkov, Shahram Khadivi
Abstract In this paper, we introduce a hybrid search for attention-based neural machine translation (NMT). A target phrase learned with statistical MT models extends a hypothesis in the NMT beam search when the attention of the NMT model focuses on the source words translated by this phrase. Phrases added in this way are scored with the NMT model, but also with SMT features including phrase-level translation probabilities and a target language model. Experimental results on German->English news domain and English->Russian e-commerce domain translation tasks show that using phrase-based models in NMT search improves MT quality by up to 2.3% BLEU absolute as compared to a strong NMT baseline.
Tasks Language Modelling, Machine Translation
Published 2017-08-10
URL http://arxiv.org/abs/1708.03271v1
PDF http://arxiv.org/pdf/1708.03271v1.pdf
PWC https://paperswithcode.com/paper/neural-machine-translation-leveraging-phrase
Repo
Framework
comments powered by Disqus