October 19, 2019

3021 words 15 mins read

Paper Group ANR 257

Window detection in aerial texture images of the Berlin 3D CityGML Model. Reachability-based safe learning for optimal control problem. Automatic Assessment of Artistic Quality of Photos. Grounding Perception: A Developmental Approach to Sensorimotor Contingencies. A simple probabilistic deep generative model for learning generalizable disentangled …

Window detection in aerial texture images of the Berlin 3D CityGML Model


Title	Window detection in aerial texture images of the Berlin 3D CityGML Model
Authors	Franziska Lippoldt, Marius Erdt
Abstract	This article explores the usage of the state-of-art neural network Mask R-CNN to be used for window detection of texture files from the CityGML model of Berlin. As texture files are very irregular in terms of size, exposure settings and orientation, we use several parameter optimisation methods to improve the precision. Those textures are cropped from aerial photos, which implies that the angle of the facade, the exposure as well as contrast are calibrated towards the mean and not towards the single facade. The analysis of a single texture image with the human eye itself is challenging: A combination of window and facade estimation and perspective analysis is necessary in order to determine the facades and windows. We train and detect bounding boxes and masks from two data sets with image size 128 and 256. We explore various configuration optimisation methods and the relation of the Region Proposal Network, detected ROIs and the mask output. Our final results shows that the we can improve the average precision scores for both data set sizes, yet the initial AP score varies and leads to different resulting scores.
Tasks	Window Detection
Published	2018-12-19
URL	http://arxiv.org/abs/1812.08095v1
PDF	http://arxiv.org/pdf/1812.08095v1.pdf
PWC	https://paperswithcode.com/paper/window-detection-in-aerial-texture-images-of
Repo
Framework

Reachability-based safe learning for optimal control problem


Title	Reachability-based safe learning for optimal control problem
Authors	Stanislav Fedorov, Antonio Candelieri
Abstract	In this work we seek for an approach to integrate safety in the learning process that relies on a partly known state-space model of the system and regards the unknown dynamics as an additive bounded disturbance. We introduce a framework for safely learning a control strategy for a given system with an additive disturbance. On the basis of the known part of the model, a safe set in which the system can learn safely, the algorithm can choose optimal actions for pursuing the target set as long as the safety-preserving condition is satisfied. After some learning episodes, the disturbance can be updated based on real-world data. To this end, Gaussian Process regression is conducted on the collected disturbance samples. Since the unstable nature of the law of the real world, for example, change of friction or conductivity with the temperature, we expect to have the more robust solution of optimal control problem. For evaluation of approach described above we choose an inverted pendulum as a benchmark model. The proposed algorithm manages to learn a policy that does not violate the pre-specified safety constraints. Observed performance is improved when it was incorporated exploration set up to make sure that an optimal policy is learned everywhere in the safe set. Finally, we outline some promising directions for future research beyond the scope of this paper.
Tasks
Published	2018-11-09
URL	http://arxiv.org/abs/1811.04006v1
PDF	http://arxiv.org/pdf/1811.04006v1.pdf
PWC	https://paperswithcode.com/paper/reachability-based-safe-learning-for-optimal
Repo
Framework

Automatic Assessment of Artistic Quality of Photos


Title	Automatic Assessment of Artistic Quality of Photos
Authors	Ashish Verma, Kranthi Koukuntla, Rohit Varma, Snehasis Mukherjee
Abstract	This paper proposes a technique to assess the aesthetic quality of photographs. The goal of the study is to predict whether a given photograph is captured by professional photographers, or by common people, based on a measurement of artistic quality of the photograph. We propose a Multi-Layer-Perceptron based system to analyze some low, mid and high level image features and find their effectiveness to measure artistic quality of the image and produce a measurement of the artistic quality of the image on a scale of 10. We validate the proposed system on a large dataset, containing images downloaded from the internet. The dataset contains some images captured by professional photographers and the rest of the images captured by common people. The proposed measurement of artistic quality of images provides higher value of photo quality for the images captured by professional photographers, compared to the values provided for the other images.
Tasks
Published	2018-04-17
URL	http://arxiv.org/abs/1804.06124v1
PDF	http://arxiv.org/pdf/1804.06124v1.pdf
PWC	https://paperswithcode.com/paper/automatic-assessment-of-artistic-quality-of
Repo
Framework

Grounding Perception: A Developmental Approach to Sensorimotor Contingencies


Title	Grounding Perception: A Developmental Approach to Sensorimotor Contingencies
Authors	Alban Laflaquière, Nikolas Hemion, Michaël Garcia Ortiz, Jean-Christophe Baillie
Abstract	Sensorimotor contingency theory offers a promising account of the nature of perception, a topic rarely addressed in the robotics community. We propose a developmental framework to address the problem of the autonomous acquisition of sensorimotor contingencies by a naive robot. While exploring the world, the robot internally encodes contingencies as predictive models that capture the structure they imply in its sensorimotor experience. Three preliminary applications are presented to illustrate our approach to the acquisition of perceptive abilities: discovering the environment, discovering objects, and discovering a visual field.
Tasks
Published	2018-10-03
URL	http://arxiv.org/abs/1810.01870v1
PDF	http://arxiv.org/pdf/1810.01870v1.pdf
PWC	https://paperswithcode.com/paper/grounding-perception-a-developmental-approach
Repo
Framework

A simple probabilistic deep generative model for learning generalizable disentangled representations from grouped data


Title	A simple probabilistic deep generative model for learning generalizable disentangled representations from grouped data
Authors	Haruo Hosoya
Abstract	The disentangling problem is to discover multiple complex factors of variations hidden in data. One recent approach is to take a dataset with grouping structure and separately estimate a factor common within a group (content) and a factor specific to each group member (transformation). Notably, this approach can learn to represent a continuous space of contents, which allows for generalization to data with unseen contents. In this study, we aim at cultivating this approach within probabilistic deep generative models. Motivated by technical complication in existing group-based methods, we propose a simpler probabilistic method, called group-contrastive variational autoencoders. Despite its simplicity, our approach achieves reasonable disentanglement with generalizability for three grouped datasets of 3D object images. In comparison with a previous model, although conventional qualitative evaluation shows little difference, our qualitative evaluation using few-shot classification exhibits superior performances for some datasets. We analyze the content representations from different methods and discuss their transformation-dependency and potential performance impacts.
Tasks
Published	2018-09-07
URL	http://arxiv.org/abs/1809.02383v1
PDF	http://arxiv.org/pdf/1809.02383v1.pdf
PWC	https://paperswithcode.com/paper/a-simple-probabilistic-deep-generative-model
Repo
Framework

Asynchronous, Photometric Feature Tracking using Events and Frames


Title	Asynchronous, Photometric Feature Tracking using Events and Frames
Authors	Daniel Gehrig, Henri Rebecq, Guillermo Gallego, Davide Scaramuzza
Abstract	We present a method that leverages the complementarity of event cameras and standard cameras to track visual features with low-latency. Event cameras are novel sensors that output pixel-level brightness changes, called “events”. They offer significant advantages over standard cameras, namely a very high dynamic range, no motion blur, and a latency in the order of microseconds. However, because the same scene pattern can produce different events depending on the motion direction, establishing event correspondences across time is challenging. By contrast, standard cameras provide intensity measurements (frames) that do not depend on motion direction. Our method extracts features on frames and subsequently tracks them asynchronously using events, thereby exploiting the best of both types of data: the frames provide a photometric representation that does not depend on motion direction and the events provide low-latency updates. In contrast to previous works, which are based on heuristics, this is the first principled method that uses raw intensity measurements directly, based on a generative event model within a maximum-likelihood framework. As a result, our method produces feature tracks that are both more accurate (subpixel accuracy) and longer than the state of the art, across a wide variety of scenes.
Tasks
Published	2018-07-25
URL	http://arxiv.org/abs/1807.09713v1
PDF	http://arxiv.org/pdf/1807.09713v1.pdf
PWC	https://paperswithcode.com/paper/asynchronous-photometric-feature-tracking
Repo
Framework

Early Prediction of Course Grades: Models and Feature Selection


Title	Early Prediction of Course Grades: Models and Feature Selection
Authors	Hengxuan Li, Collin F. Lynch, Tiffany Barnes
Abstract	In this paper, we compare predictive models for students’ final performance in a blended course using a set of generic features collected from the first six weeks of class. These features were extracted from students’ online homework submission logs as well as other online actions. We compare the effectiveness of 5 different ML algorithms (SVMs, Support Vector Regression, Decision Tree, Naive Bayes and K-Nearest Neighbor). We found that SVMs outperform other models and improve when compared to the baseline. This study demonstrates feasible implementations for predictive models that rely on common data from blended courses that can be used to monitor students’ progress and to tailor instruction.
Tasks	Feature Selection
Published	2018-12-03
URL	http://arxiv.org/abs/1812.00843v1
PDF	http://arxiv.org/pdf/1812.00843v1.pdf
PWC	https://paperswithcode.com/paper/early-prediction-of-course-grades-models-and
Repo
Framework

A Study of Clustering Techniques and Hierarchical Matrix Formats for Kernel Ridge Regression


Title	A Study of Clustering Techniques and Hierarchical Matrix Formats for Kernel Ridge Regression
Authors	Elizaveta Rebrova, Gustavo Chavez, Yang Liu, Pieter Ghysels, Xiaoye Sherry Li
Abstract	We present memory-efficient and scalable algorithms for kernel methods used in machine learning. Using hierarchical matrix approximations for the kernel matrix the memory requirements, the number of floating point operations, and the execution time are drastically reduced compared to standard dense linear algebra routines. We consider both the general $\mathcal{H}$ matrix hierarchical format as well as Hierarchically Semi-Separable (HSS) matrices. Furthermore, we investigate the impact of several preprocessing and clustering techniques on the hierarchical matrix compression. Effective clustering of the input leads to a ten-fold increase in efficiency of the compression. The algorithms are implemented using the STRUMPACK solver library. These results confirm that — with correct tuning of the hyperparameters — classification using kernel ridge regression with the compressed matrix does not lose prediction accuracy compared to the exact — not compressed — kernel matrix and that our approach can be extended to $\mathcal{O}(1M)$ datasets, for which computation with the full kernel matrix becomes prohibitively expensive. We present numerical experiments in a distributed memory environment up to 1,024 processors of the NERSC’s Cori supercomputer using well-known datasets to the machine learning community that range from dimension 8 up to 784.
Tasks
Published	2018-03-27
URL	http://arxiv.org/abs/1803.10274v1
PDF	http://arxiv.org/pdf/1803.10274v1.pdf
PWC	https://paperswithcode.com/paper/a-study-of-clustering-techniques-and
Repo
Framework

Dynamic Visual Analytics for Elicitation Meetings with ELICA


Title	Dynamic Visual Analytics for Elicitation Meetings with ELICA
Authors	Zahra Shakeri Hossein Abad, Munib Rahman, Abdullah Cheema, Vincenzo Gervasi, Didar Zowghi, Ken Barker
Abstract	Requirements elicitation can be very challenging in projects that require deep domain knowledge about the system at hand. As analysts have the full control over the elicitation process, their lack of knowledge about the system under study inhibits them from asking related questions and reduces the accuracy of requirements provided by stakeholders. We present ELICA, a generic interactive visual analytics tool to assist analysts during requirements elicitation process. ELICA uses a novel information extraction algorithm based on a combination of Weighted Finite State Transducers (WFSTs) (generative model) and SVMs (discriminative model). ELICA presents the extracted relevant information in an interactive GUI (including zooming, panning, and pinching) that allows analysts to explore which parts of the ongoing conversation (or specification document) match with the extracted information. In this demonstration, we show that ELICA is usable and effective in practice, and is able to extract the related information in real-time. We also demonstrate how carefully designed features in ELICA facilitate the interactive and dynamic process of information extraction.
Tasks
Published	2018-07-10
URL	http://arxiv.org/abs/1807.06076v1
PDF	http://arxiv.org/pdf/1807.06076v1.pdf
PWC	https://paperswithcode.com/paper/dynamic-visual-analytics-for-elicitation
Repo
Framework

Dissecting Contextual Word Embeddings: Architecture and Representation


Title	Dissecting Contextual Word Embeddings: Architecture and Representation
Authors	Matthew E. Peters, Mark Neumann, Luke Zettlemoyer, Wen-tau Yih
Abstract	Contextual word representations derived from pre-trained bidirectional language models (biLMs) have recently been shown to provide significant improvements to the state of the art for a wide range of NLP tasks. However, many questions remain as to how and why these models are so effective. In this paper, we present a detailed empirical study of how the choice of neural architecture (e.g. LSTM, CNN, or self attention) influences both end task accuracy and qualitative properties of the representations that are learned. We show there is a tradeoff between speed and accuracy, but all architectures learn high quality contextual representations that outperform word embeddings for four challenging NLP tasks. Additionally, all architectures learn representations that vary with network depth, from exclusively morphological based at the word embedding layer through local syntax based in the lower contextual layers to longer range semantics such coreference at the upper layers. Together, these results suggest that unsupervised biLMs, independent of architecture, are learning much more about the structure of language than previously appreciated.
Tasks	Word Embeddings
Published	2018-08-27
URL	http://arxiv.org/abs/1808.08949v2
PDF	http://arxiv.org/pdf/1808.08949v2.pdf
PWC	https://paperswithcode.com/paper/dissecting-contextual-word-embeddings
Repo
Framework

Non-deterministic Behavior of Ranking-based Metrics when Evaluating Embeddings


Title	Non-deterministic Behavior of Ranking-based Metrics when Evaluating Embeddings
Authors	Anguelos Nicolaou, Sounak Dey, Vincent Christlein, Andreas Maier, Dimosthenis Karatzas
Abstract	Embedding data into vector spaces is a very popular strategy of pattern recognition methods. When distances between embeddings are quantized, performance metrics become ambiguous. In this paper, we present an analysis of the ambiguity quantized distances introduce and provide bounds on the effect. We demonstrate that it can have a measurable effect in empirical data in state-of-the-art systems. We also approach the phenomenon from a computer security perspective and demonstrate how someone being evaluated by a third party can exploit this ambiguity and greatly outperform a random predictor without even access to the input data. We also suggest a simple solution making the performance metrics, which rely on ranking, totally deterministic and impervious to such exploits.
Tasks
Published	2018-06-19
URL	http://arxiv.org/abs/1806.07171v2
PDF	http://arxiv.org/pdf/1806.07171v2.pdf
PWC	https://paperswithcode.com/paper/non-deterministic-behavior-of-ranking-based
Repo
Framework

Sometimes You Want to Go Where Everybody Knows your Name


Title	Sometimes You Want to Go Where Everybody Knows your Name
Authors	Reuben Brasher, Nat Roth, Justin Wagle
Abstract	We introduce a new metric for measuring how well a model personalizes to a user’s specific preferences. We define personalization as a weighting between performance on user specific data and performance on a more general global dataset that represents many different users. This global term serves as a form of regularization that forces us to not overfit to individual users who have small amounts of data. In order to protect user privacy, we add the constraint that we may not centralize or share user data. We also contribute a simple experiment in which we simulate classifying sentiment for users with very distinct vocabularies. This experiment functions as an example of the tension between doing well globally on all users, and doing well on any specific individual user. It also provides a concrete example of how to employ our new metric to help reason about and resolve this tension. We hope this work can help frame and ground future work into personalization.
Tasks
Published	2018-01-30
URL	http://arxiv.org/abs/1801.10182v1
PDF	http://arxiv.org/pdf/1801.10182v1.pdf
PWC	https://paperswithcode.com/paper/sometimes-you-want-to-go-where-everybody
Repo
Framework

A Coarse-to-fine Deep Convolutional Neural Network Framework for Frame Duplication Detection and Localization in Forged Videos


Title	A Coarse-to-fine Deep Convolutional Neural Network Framework for Frame Duplication Detection and Localization in Forged Videos
Authors	Chengjiang Long, Arslan Basharat, Anthony Hoogs
Abstract	Videos can be manipulated by duplicating a sequence of consecutive frames with the goal of concealing or imitating a specific content in the same video. In this paper, we propose a novel coarse-to-fine framework based on deep Convolutional Neural Networks to automatically detect and localize such frame duplication. First, an I3D network finds coarse-level matches between candidate duplicated frame sequences and the corresponding selected original frame sequences. Then a Siamese network based on ResNet architecture identifies fine-level correspondences between an individual duplicated frame and the corresponding selected frame. We also propose a robust statistical approach to compute a video-level score indicating the likelihood of manipulation or forgery. Additionally, for providing manipulation localization information we develop an inconsistency detector based on the I3D network to distinguish the duplicated frames from the selected original frames. Quantified evaluation on two challenging video forgery datasets clearly demonstrates that this approach performs significantly better than four recent state-of-the-art methods.
Tasks	Frame Duplication Detection, Localization In Video Forgery
Published	2018-11-27
URL	https://arxiv.org/abs/1811.10762v2
PDF	https://arxiv.org/pdf/1811.10762v2.pdf
PWC	https://paperswithcode.com/paper/a-coarse-to-fine-deep-convolutional-neural
Repo
Framework

Being curious about the answers to questions: novelty search with learned attention


Title	Being curious about the answers to questions: novelty search with learned attention
Authors	Nicholas Guttenberg, Martin Biehl, Nathaniel Virgo, Ryota Kanai
Abstract	We investigate the use of attentional neural network layers in order to learn a `behavior characterization' which can be used to drive novelty search and curiosity-based policies. The space is structured towards answering a particular distribution of questions, which are used in a supervised way to train the attentional neural network. We find that in a 2d exploration task, the structure of the space successfully encodes local sensory-motor contingencies such that even a greedy local` do the most novel action’ policy with no reinforcement learning or evolution can explore the space quickly. We also apply this to a high/low number guessing game task, and find that guessing according to the learned attention profile performs active inference and can discover the correct number more quickly than an exact but passive approach.
Tasks
Published	2018-06-01
URL	http://arxiv.org/abs/1806.00201v1
PDF	http://arxiv.org/pdf/1806.00201v1.pdf
PWC	https://paperswithcode.com/paper/being-curious-about-the-answers-to-questions
Repo
Framework

GESF: A Universal Discriminative Mapping Mechanism for Graph Representation Learning


Title	GESF: A Universal Discriminative Mapping Mechanism for Graph Representation Learning
Authors	Shupeng Gui, Xiangliang Zhang, Shuang Qiu, Mingrui Wu, Jieping Ye, Ji Liu
Abstract	Graph embedding is a central problem in social network analysis and many other applications, aiming to learn the vector representation for each node. While most existing approaches need to specify the neighborhood and the dependence form to the neighborhood, which may significantly degrades the flexibility of representation, we propose a novel graph node embedding method (namely GESF) via the set function technique. Our method can 1) learn an arbitrary form of representation function from neighborhood, 2) automatically decide the significance of neighbors at different distances, and 3) be applied to heterogeneous graph embedding, which may contain multiple types of nodes. Theoretical guarantee for the representation capability of our method has been proved for general homogeneous and heterogeneous graphs and evaluation results on benchmark data sets show that the proposed GESF outperforms the state-of-the-art approaches on producing node vectors for classification tasks.
Tasks	Graph Embedding, Graph Representation Learning, Representation Learning
Published	2018-05-28
URL	http://arxiv.org/abs/1805.11182v3
PDF	http://arxiv.org/pdf/1805.11182v3.pdf
PWC	https://paperswithcode.com/paper/gesf-a-universal-discriminative-mapping
Repo
Framework