January 28, 2020

2861 words 14 mins read

Paper Group ANR 1060

Morphology-Agnostic Visual Robotic Control. Investigating the effect of competitiveness power in estimating the average weighted price in electricity market. Deep splitting method for parabolic PDEs. Countering Noisy Labels By Learning From Auxiliary Clean Labels. Information Aggregation for Multi-Head Attention with Routing-by-Agreement. Deep mate …

Morphology-Agnostic Visual Robotic Control


Title	Morphology-Agnostic Visual Robotic Control
Authors	Brian Yang, Dinesh Jayaraman, Glen Berseth, Alexei Efros, Sergey Levine
Abstract	Existing approaches for visuomotor robotic control typically require characterizing the robot in advance by calibrating the camera or performing system identification. We propose MAVRIC, an approach that works with minimal prior knowledge of the robot’s morphology, and requires only a camera view containing the robot and its environment and an unknown control interface. MAVRIC revolves around a mutual information-based method for self-recognition, which discovers visual “control points” on the robot body within a few seconds of exploratory interaction, and these control points in turn are then used for visual servoing. MAVRIC can control robots with imprecise actuation, no proprioceptive feedback, unknown morphologies including novel tools, unknown camera poses, and even unsteady handheld cameras. We demonstrate our method on visually-guided 3D point reaching, trajectory following, and robot-to-robot imitation.
Tasks
Published	2019-12-31
URL	https://arxiv.org/abs/1912.13360v1
PDF	https://arxiv.org/pdf/1912.13360v1.pdf
PWC	https://paperswithcode.com/paper/morphology-agnostic-visual-robotic-control
Repo
Framework

Investigating the effect of competitiveness power in estimating the average weighted price in electricity market


Title	Investigating the effect of competitiveness power in estimating the average weighted price in electricity market
Authors	Naser Rostamni, Tarik A. Rashid
Abstract	This paper evaluates the impact of the power extent on price in the electricity market. The competitiveness extent of the electricity market during specific times in a day is considered to achieve this. Then, the effect of competitiveness extent on the forecasting precision of the daily power price is assessed. A price forecasting model based on multi-layer perception via back propagation with the Levenberg-Marquardt mechanism is used. The Residual Supply Index (RSI) and other variables that affect prices are used as inputs to the model to evaluate the market competitiveness. The results show that using market power indices as inputs helps to increase forecasting accuracy. Thus, the competitiveness extent of the market power in different daily time periods is a notable variable in price formation. Moreover, market players cannot ignore the explanatory power of market power in price forecasting. In this research, the real data of the electricity market from 2013 is used and the main source of data is the Grid Management Company in Iran.
Tasks
Published	2019-07-27
URL	https://arxiv.org/abs/1907.11984v1
PDF	https://arxiv.org/pdf/1907.11984v1.pdf
PWC	https://paperswithcode.com/paper/investigating-the-effect-of-competitiveness
Repo
Framework

Deep splitting method for parabolic PDEs


Title	Deep splitting method for parabolic PDEs
Authors	Christian Beck, Sebastian Becker, Patrick Cheridito, Arnulf Jentzen, Ariel Neufeld
Abstract	In this paper we introduce a numerical method for parabolic PDEs that combines operator splitting with deep learning. It divides the PDE approximation problem into a sequence of separate learning problems. Since the computational graph for each of the subproblems is comparatively small, the approach can handle extremely high-dimensional PDEs. We test the method on different examples from physics, stochastic control, and mathematical finance. In all cases, it yields very good results in up to 10,000 dimensions with short run times.
Tasks
Published	2019-07-08
URL	https://arxiv.org/abs/1907.03452v1
PDF	https://arxiv.org/pdf/1907.03452v1.pdf
PWC	https://paperswithcode.com/paper/deep-splitting-method-for-parabolic-pdes
Repo
Framework

Countering Noisy Labels By Learning From Auxiliary Clean Labels


Title	Countering Noisy Labels By Learning From Auxiliary Clean Labels
Authors	Tsung Wei Tsai, Chongxuan Li, Jun Zhu
Abstract	We consider the learning from noisy labels (NL) problem which emerges in many real-world applications. In addition to the widely-studied synthetic noise in the NL literature, we also consider the pseudo labels in semi-supervised learning (Semi-SL) as a special case of NL. For both types of noise, we argue that the generalization performance of existing methods is highly coupled with the quality of noisy labels. Therefore, we counter the problem from a novel and unified perspective: learning from the auxiliary clean labels. Specifically, we propose the Rotational-Decoupling Consistency Regularization (RDCR) framework that integrates the consistency-based methods with the self-supervised rotation task to learn noise-tolerant representations. The experiments show that RDCR achieves comparable or superior performance than the state-of-the-art methods under small noise, while outperforms the existing methods significantly when there is large noise.
Tasks
Published	2019-05-23
URL	https://arxiv.org/abs/1905.13305v2
PDF	https://arxiv.org/pdf/1905.13305v2.pdf
PWC	https://paperswithcode.com/paper/190513305
Repo
Framework

Information Aggregation for Multi-Head Attention with Routing-by-Agreement


Title	Information Aggregation for Multi-Head Attention with Routing-by-Agreement
Authors	Jian Li, Baosong Yang, Zi-Yi Dou, Xing Wang, Michael R. Lyu, Zhaopeng Tu
Abstract	Multi-head attention is appealing for its ability to jointly extract different types of information from multiple representation subspaces. Concerning the information aggregation, a common practice is to use a concatenation followed by a linear transformation, which may not fully exploit the expressiveness of multi-head attention. In this work, we propose to improve the information aggregation for multi-head attention with a more powerful routing-by-agreement algorithm. Specifically, the routing algorithm iteratively updates the proportion of how much a part (i.e. the distinct information learned from a specific subspace) should be assigned to a whole (i.e. the final output representation), based on the agreement between parts and wholes. Experimental results on linguistic probing tasks and machine translation tasks prove the superiority of the advanced information aggregation over the standard linear transformation.
Tasks	Machine Translation
Published	2019-04-05
URL	http://arxiv.org/abs/1904.03100v1
PDF	http://arxiv.org/pdf/1904.03100v1.pdf
PWC	https://paperswithcode.com/paper/information-aggregation-for-multi-head
Repo
Framework

Deep material network with cohesive layers: Multi-stage training and interfacial failure analysis


Title	Deep material network with cohesive layers: Multi-stage training and interfacial failure analysis
Authors	Zeliang Liu
Abstract	A fundamental issue in multiscale materials modeling and design is the consideration of traction-separation behavior at the interface. By enriching the deep material network (DMN) with cohesive layers, the paper presents a novel data-driven material model which enables accurate and efficient prediction of multiscale responses for heterogeneous materials with interfacial effect. In the newly invoked cohesive building block, the fitting parameters have physical meanings related to the length scale and orientation of the cohesive layer. It is shown that the enriched material network can be effectively optimized via a multi-stage training strategy, with training data generated only from linear elastic direct numerical simulation (DNS). The extrapolation capability of the method to unknown material and loading spaces is demonstrated through the debonding analysis of a unidirectional fiber-reinforced composite, where the interface behavior is governed by an irreversible softening mixed-mode cohesive law. Its predictive accuracy is validated against the nonlinear path-dependent DNS results, and the reduction in computational time is particularly significant.
Tasks
Published	2019-08-07
URL	https://arxiv.org/abs/1908.04858v2
PDF	https://arxiv.org/pdf/1908.04858v2.pdf
PWC	https://paperswithcode.com/paper/deep-material-network-with-cohesive-layers
Repo
Framework

Lifting AutoEncoders: Unsupervised Learning of a Fully-Disentangled 3D Morphable Model using Deep Non-Rigid Structure from Motion


Title	Lifting AutoEncoders: Unsupervised Learning of a Fully-Disentangled 3D Morphable Model using Deep Non-Rigid Structure from Motion
Authors	Mihir Sahasrabudhe, Zhixin Shu, Edward Bartrum, Riza Alp Guler, Dimitris Samaras, Iasonas Kokkinos
Abstract	In this work we introduce Lifting Autoencoders, a generative 3D surface-based model of object categories. We bring together ideas from non-rigid structure from motion, image formation, and morphable models to learn a controllable, geometric model of 3D categories in an entirely unsupervised manner from an unstructured set of images. We exploit the 3D geometric nature of our model and use normal information to disentangle appearance into illumination, shading and albedo. We further use weak supervision to disentangle the non-rigid shape variability of human faces into identity and expression. We combine the 3D representation with a differentiable renderer to generate RGB images and append an adversarially trained refinement network to obtain sharp, photorealistic image reconstruction results. The learned generative model can be controlled in terms of interpretable geometry and appearance factors, allowing us to perform photorealistic image manipulation of identity, expression, 3D pose, and illumination properties.
Tasks	Image Reconstruction
Published	2019-04-26
URL	http://arxiv.org/abs/1904.11960v1
PDF	http://arxiv.org/pdf/1904.11960v1.pdf
PWC	https://paperswithcode.com/paper/lifting-autoencoders-unsupervised-learning-of
Repo
Framework

3D pavement surface reconstruction using an RGB-D sensor


Title	3D pavement surface reconstruction using an RGB-D sensor
Authors	Ahmadreza Mahmoudzadeh, Sayna Firoozi Yeganeh, Amir Golroo
Abstract	A core procedure of pavement management systems is data collection. The modern technologies which are used for this purpose, such as point-based lasers and laser scanners, are too expensive to purchase, operate, and maintain. Thus, it is rarely feasible for city officials in developing countries to conduct data collection using these devices. This paper aims to introduce a cost-effective technology which can be used for pavement distress data collection and 3D pavement surface reconstruction. The applied technology in this research is the Kinect sensor which is not only cost-effective but also sufficiently precise. The Kinect sensor can register both depth and color images simultaneously. A cart is designed to mount an array of Kinect sensors. The cameras are calibrated and the slopes of collected surfaces are corrected via the Singular Value Decomposition (SVD) algorithm. Then, a procedure is proposed for stitching the RGB-D (Red Green Blue Depth) images using SURF (Speeded-up Robust Features) and MSAC (M-estimator SAmple Consensus) algorithms in order to create a 3D-structure of the pavement surface. Finally, transverse profiles are extracted and some field experiments are conducted to evaluate the reliability of the proposed approach for detecting pavement surface defects.
Tasks
Published	2019-07-09
URL	https://arxiv.org/abs/1907.04124v2
PDF	https://arxiv.org/pdf/1907.04124v2.pdf
PWC	https://paperswithcode.com/paper/3d-pavement-surface-reconstruction-using-an
Repo
Framework

Continuous Outlier Mining of Streaming Data in Flink


Title	Continuous Outlier Mining of Streaming Data in Flink
Authors	Theodoros Toliopoulos, Anastasios Gounaris, Kostas Tsichlas, Apostolos Papadopoulos, Sandra Sampaio
Abstract	In this work, we focus on distance-based outliers in a metric space, where the status of an entity as to whether it is an outlier is based on the number of other entities in its neighborhood. In recent years, several solutions have tackled the problem of distance-based outliers in data streams, where outliers must be mined continuously as new elements become available. An interesting research problem is to combine the streaming environment with massively parallel systems to provide scalable streambased algorithms. However, none of the previously proposed techniques refer to a massively parallel setting. Our proposal fills this gap and investigates the challenges in transferring state-of-the-art techniques to Apache Flink, a modern platform for intensive streaming analytics. We thoroughly present the technical challenges encountered and the alternatives that may be applied. We show speed-ups of up to 117 (resp. 2076) times over a naive parallel (resp. non-parallel) solution in Flink, by using just an ordinary four-core machine and a real-world dataset. When moving to a three-machine cluster, due to less contention, we manage to achieve both better scalability in terms of the window slide size and the data dimensionality, and even higher speed-ups, e.g., by a factor of 510. Overall, our results demonstrate that oulier mining can be achieved in an efficient and scalable manner. The resulting techniques have been made publicly available as open-source software.
Tasks
Published	2019-02-21
URL	http://arxiv.org/abs/1902.07901v1
PDF	http://arxiv.org/pdf/1902.07901v1.pdf
PWC	https://paperswithcode.com/paper/continuous-outlier-mining-of-streaming-data
Repo
Framework

Deep Reinforcement Learning Architecture for Continuous Power Allocation in High Throughput Satellites


Title	Deep Reinforcement Learning Architecture for Continuous Power Allocation in High Throughput Satellites
Authors	Juan Jose Garau Luis, Markus Guerster, Inigo del Portillo, Edward Crawley, Bruce Cameron
Abstract	In the coming years, the satellite broadband market will experience significant increases in the service demand, especially for the mobility sector, where demand is burstier. Many of the next generation of satellites will be equipped with numerous degrees of freedom in power and bandwidth allocation capabilities, making manual resource allocation impractical and inefficient. Therefore, it is desirable to automate the operation of these highly flexible satellites. This paper presents a novel power allocation approach based on Deep Reinforcement Learning (DRL) that represents the problem as continuous state and action spaces. We make use of the Proximal Policy Optimization (PPO) algorithm to optimize the allocation policy for minimum Unmet System Demand (USD) and power consumption. The performance of the algorithm is analyzed through simulations of a multibeam satellite system, which show promising results for DRL to be used as a dynamic resource allocation algorithm.
Tasks
Published	2019-06-03
URL	https://arxiv.org/abs/1906.00571v1
PDF	https://arxiv.org/pdf/1906.00571v1.pdf
PWC	https://paperswithcode.com/paper/190600571
Repo
Framework

Meta Label Correction for Learning with Weak Supervision


Title	Meta Label Correction for Learning with Weak Supervision
Authors	Guoqing Zheng, Ahmed Hassan Awadallah, Susan Dumais
Abstract	Leveraging weak or noisy supervision for building effective machine learning models has long been an important research problem. The growing need for large-scale datasets to train deep learning models has increased its importance. Weak or noisy supervision could originate from multiple sources including non-expert annotators or automatic labeling based on heuristics or user interaction signals. Previous work on modeling and correcting weak labels have been focused on various aspects, including loss correction, training instance re-weighting, etc. In this paper, we approach this problem from a novel perspective based on meta-learning. We view the label correction procedure as a meta-process and propose a new meta-learning based framework termed MLC for learning with weak supervision. Experiments with different label noise levels on multiple datasets show that MLC can achieve large improvement over previous methods incorporating weak labels for learning.
Tasks	Meta-Learning
Published	2019-11-10
URL	https://arxiv.org/abs/1911.03809v1
PDF	https://arxiv.org/pdf/1911.03809v1.pdf
PWC	https://paperswithcode.com/paper/meta-label-correction-for-learning-with-weak-1
Repo
Framework

Variable-sized input, character-level recurrent neural networks in lead generation: predicting close rates from raw user inputs


Title	Variable-sized input, character-level recurrent neural networks in lead generation: predicting close rates from raw user inputs
Authors	Giulio Giorcelli
Abstract	Predicting lead close rates is one of the most problematic tasks in the lead generation industry. In most cases, the only available data on the prospect is the self-reported information inputted by the user on the lead form and a few other data points publicly available through social media and search engine usage. All the major market niches for lead generation [1], such as insurance, health & medical and real estate, deal with life-altering decision making that no amount of data will be ever be able to describe or predict. This paper illustrates how character-level, deep long short-term memory networks can be applied to raw user inputs to help predict close rates. The output of the model is then used as an additional, highly predictive feature to significantly boost performance of lead scoring models.
Tasks	Decision Making
Published	2019-01-16
URL	http://arxiv.org/abs/1901.05115v1
PDF	http://arxiv.org/pdf/1901.05115v1.pdf
PWC	https://paperswithcode.com/paper/variable-sized-input-character-level
Repo
Framework

MAIN: Multi-Attention Instance Network for Video Segmentation


Title	MAIN: Multi-Attention Instance Network for Video Segmentation
Authors	Juan Leon Alcazar, Maria A. Bravo, Ali K. Thabet, Guillaume Jeanneret, Thomas Brox, Pablo Arbelaez, Bernard Ghanem
Abstract	Instance-level video segmentation requires a solid integration of spatial and temporal information. However, current methods rely mostly on domain-specific information (online learning) to produce accurate instance-level segmentations. We propose a novel approach that relies exclusively on the integration of generic spatio-temporal attention cues. Our strategy, named Multi-Attention Instance Network (MAIN), overcomes challenging segmentation scenarios over arbitrary videos without modelling sequence- or instance-specific knowledge. We design MAIN to segment multiple instances in a single forward pass, and optimize it with a novel loss function that favors class agnostic predictions and assigns instance-specific penalties. We achieve state-of-the-art performance on the challenging Youtube-VOS dataset and benchmark, improving the unseen Jaccard and F-Metric by 6.8% and 12.7% respectively, while operating at real-time (30.3 FPS).
Tasks	Video Semantic Segmentation
Published	2019-04-11
URL	http://arxiv.org/abs/1904.05847v1
PDF	http://arxiv.org/pdf/1904.05847v1.pdf
PWC	https://paperswithcode.com/paper/main-multi-attention-instance-network-for
Repo
Framework

The Bregman-Tweedie Classification Model


Title	The Bregman-Tweedie Classification Model
Authors	Hyenkyun Woo
Abstract	This work proposes the Bregman-Tweedie classification model and analyzes the domain structure of the extended exponential function, an extension of the classic generalized exponential function with additional scaling parameter, and related high-level mathematical structures, such as the Bregman-Tweedie loss function and the Bregman-Tweedie divergence. The base function of this divergence is the convex function of Legendre type induced from the extended exponential function. The Bregman-Tweedie loss function of the proposed classification model is the regular Legendre transformation of the Bregman-Tweedie divergence. This loss function is a polynomial parameterized function between unhinge loss and the logistic loss function. Actually, we have two sub-models of the Bregman-Tweedie classification model; H-Bregman with hinge-like loss function and L-Bregman with logistic-like loss function. Although the proposed classification model is nonconvex and unbounded, empirically, we have observed that the H-Bregman and L-Bregman outperform, in terms of the Friedman ranking, logistic regression and SVM and show reasonable performance in terms of the classification accuracy in the category of the binary linear classification problem.
Tasks
Published	2019-07-16
URL	https://arxiv.org/abs/1907.06923v1
PDF	https://arxiv.org/pdf/1907.06923v1.pdf
PWC	https://paperswithcode.com/paper/the-bregman-tweedie-classification-model
Repo
Framework

Robust Alignment for Panoramic Stitching via an Exact Rank Constraint


Title	Robust Alignment for Panoramic Stitching via an Exact Rank Constraint
Authors	Yuelong Li, Mohammad Tofighi, Vishal Monga
Abstract	We study the problem of image alignment for panoramic stitching. Unlike most existing approaches that are feature-based, our algorithm works on pixels directly, and accounts for errors across the whole images globally. Technically, we formulate the alignment problem as rank-1 and sparse matrix decomposition over transformed images, and develop an efficient algorithm for solving this challenging non-convex optimization problem. The algorithm reduces to solving a sequence of subproblems, where we analytically establish exact recovery conditions, convergence and optimality, together with convergence rate and complexity. We generalize it to simultaneously align multiple images and recover multiple homographies, extending its application scope towards vast majority of practical scenarios. Experimental results demonstrate that the proposed algorithm is capable of more accurately aligning the images and generating higher quality stitched images than state-of-the-art methods.
Tasks
Published	2019-04-01
URL	http://arxiv.org/abs/1904.04158v1
PDF	http://arxiv.org/pdf/1904.04158v1.pdf
PWC	https://paperswithcode.com/paper/robust-alignment-for-panoramic-stitching-via
Repo
Framework