January 30, 2020

2777 words 14 mins read

Paper Group ANR 317

Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting. Evaluating Crowd Density Estimators via Their Uncertainty Bounds. Thompson Sampling with Information Relaxation Penalties. Extension of Convolutional Neural Network with General Image Processing Kernels. Fashion++: Minimal Edits for Outfit Improvement. The Function Representat …

Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting


Title	Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting
Authors	Vishwanath A Sindagi, Vishal M. Patel
Abstract	Crowd counting presents enormous challenges in the form of large variation in scales within images and across the dataset. These issues are further exacerbated in highly congested scenes. Approaches based on straightforward fusion of multi-scale features from a deep network seem to be obvious solutions to this problem. However, these fusion approaches do not yield significant improvements in the case of crowd counting in congested scenes. This is usually due to their limited abilities in effectively combining the multi-scale features for problems like crowd counting. To overcome this, we focus on how to efficiently leverage information present in different layers of the network. Specifically, we present a network that involves: (i) a multi-level bottom-top and top-bottom fusion (MBTTBF) method to combine information from shallower to deeper layers and vice versa at multiple levels, (ii) scale complementary feature extraction blocks (SCFB) involving cross-scale residual functions to explicitly enable flow of complementary features from adjacent conv layers along the fusion paths. Furthermore, in order to increase the effectiveness of the multi-scale fusion, we employ a principled way of generating scale-aware ground-truth density maps for training. Experiments conducted on three datasets that contain highly congested scenes (ShanghaiTech, UCF_CC_50, and UCF-QNRF) demonstrate that the proposed method is able to outperform several recent methods in all the datasets.
Tasks	Crowd Counting
Published	2019-08-28
URL	https://arxiv.org/abs/1908.10937v1
PDF	https://arxiv.org/pdf/1908.10937v1.pdf
PWC	https://paperswithcode.com/paper/multi-level-bottom-top-and-top-bottom-feature
Repo
Framework

Evaluating Crowd Density Estimators via Their Uncertainty Bounds


Title	Evaluating Crowd Density Estimators via Their Uncertainty Bounds
Authors	Jennifer Vandoni, Emanuel Aldea, Sylvie Le Hégarat-Mascle
Abstract	In this work, we use the Belief Function Theory which extends the probabilistic framework in order to provide uncertainty bounds to different categories of crowd density estimators. Our method allows us to compare the multi-scale performance of the estimators, and also to characterize their reliability for crowd monitoring applications requiring varying degrees of prudence.
Tasks
Published	2019-02-07
URL	http://arxiv.org/abs/1902.02831v1
PDF	http://arxiv.org/pdf/1902.02831v1.pdf
PWC	https://paperswithcode.com/paper/evaluating-crowd-density-estimators-via-their
Repo
Framework

Thompson Sampling with Information Relaxation Penalties


Title	Thompson Sampling with Information Relaxation Penalties
Authors	Seungki Min, Costis Maglaras, Ciamac C. Moallemi
Abstract	We consider a finite time horizon multi-armed bandit (MAB) problem in a Bayesian framework, for which we develop a general set of control policies that leverage ideas from information relaxations of stochastic dynamic optimization problems. In crude terms, an information relaxation allows the decision maker (DM) to have access to the future (unknown) rewards and incorporate them in her optimization problem to pick an action at time $t$, but penalizes the decision maker for using this information. In our setting, the future rewards allow the DM to better estimate the unknown mean reward parameters of the multiple arms, and optimize her sequence of actions. By picking different information penalties, the DM can construct a family of policies of increasing complexity that, for example, include Thompson Sampling and the true optimal (but intractable) policy as special cases. We systematically develop this framework of information relaxation sampling, propose an intuitive family of control policies for our motivating finite time horizon Bayesian MAB problem, and prove associated structural results and performance bounds. Numerical experiments suggest that this new class of policies performs well, in particular in settings where the finite time horizon introduces significant tension in the problem. Finally, inspired by the finite time horizon Gittins index, we propose an index policy that builds on our framework that particularly outperforms to the state-of-the-art algorithms in our numerical experiments.
Tasks
Published	2019-02-12
URL	http://arxiv.org/abs/1902.04251v1
PDF	http://arxiv.org/pdf/1902.04251v1.pdf
PWC	https://paperswithcode.com/paper/thompson-sampling-with-information-relaxation
Repo
Framework

Extension of Convolutional Neural Network with General Image Processing Kernels


Title	Extension of Convolutional Neural Network with General Image Processing Kernels
Authors	Jay Hoon Jung, Yousun Shin, YoungMin Kwon
Abstract	We applied pre-defined kernels also known as filters or masks developed for image processing to convolution neural network. Instead of letting neural networks find its own kernels, we used 41 different general-purpose kernels of blurring, edge detecting, sharpening, discrete cosine transformation, etc. for the first layer of the convolution neural networks. This architecture, thus named as general filter convolutional neural network (GFNN), can reduce training time by 30% with a better accuracy compared to the regular convolutional neural network (CNN). GFNN also can be trained to achieve 90% accuracy with only 500 samples. Furthermore, even though these kernels are not specialized for the MNIST dataset, we achieved 99.56% accuracy without ensemble nor any other special algorithms.
Tasks
Published	2019-01-16
URL	http://arxiv.org/abs/1901.07375v1
PDF	http://arxiv.org/pdf/1901.07375v1.pdf
PWC	https://paperswithcode.com/paper/extension-of-convolutional-neural-network
Repo
Framework

Fashion++: Minimal Edits for Outfit Improvement


Title	Fashion++: Minimal Edits for Outfit Improvement
Authors	Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, Kristen Grauman
Abstract	Given an outfit, what small changes would most improve its fashionability? This question presents an intriguing new vision challenge. We introduce Fashion++, an approach that proposes minimal adjustments to a full-body clothing outfit that will have maximal impact on its fashionability. Our model consists of a deep image generation neural network that learns to synthesize clothing conditioned on learned per-garment encodings. The latent encodings are explicitly factorized according to shape and texture, thereby allowing direct edits for both fit/presentation and color/patterns/material, respectively. We show how to bootstrap Web photos to automatically train a fashionability model, and develop an activation maximization-style approach to transform the input image into its more fashionable self. The edits suggested range from swapping in a new garment to tweaking its color, how it is worn (e.g., rolling up sleeves), or its fit (e.g., making pants baggier). Experiments demonstrate that Fashion++ provides successful edits, both according to automated metrics and human opinion. Project page is at http://vision.cs.utexas.edu/projects/FashionPlus.
Tasks	Image Generation
Published	2019-04-19
URL	https://arxiv.org/abs/1904.09261v3
PDF	https://arxiv.org/pdf/1904.09261v3.pdf
PWC	https://paperswithcode.com/paper/fashion-minimal-edits-for-outfit-improvement
Repo
Framework

The Function Representation of Artificial Neural Network


Title	The Function Representation of Artificial Neural Network
Authors	Zhongkui Ma
Abstract	This paper expresses the structure of artificial neural network (ANN) as a functional form, using the activation integral concept derived from the activation function. In this way, the structure of ANN can be represented by a simple function, and it is possible to find the mathematical solutions of ANN. Thus, it can be recognized that the current ANN can be placed in a more reasonable framework. Perhaps all questions about ANN will be eliminated.
Tasks
Published	2019-08-27
URL	https://arxiv.org/abs/1908.10493v2
PDF	https://arxiv.org/pdf/1908.10493v2.pdf
PWC	https://paperswithcode.com/paper/the-function-representation-of-artificial
Repo
Framework

Automatic Ambiguity Detection


Title	Automatic Ambiguity Detection
Authors	Richard Sproat, Jan van Santen
Abstract	Most work on sense disambiguation presumes that one knows beforehand – e.g. from a thesaurus – a set of polysemous terms. But published lists invariably give only partial coverage. For example, the English word tan has several obvious senses, but one may overlook the abbreviation for tangent. In this paper, we present an algorithm for identifying interesting polysemous terms and measuring their degree of polysemy, given an unlabeled corpus. The algorithm involves: (i) collecting all terms within a k-term window of the target term; (ii) computing the inter-term distances of the contextual terms, and reducing the multi-dimensional distance space to two dimensions using standard methods; (iii) converting the two-dimensional representation into radial coordinates and using isotonic/antitonic regression to compute the degree to which the distribution deviates from a single-peak model. The amount of deviation is the proposed polysemy index
Tasks
Published	2019-05-28
URL	https://arxiv.org/abs/1905.12065v1
PDF	https://arxiv.org/pdf/1905.12065v1.pdf
PWC	https://paperswithcode.com/paper/automatic-ambiguity-detection
Repo
Framework

Towards Unsupervised Segmentation of Extreme Weather Events


Title	Towards Unsupervised Segmentation of Extreme Weather Events
Authors	Adam Rupe, Karthik Kashinath, Nalini Kumar, Victor Lee, Prabhat, James P. Crutchfield
Abstract	Extreme weather is one of the main mechanisms through which climate change will directly impact human society. Coping with such change as a global community requires markedly improved understanding of how global warming drives extreme weather events. While alternative climate scenarios can be simulated using sophisticated models, identifying extreme weather events in these simulations requires automation due to the vast amounts of complex high-dimensional data produced. Atmospheric dynamics, and hydrodynamic flows more generally, are highly structured and largely organize around a lower dimensional skeleton of coherent structures. Indeed, extreme weather events are a special case of more general hydrodynamic coherent structures. We present a scalable physics-based representation learning method that decomposes spatiotemporal systems into their structurally relevant components, which are captured by latent variables known as local causal states. For complex fluid flows we show our method is capable of capturing known coherent structures, and with promising segmentation results on CAM5.1 water vapor data we outline the path to extreme weather identification from unlabeled climate model simulation data.
Tasks	Representation Learning
Published	2019-09-16
URL	https://arxiv.org/abs/1909.07520v1
PDF	https://arxiv.org/pdf/1909.07520v1.pdf
PWC	https://paperswithcode.com/paper/towards-unsupervised-segmentation-of-extreme
Repo
Framework

Noise Induces Loss Discrepancy Across Groups for Linear Regression


Title	Noise Induces Loss Discrepancy Across Groups for Linear Regression
Authors	Fereshte Khani, Percy Liang
Abstract	We study the effect of feature noise (measurement error) on the discrepancy between losses across two groups (e.g., men and women) in the context of linear regression. Our main finding is that adding even the same amount of noise on all individuals impacts groups differently. We characterize several forms of loss discrepancy in terms of the amount of noise and difference between moments of the two groups, for estimators that either do or do not use group membership information. We then study how long it takes for an estimator to adapt to a shift in the population that makes the groups have the same mean. We finally validate our results on three real-world datasets.
Tasks
Published	2019-11-22
URL	https://arxiv.org/abs/1911.09876v1
PDF	https://arxiv.org/pdf/1911.09876v1.pdf
PWC	https://paperswithcode.com/paper/noise-induces-loss-discrepancy-across-groups
Repo
Framework

Equation Discovery for Nonlinear System Identification


Title	Equation Discovery for Nonlinear System Identification
Authors	Nikola Simidjievski, Ljupčo Todorovski, Juš Kocijan, Sašo Džeroski
Abstract	Equation discovery methods enable modelers to combine domain-specific knowledge and system identification to construct models most suitable for a selected modeling task. The method described and evaluated in this paper can be used as a nonlinear system identification method for gray-box modeling. It consists of two interlaced parts of modeling that are computer-aided. The first performs computer-aided identification of a model structure composed of elements selected from user-specified domain-specific modeling knowledge, while the second part performs parameter estimation. In this paper, recent developments of the equation discovery method called process-based modeling, suited for nonlinear system identification, are elaborated and illustrated on two continuous-time case studies. The first case study illustrates the use of the process-based modeling on synthetic data while the second case-study evaluates on measured data for a standard system-identification benchmark. The experimental results clearly demonstrate the ability of process-based modeling to reconstruct both model structure and parameters from measured data.
Tasks
Published	2019-07-01
URL	https://arxiv.org/abs/1907.00821v1
PDF	https://arxiv.org/pdf/1907.00821v1.pdf
PWC	https://paperswithcode.com/paper/equation-discovery-for-nonlinear-system
Repo
Framework


Title	Cluster-wise Unsupervised Hashing for Cross-Modal Similarity Search
Authors	Lu Wang, Jie Yang
Abstract	Large-scale cross-modal hashing similarity retrieval has attracted more and more attention in modern search applications such as search engines and autopilot, showing great superiority in computation and storage. However, current unsupervised cross-modal hashing methods still have some limitations: (1)many methods relax the discrete constraints to solve the optimization objective which may significantly degrade the retrieval performance;(2)most existing hashing model project heterogenous data into a common latent space, which may always lose sight of diversity in heterogenous data;(3)transforming real-valued data point to binary codes always results in abundant loss of information, producing the suboptimal continuous latent space. To overcome above problems, in this paper, a novel Cluster-wise Unsupervised Hashing (CUH) method is proposed. Specifically, CUH jointly performs the multi-view clustering that projects the original data points from different modalities into its own low-dimensional latent semantic space and finds the cluster centroid points and the common clustering indicators in its own low-dimensional space, and learns the compact hash codes and the corresponding linear hash functions. An discrete optimization framework is developed to learn the unified binary codes across modalities under the guidance cluster-wise code-prototypes. The reasonableness and effectiveness of CUH is well demonstrated by comprehensive experiments on diverse benchmark datasets.
Tasks
Published	2019-11-11
URL	https://arxiv.org/abs/1911.07923v2
PDF	https://arxiv.org/pdf/1911.07923v2.pdf
PWC	https://paperswithcode.com/paper/cluster-wise-unsupervised-hashing-for-cross
Repo
Framework

SemEval-2017 Task 3: Community Question Answering


Title	SemEval-2017 Task 3: Community Question Answering
Authors	Preslav Nakov, Doris Hoogeveen, Lluís Màrquez, Alessandro Moschitti, Hamdy Mubarak, Timothy Baldwin, Karin Verspoor
Abstract	We describe SemEval-2017 Task 3 on Community Question Answering. This year, we reran the four subtasks from SemEval-2016:(A) Question-Comment Similarity,(B) Question-Question Similarity,(C) Question-External Comment Similarity, and (D) Rerank the correct answers for a new question in Arabic, providing all the data from 2015 and 2016 for training, and fresh data for testing. Additionally, we added a new subtask E in order to enable experimentation with Multi-domain Question Duplicate Detection in a larger-scale scenario, using StackExchange subforums. A total of 23 teams participated in the task, and submitted a total of 85 runs (36 primary and 49 contrastive) for subtasks A-D. Unfortunately, no teams participated in subtask E. A variety of approaches and features were used by the participating systems to address the different subtasks. The best systems achieved an official score (MAP) of 88.43, 47.22, 15.46, and 61.16 in subtasks A, B, C, and D, respectively. These scores are better than the baselines, especially for subtasks A-C.
Tasks	Community Question Answering, Question Answering, Question Similarity
Published	2019-12-02
URL	https://arxiv.org/abs/1912.00730v1
PDF	https://arxiv.org/pdf/1912.00730v1.pdf
PWC	https://paperswithcode.com/paper/semeval-2017-task-3-community-question-1
Repo
Framework

Multi-sequence Cardiac MR Segmentation with Adversarial Domain Adaptation Network


Title	Multi-sequence Cardiac MR Segmentation with Adversarial Domain Adaptation Network
Authors	Jiexiang Wang, Hongyu Huang, Chaoqi Chen, Wenao Ma, Yue Huang, Xinghao Ding
Abstract	Automatic and accurate segmentation of the ventricles and myocardium from multi-sequence cardiac MRI (CMR) is crucial for the diagnosis and treatment management for patients suffering from myocardial infarction (MI). However, due to the existence of domain shift among different modalities of datasets, the performance of deep neural networks drops significantly when the training and testing datasets are distinct. In this paper, we propose an unsupervised domain alignment method to explicitly alleviate the domain shifts among different modalities of CMR sequences, \emph{e.g.,} bSSFP, LGE, and T2-weighted. Our segmentation network is attention U-Net with pyramid pooling module, where multi-level feature space and output space adversarial learning are proposed to transfer discriminative domain knowledge across different datasets. Moreover, we further introduce a group-wise feature recalibration module to enforce the fine-grained semantic-level feature alignment that matching features from different networks but with the same class label. We evaluate our method on the multi-sequence cardiac MR Segmentation Challenge 2019 datasets, which contain three different modalities of MRI sequences. Extensive experimental results show that the proposed methods can obtain significant segmentation improvements compared with the baseline models.
Tasks	Domain Adaptation
Published	2019-10-28
URL	https://arxiv.org/abs/1910.12514v1
PDF	https://arxiv.org/pdf/1910.12514v1.pdf
PWC	https://paperswithcode.com/paper/multi-sequence-cardiac-mr-segmentation-with
Repo
Framework

A Rule-Based Relational XML Access Control Model in the Presence of Authorization Conflicts


Title	A Rule-Based Relational XML Access Control Model in the Presence of Authorization Conflicts
Authors	Ali Alwehaibi, Mustafa Atay
Abstract	There is considerable amount of sensitive XML data stored in relational databases. It is a challenge to enforce node level fine-grained authorization policies for XML data stored in relational databases which typically support table and column level access control. Moreover, it is common to have conflicting authorization policies over the hierarchical nested structure of XML data. There are a couple of XML access control models for relational XML databases proposed in the literature. However, to our best knowledge, none of them discussed handling authorization conflicts with conditions in the domain of relational XML databases. Therefore, we believe that there is a need to define and incorporate effective fine-grained XML authorization models with conflict handling mechanisms in the presence of conditions into relational XML databases. We address this issue in this study.
Tasks
Published	2019-09-24
URL	https://arxiv.org/abs/1909.11057v1
PDF	https://arxiv.org/pdf/1909.11057v1.pdf
PWC	https://paperswithcode.com/paper/a-rule-based-relational-xml-access-control
Repo
Framework

Evaluating the distribution learning capabilities of GANs


Title	Evaluating the distribution learning capabilities of GANs
Authors	Amit Rege, Claire Monteleoni
Abstract	We evaluate the distribution learning capabilities of generative adversarial networks by testing them on synthetic datasets. The datasets include common distributions of points in $R^n$ space and images containing polygons of various shapes and sizes. We find that by and large GANs fail to faithfully recreate point datasets which contain discontinous support or sharp bends with noise. Additionally, on image datasets, we find that GANs do not seem to learn to count the number of objects of the same kind in an image. We also highlight the apparent tension between generalization and learning in GANs.
Tasks
Published	2019-07-05
URL	https://arxiv.org/abs/1907.02662v1
PDF	https://arxiv.org/pdf/1907.02662v1.pdf
PWC	https://paperswithcode.com/paper/evaluating-the-distribution-learning
Repo
Framework