October 18, 2019

2928 words 14 mins read

Paper Group ANR 520

Paper Group ANR 520

An End-to-End Deep Learning Histochemical Scoring System for Breast Cancer Tissue Microarray. Writing Style Invariant Deep Learning Model for Historical Manuscripts Alignment. Anomaly detection with Wasserstein GAN. Probabilistic DL Reasoning with Pinpointing Formulas: A Prolog-based Approach. Learning sparse mixtures of rankings from noisy informa …

An End-to-End Deep Learning Histochemical Scoring System for Breast Cancer Tissue Microarray

Title An End-to-End Deep Learning Histochemical Scoring System for Breast Cancer Tissue Microarray
Authors Jingxin Liu, Bolei Xu, Chi Zheng, Yuanhao Gong, Jon Garibaldi, Daniele Soria, Andew Green, Ian O. Ellis, Wenbin Zou, Guoping Qiu
Abstract One of the methods for stratifying different molecular classes of breast cancer is the Nottingham Prognostic Index Plus (NPI+) which uses breast cancer relevant biomarkers to stain tumour tissues prepared on tissue microarray (TMA). To determine the molecular class of the tumour, pathologists will have to manually mark the nuclei activity biomarkers through a microscope and use a semi-quantitative assessment method to assign a histochemical score (H-Score) to each TMA core. Manually marking positively stained nuclei is a time consuming, imprecise and subjective process which will lead to inter-observer and intra-observer discrepancies. In this paper, we present an end-to-end deep learning system which directly predicts the H-Score automatically. Our system imitates the pathologists’ decision process and uses one fully convolutional network (FCN) to extract all nuclei region (tumour and non-tumour), a second FCN to extract tumour nuclei region, and a multi-column convolutional neural network which takes the outputs of the first two FCNs and the stain intensity description image as input and acts as the high-level decision making mechanism to directly output the H-Score of the input TMA image. To the best of our knowledge, this is the first end-to-end system that takes a TMA image as input and directly outputs a clinical score. We will present experimental results which demonstrate that the H-Scores predicted by our model have very high and statistically significant correlation with experienced pathologists’ scores and that the H-Score discrepancy between our algorithm and the pathologists is on par with the inter-subject discrepancy between the pathologists.
Tasks Decision Making
Published 2018-01-19
URL http://arxiv.org/abs/1801.06288v1
PDF http://arxiv.org/pdf/1801.06288v1.pdf
PWC https://paperswithcode.com/paper/an-end-to-end-deep-learning-histochemical
Repo
Framework

Writing Style Invariant Deep Learning Model for Historical Manuscripts Alignment

Title Writing Style Invariant Deep Learning Model for Historical Manuscripts Alignment
Authors Majeed Kassis, Jumana Nassour, Jihad El-Sana
Abstract Historical manuscript alignment is a widely known problem in document analysis. Finding the differences between manuscript editions is mostly done manually. In this paper, we present a writer independent deep learning model which is trained on several writing styles, and able to achieve high detection accuracy when tested on writing styles not present in training data. We test our model using cross validation, each time we train the model on five manuscripts, and test it on the other two manuscripts, never seen in the training data. We’ve applied cross validation on seven manuscripts, netting 21 different tests, achieving average accuracy of $%92.17$. We also present a new alignment algorithm based on dynamic sized sliding window, which is able to successfully handle complex cases.
Tasks
Published 2018-06-07
URL http://arxiv.org/abs/1806.03987v1
PDF http://arxiv.org/pdf/1806.03987v1.pdf
PWC https://paperswithcode.com/paper/writing-style-invariant-deep-learning-model
Repo
Framework

Anomaly detection with Wasserstein GAN

Title Anomaly detection with Wasserstein GAN
Authors Ilyass Haloui, Jayant Sen Gupta, Vincent Feuillard
Abstract Generative adversarial networks are a class of generative algorithms that have been widely used to produce state-of-the-art samples. In this paper, we investigate GAN to perform anomaly detection on time series dataset. In order to achieve this goal, a bibliography is made focusing on theoretical properties of GAN and GAN used for anomaly detection. A Wasserstein GAN has been chosen to learn the representation of normal data distribution and a stacked encoder with the generator performs the anomaly detection. W-GAN with encoder seems to produce state of the art anomaly detection scores on MNIST dataset and we investigate its usage on multi-variate time series.
Tasks Anomaly Detection, Time Series
Published 2018-12-06
URL http://arxiv.org/abs/1812.02463v2
PDF http://arxiv.org/pdf/1812.02463v2.pdf
PWC https://paperswithcode.com/paper/anomaly-detection-with-wasserstein-gan
Repo
Framework

Probabilistic DL Reasoning with Pinpointing Formulas: A Prolog-based Approach

Title Probabilistic DL Reasoning with Pinpointing Formulas: A Prolog-based Approach
Authors Riccardo Zese, Giuseppe Cota, Evelina Lamma, Elena Bellodi, Fabrizio Riguzzi
Abstract When modeling real world domains we have to deal with information that is incomplete or that comes from sources with different trust levels. This motivates the need for managing uncertainty in the Semantic Web. To this purpose, we introduced a probabilistic semantics, named DISPONTE, in order to combine description logics with probability theory. The probability of a query can be then computed from the set of its explanations by building a Binary Decision Diagram (BDD). The set of explanations can be found using the tableau algorithm, which has to handle non-determinism. Prolog, with its efficient handling of non-determinism, is suitable for implementing the tableau algorithm. TRILL and TRILLP are systems offering a Prolog implementation of the tableau algorithm. TRILLP builds a pinpointing formula, that compactly represents the set of explanations and can be directly translated into a BDD. Both reasoners were shown to outperform state-of-the-art DL reasoners. In this paper, we present an improvement of TRILLP, named TORNADO, in which the BDD is directly built during the construction of the tableau, further speeding up the overall inference process. An experimental comparison shows the effectiveness of TORNADO. All systems can be tried online in the TRILL on SWISH web application at http://trill.ml.unife.it/.
Tasks
Published 2018-09-17
URL http://arxiv.org/abs/1809.06180v3
PDF http://arxiv.org/pdf/1809.06180v3.pdf
PWC https://paperswithcode.com/paper/probabilistic-dl-reasoning-with-pinpointing
Repo
Framework

Learning sparse mixtures of rankings from noisy information

Title Learning sparse mixtures of rankings from noisy information
Authors Anindya De, Ryan O’Donnell, Rocco Servedio
Abstract We study the problem of learning an unknown mixture of $k$ rankings over $n$ elements, given access to noisy samples drawn from the unknown mixture. We consider a range of different noise models, including natural variants of the “heat kernel” noise framework and the Mallows model. For each of these noise models we give an algorithm which, under mild assumptions, learns the unknown mixture to high accuracy and runs in $n^{O(\log k)}$ time. The best previous algorithms for closely related problems have running times which are exponential in $k$.
Tasks
Published 2018-11-03
URL http://arxiv.org/abs/1811.01216v1
PDF http://arxiv.org/pdf/1811.01216v1.pdf
PWC https://paperswithcode.com/paper/learning-sparse-mixtures-of-rankings-from
Repo
Framework

IDSGAN: Generative Adversarial Networks for Attack Generation against Intrusion Detection

Title IDSGAN: Generative Adversarial Networks for Attack Generation against Intrusion Detection
Authors Zilong Lin, Yong Shi, Zhi Xue
Abstract As an important tool in security, the intrusion detection system bears the responsibility of the defense to network attacks performed by malicious traffic. Nowadays, with the help of machine learning algorithms, the intrusion detection system develops rapidly. However, the robustness of this system is questionable when it faces the adversarial attacks. To improve the detection system, more potential attack approaches should be researched. In this paper, a framework of the generative adversarial networks, IDSGAN, is proposed to generate the adversarial attacks, which can deceive and evade the intrusion detection system. Considering that the internal structure of the detection system is unknown to attackers, adversarial attack examples perform the black-box attacks against the detection system. IDSGAN leverages a generator to transform original malicious traffic into adversarial malicious traffic. A discriminator classifies traffic examples and simulates the black-box detection system. More significantly, we only modify part of the attacks’ nonfunctional features to guarantee the validity of the intrusion. Based on the dataset NSL-KDD, the feasibility of the model is demonstrated to attack many detection systems with different attacks and the excellent results are achieved. Moreover, the robustness of IDSGAN is verified by changing the amount of the unmodified features.
Tasks Adversarial Attack, Intrusion Detection
Published 2018-09-06
URL https://arxiv.org/abs/1809.02077v3
PDF https://arxiv.org/pdf/1809.02077v3.pdf
PWC https://paperswithcode.com/paper/idsgan-generative-adversarial-networks-for
Repo
Framework

An Interpretable Model with Globally Consistent Explanations for Credit Risk

Title An Interpretable Model with Globally Consistent Explanations for Credit Risk
Authors Chaofan Chen, Kangcheng Lin, Cynthia Rudin, Yaron Shaposhnik, Sijia Wang, Tong Wang
Abstract We propose a possible solution to a public challenge posed by the Fair Isaac Corporation (FICO), which is to provide an explainable model for credit risk assessment. Rather than present a black box model and explain it afterwards, we provide a globally interpretable model that is as accurate as other neural networks. Our “two-layer additive risk model” is decomposable into subscales, where each node in the second layer represents a meaningful subscale, and all of the nonlinearities are transparent. We provide three types of explanations that are simpler than, but consistent with, the global model. One of these explanation methods involves solving a minimum set cover problem to find high-support globally-consistent explanations. We present a new online visualization tool to allow users to explore the global model and its explanations.
Tasks
Published 2018-11-30
URL http://arxiv.org/abs/1811.12615v1
PDF http://arxiv.org/pdf/1811.12615v1.pdf
PWC https://paperswithcode.com/paper/an-interpretable-model-with-globally
Repo
Framework

BDD100K: A Diverse Driving Video Database with Scalable Annotation Tooling

Title BDD100K: A Diverse Driving Video Database with Scalable Annotation Tooling
Authors Fisher Yu, Wenqi Xian, Yingying Chen, Fangchen Liu, Mike Liao, Vashisht Madhavan, Trevor Darrell
Abstract Datasets drive vision progress and autonomous driving is a critical vision application, yet existing driving datasets are impoverished in terms of visual content. Driving imagery is becoming plentiful, but annotation is slow and expensive, as annotation tools have not kept pace with the flood of data. Our first contribution is the design and implementation of a scalable annotation system that can provide a comprehensive set of image labels for large-scale driving datasets. Our second contribution is a new driving dataset, facilitated by our tooling, which is an order of magnitude larger than previous efforts, and is comprised of over 100K videos with diverse kinds of annotations including image level tagging, object bounding boxes, drivable areas, lane markings, and full-frame instance segmentation. The dataset possesses geographic, environmental, and weather diversity, which is useful for training models so that they are less likely to be surprised by new conditions. The dataset can be requested at http://bdd-data.berkeley.edu.
Tasks Autonomous Driving, Instance Segmentation, Semantic Segmentation
Published 2018-05-12
URL http://arxiv.org/abs/1805.04687v1
PDF http://arxiv.org/pdf/1805.04687v1.pdf
PWC https://paperswithcode.com/paper/bdd100k-a-diverse-driving-video-database-with
Repo
Framework

A Matrix Approach for Weighted Argumentation Frameworks: a Preliminary Report

Title A Matrix Approach for Weighted Argumentation Frameworks: a Preliminary Report
Authors Stefano Bistarelli, Alessandra Tappini, Carlo Taticchi
Abstract The assignment of weights to attacks in a classical Argumentation Framework allows to compute semantics by taking into account the different importance of each argument. We represent a Weighted Argumentation Framework by a non-binary matrix, and we characterize the basic extensions (such as w-admissible, w- stable, w-complete) by analysing sub-blocks of this matrix. Also, we show how to reduce the matrix into another one of smaller size, that is equivalent to the original one for the determination of extensions. Furthermore, we provide two algorithms that allow to build incrementally w-grounded and w-preferred extensions starting from a w-admissible extension.
Tasks
Published 2018-02-23
URL http://arxiv.org/abs/1802.08445v1
PDF http://arxiv.org/pdf/1802.08445v1.pdf
PWC https://paperswithcode.com/paper/a-matrix-approach-for-weighted-argumentation
Repo
Framework

A Scale Invariant Approach for Sparse Signal Recovery

Title A Scale Invariant Approach for Sparse Signal Recovery
Authors Yaghoub Rahimi, Chao Wang, Hongbo Dong, Yifei Lou
Abstract In this paper, we study the ratio of the $L_1 $ and $L_2 $ norms, denoted as $L_1/L_2$, to promote sparsity. Due to the non-convexity and non-linearity, there has been little attention to this scale-invariant model. Compared to popular models in the literature such as the $L_p$ model for $p\in(0,1)$ and the transformed $L_1$ (TL1), this ratio model is parameter free. Theoretically, we present a strong null space property (sNSP) and prove that any sparse vector is a local minimizer of the $L_1 /L_2 $ model provided with this sNSP condition. Computationally, we focus on a constrained formulation that can be solved via the alternating direction method of multipliers (ADMM). Experiments show that the proposed approach is comparable to the state-of-the-art methods in sparse recovery. In addition, a variant of the $L_1/L_2$ model to apply on the gradient is also discussed with a proof-of-concept example of the MRI reconstruction.
Tasks
Published 2018-12-20
URL https://arxiv.org/abs/1812.08852v4
PDF https://arxiv.org/pdf/1812.08852v4.pdf
PWC https://paperswithcode.com/paper/a-scale-invariant-approach-for-sparse-signal
Repo
Framework

Active Learning of Uniformly Accurate Inter-atomic Potentials for Materials Simulation

Title Active Learning of Uniformly Accurate Inter-atomic Potentials for Materials Simulation
Authors Linfeng Zhang, De-Ye Lin, Han Wang, Roberto Car, Weinan E
Abstract An active learning procedure called Deep Potential Generator (DP-GEN) is proposed for the construction of accurate and transferable machine learning-based models of the potential energy surface (PES) for the molecular modeling of materials. This procedure consists of three main components: exploration, generation of accurate reference data, and training. Application to the sample systems of Al, Mg and Al-Mg alloys demonstrates that DP-GEN can produce uniformly accurate PES models with a minimal number of reference data.
Tasks Active Learning
Published 2018-10-28
URL http://arxiv.org/abs/1810.11890v2
PDF http://arxiv.org/pdf/1810.11890v2.pdf
PWC https://paperswithcode.com/paper/active-learning-of-uniformly-accurate-inter
Repo
Framework

A comprehensive study of sparse representation techniques for offline signature verification

Title A comprehensive study of sparse representation techniques for offline signature verification
Authors Elias N. Zois, Dimitrios Tsourounis, Ilias Theodorakopoulos, Anastasios Kesidis, George Economou
Abstract In this work, a feature extraction method for offline signature verification is presented that harnesses the power of sparse representation in order to deliver state-of-the-art verification performance in several signature datasets like CEDAR, MCYT-75, GPDS and UTSIG. Beyond the accuracy improvements, several major parameters associated with sparse representation; such as selected configuration, dictionary size, sparsity level and positivity priors are investigated. Besides, it is evinced that 2nd order statistics of the sparse codes is a powerful pooling function for the formation of the global signature descriptor. Also, a thorough evaluation of the effects of preprocessing is introduced by an automated algorithm in order to select the optimum thinning level. Finally, a segmentation strategy which employs a special form of spatial pyramid tailored to the problem of sparse representation is presented along with the enhancing of the produced descriptor on meaningful areas of the signature as emerged from the BRISK key-point detection mechanism. The obtained state-of-the-art results on the most challenging signature datasets provide a strong indication towards the benefits of learned features, even in writer dependent (WD) scenarios with a unique model for each writer and only a few available reference samples of him/her.
Tasks
Published 2018-07-13
URL http://arxiv.org/abs/1807.05039v4
PDF http://arxiv.org/pdf/1807.05039v4.pdf
PWC https://paperswithcode.com/paper/a-comprehensive-study-of-sparse
Repo
Framework

HUMBI: A Large Multiview Dataset of Human Body Expressions

Title HUMBI: A Large Multiview Dataset of Human Body Expressions
Authors Zhixuan Yu, Jae Shin Yoon, In Kyu Lee, Prashanth Venkatesh, Jaesik Park, Jihun Yu, Hyun Soo Park
Abstract This paper presents a new large multiview dataset called HUMBI for human body expressions with natural clothing. The goal of the HUMBI is to facilitate modeling view specific appearance and geometry of gaze, face, hand, body, and garment from assorted people. 107 synchronized high-definition cameras (70 cameras facing at the front body) are used to capture 772 distinctive subjects across gender, ethnicity, age, and physical condition. With the multiview image streams, we reconstruct high fidelity body expressions using 3D mesh models, which allows representing view specific appearance using their canonical atlas. We demonstrate that the HUMBI is highly effective in learning and reconstructing a complete human model and is complementary to the existing datasets of human expressions with sparse views or limited subjects such as MPII Gaze, Multi-PIE, Human 3.6M, and Panoptic Studio datasets.
Tasks
Published 2018-12-01
URL https://arxiv.org/abs/1812.00281v2
PDF https://arxiv.org/pdf/1812.00281v2.pdf
PWC https://paperswithcode.com/paper/humbi-10-human-multiview-behavioral-imaging
Repo
Framework

Compressive Sampling Approach for Image Acquisition with Lensless Endoscope

Title Compressive Sampling Approach for Image Acquisition with Lensless Endoscope
Authors Stéphanie Guérit, Siddharth Sivankutty, Camille Scotté, John Alto Lee, Hervé Rigneault, Laurent Jacques
Abstract The lensless endoscope is a promising device designed to image tissues in vivo at the cellular scale. The traditional acquisition setup consists in raster scanning during which the focused light beam from the optical fiber illuminates sequentially each pixel of the field of view (FOV). The calibration step to focus the beam and the sampling scheme both take time. In this preliminary work, we propose a scanning method based on compressive sampling theory. The method does not rely on a focused beam but rather on the random illumination patterns generated by the single-mode fibers. Experiments are performed on synthetic data for different compression rates (from 10 to 100% of the FOV).
Tasks Calibration
Published 2018-10-29
URL http://arxiv.org/abs/1810.12286v2
PDF http://arxiv.org/pdf/1810.12286v2.pdf
PWC https://paperswithcode.com/paper/compressive-sampling-approach-for-image
Repo
Framework

EgoCoder: Intelligent Program Synthesis with Hierarchical Sequential Neural Network Model

Title EgoCoder: Intelligent Program Synthesis with Hierarchical Sequential Neural Network Model
Authors Jiawei Zhang, Limeng Cui, Fisher B. Gouza
Abstract Programming has been an important skill for researchers and practitioners in computer science and other related areas. To learn basic programing skills, a long-time systematic training is usually required for beginners. According to a recent market report, the computer software market is expected to continue expanding at an accelerating speed, but the market supply of qualified software developers can hardly meet such a huge demand. In recent years, the surge of text generation research works provides the opportunities to address such a dilemma through automatic program synthesis. In this paper, we propose to make our try to solve the program synthesis problem from a data mining perspective. To address the problem, a novel generative model, namely EgoCoder, will be introduced in this paper. EgoCoder effectively parses program code into abstract syntax trees (ASTs), where the tree nodes will contain the program code/comment content and the tree structure can capture the program logic flows. Based on a new unit model called Hsu, EgoCoder can effectively capture both the hierarchical and sequential patterns in the program ASTs. Extensive experiments will be done to compare EgoCoder with the state-of-the-art text generation methods, and the experimental results have demonstrated the effectiveness of EgoCoder in addressing the program synthesis problem.
Tasks Program Synthesis, Text Generation
Published 2018-05-22
URL http://arxiv.org/abs/1805.08747v1
PDF http://arxiv.org/pdf/1805.08747v1.pdf
PWC https://paperswithcode.com/paper/egocoder-intelligent-program-synthesis-with
Repo
Framework
comments powered by Disqus