May 6, 2019

3005 words 15 mins read

Paper Group ANR 316

Accelerating the Super-Resolution Convolutional Neural Network. Deep Convolutional Neural Networks for Microscopy-Based Point of Care Diagnostics. Inverted Bilingual Topic Models for Lexicon Extraction from Non-parallel Data. Deep Learning a Grasp Function for Grasping under Gripper Pose Uncertainty. Recent Advances in Transient Imaging: A Computer …

Accelerating the Super-Resolution Convolutional Neural Network


Title	Accelerating the Super-Resolution Convolutional Neural Network
Authors	Chao Dong, Chen Change Loy, Xiaoou Tang
Abstract	As a successful deep model applied in image super-resolution (SR), the Super-Resolution Convolutional Neural Network (SRCNN) has demonstrated superior performance to the previous hand-crafted models either in speed and restoration quality. However, the high computational cost still hinders it from practical usage that demands real-time performance (24 fps). In this paper, we aim at accelerating the current SRCNN, and propose a compact hourglass-shape CNN structure for faster and better SR. We re-design the SRCNN structure mainly in three aspects. First, we introduce a deconvolution layer at the end of the network, then the mapping is learned directly from the original low-resolution image (without interpolation) to the high-resolution one. Second, we reformulate the mapping layer by shrinking the input feature dimension before mapping and expanding back afterwards. Third, we adopt smaller filter sizes but more mapping layers. The proposed model achieves a speed up of more than 40 times with even superior restoration quality. Further, we present the parameter settings that can achieve real-time performance on a generic CPU while still maintaining good performance. A corresponding transfer strategy is also proposed for fast training and testing across different upscaling factors.
Tasks	Image Super-Resolution, Super-Resolution
Published	2016-08-01
URL	http://arxiv.org/abs/1608.00367v1
PDF	http://arxiv.org/pdf/1608.00367v1.pdf
PWC	https://paperswithcode.com/paper/accelerating-the-super-resolution
Repo
Framework

Deep Convolutional Neural Networks for Microscopy-Based Point of Care Diagnostics


Title	Deep Convolutional Neural Networks for Microscopy-Based Point of Care Diagnostics
Authors	John A. Quinn, Rose Nakasi, Pius K. B. Mugagga, Patrick Byanyima, William Lubega, Alfred Andama
Abstract	Point of care diagnostics using microscopy and computer vision methods have been applied to a number of practical problems, and are particularly relevant to low-income, high disease burden areas. However, this is subject to the limitations in sensitivity and specificity of the computer vision methods used. In general, deep learning has recently revolutionised the field of computer vision, in some cases surpassing human performance for other object recognition tasks. In this paper, we evaluate the performance of deep convolutional neural networks on three different microscopy tasks: diagnosis of malaria in thick blood smears, tuberculosis in sputum samples, and intestinal parasite eggs in stool samples. In all cases accuracy is very high and substantially better than an alternative approach more representative of traditional medical imaging techniques.
Tasks	Object Recognition
Published	2016-08-09
URL	http://arxiv.org/abs/1608.02989v1
PDF	http://arxiv.org/pdf/1608.02989v1.pdf
PWC	https://paperswithcode.com/paper/deep-convolutional-neural-networks-for-5
Repo
Framework

Inverted Bilingual Topic Models for Lexicon Extraction from Non-parallel Data


Title	Inverted Bilingual Topic Models for Lexicon Extraction from Non-parallel Data
Authors	Tengfei Ma, Tetsuya Nasukawa
Abstract	Topic models have been successfully applied in lexicon extraction. However, most previous methods are limited to document-aligned data. In this paper, we try to address two challenges of applying topic models to lexicon extraction in non-parallel data: 1) hard to model the word relationship and 2) noisy seed dictionary. To solve these two challenges, we propose two new bilingual topic models to better capture the semantic information of each word while discriminating the multiple translations in a noisy seed dictionary. We extend the scope of topic models by inverting the roles of “word” and “document”. In addition, to solve the problem of noise in seed dictionary, we incorporate the probability of translation selection in our models. Moreover, we also propose an effective measure to evaluate the similarity of words in different languages and select the optimal translation pairs. Experimental results using real world data demonstrate the utility and efficacy of the proposed models.
Tasks	Topic Models
Published	2016-12-21
URL	http://arxiv.org/abs/1612.07215v2
PDF	http://arxiv.org/pdf/1612.07215v2.pdf
PWC	https://paperswithcode.com/paper/inverted-bilingual-topic-models-for-lexicon
Repo
Framework

Deep Learning a Grasp Function for Grasping under Gripper Pose Uncertainty


Title	Deep Learning a Grasp Function for Grasping under Gripper Pose Uncertainty
Authors	Edward Johns, Stefan Leutenegger, Andrew J. Davison
Abstract	This paper presents a new method for parallel-jaw grasping of isolated objects from depth images, under large gripper pose uncertainty. Whilst most approaches aim to predict the single best grasp pose from an image, our method first predicts a score for every possible grasp pose, which we denote the grasp function. With this, it is possible to achieve grasping robust to the gripper’s pose uncertainty, by smoothing the grasp function with the pose uncertainty function. Therefore, if the single best pose is adjacent to a region of poor grasp quality, that pose will no longer be chosen, and instead a pose will be chosen which is surrounded by a region of high grasp quality. To learn this function, we train a Convolutional Neural Network which takes as input a single depth image of an object, and outputs a score for each grasp pose across the image. Training data for this is generated by use of physics simulation and depth image simulation with 3D object meshes, to enable acquisition of sufficient data without requiring exhaustive real-world experiments. We evaluate with both synthetic and real experiments, and show that the learned grasp score is more robust to gripper pose uncertainty than when this uncertainty is not accounted for.
Tasks
Published	2016-08-07
URL	http://arxiv.org/abs/1608.02239v1
PDF	http://arxiv.org/pdf/1608.02239v1.pdf
PWC	https://paperswithcode.com/paper/deep-learning-a-grasp-function-for-grasping
Repo
Framework

Recent Advances in Transient Imaging: A Computer Graphics and Vision Perspective


Title	Recent Advances in Transient Imaging: A Computer Graphics and Vision Perspective
Authors	Adrian Jarabo, Belen Masia, Julio Marco, Diego Gutierrez
Abstract	Transient imaging has recently made a huge impact in the computer graphics and computer vision fields. By capturing, reconstructing, or simulating light transport at extreme temporal resolutions, researchers have proposed novel techniques to show movies of light in motion, see around corners, detect objects in highly-scattering media, or infer material properties from a distance, to name a few. The key idea is to leverage the wealth of information in the temporal domain at the pico or nanosecond resolution, information usually lost during the capture-time temporal integration. This paper presents recent advances in this field of transient imaging from a graphics and vision perspective, including capture techniques, analysis, applications and simulation.
Tasks
Published	2016-11-03
URL	http://arxiv.org/abs/1611.00939v1
PDF	http://arxiv.org/pdf/1611.00939v1.pdf
PWC	https://paperswithcode.com/paper/recent-advances-in-transient-imaging-a
Repo
Framework

Imaging around corners with single-pixel detector by computational ghost imaging


Title	Imaging around corners with single-pixel detector by computational ghost imaging
Authors	Bin Bai, Jianbin Liu, Yu Zhou, Songlin Zhang, Yuchen He, Zhuo Xu
Abstract	We have designed a single-pixel camera with imaging around corners based on computational ghost imaging. It can obtain the image of an object when the camera cannot look at the object directly. Our imaging system explores the fact that a bucket detector in a ghost imaging setup has no spatial resolution capability. A series of experiments have been designed to confirm our predictions. This camera has potential applications for imaging around corner or other similar environments where the object cannot be observed directly.
Tasks
Published	2016-12-08
URL	http://arxiv.org/abs/1612.07120v1
PDF	http://arxiv.org/pdf/1612.07120v1.pdf
PWC	https://paperswithcode.com/paper/imaging-around-corners-with-single-pixel
Repo
Framework

A Multiple Kernel Learning Approach for Human Behavioral Task Classification using STN-LFP Signal


Title	A Multiple Kernel Learning Approach for Human Behavioral Task Classification using STN-LFP Signal
Authors	Hosein M. Golshan, Adam O. Hebb, Sara J. Hanrahan, Joshua Nedrud, Mohammad H. Mahoor
Abstract	Deep Brain Stimulation (DBS) has gained increasing attention as an effective method to mitigate Parkinsons disease (PD) disorders. Existing DBS systems are open-loop such that the system parameters are not adjusted automatically based on patients behavior. Classification of human behavior is an important step in the design of the next generation of DBS systems that are closed-loop. This paper presents a classification approach to recognize such behavioral tasks using the subthalamic nucleus (STN) Local Field Potential (LFP) signals. In our approach, we use the time-frequency representation (spectrogram) of the raw LFP signals recorded from left and right STNs as the feature vectors. Then these features are combined together via Support Vector Machines (SVM) with Multiple Kernel Learning (MKL) formulation. The MKL-based classification method is utilized to classify different tasks: button press, mouth movement, speech, and arm movement. Our experiments show that the lp-norm MKL significantly outperforms single kernel SVM-based classifiers in classifying behavioral tasks of five subjects even using signals acquired with a low sampling rate of 10 Hz. This leads to a lower computational cost.
Tasks
Published	2016-07-27
URL	http://arxiv.org/abs/1607.07987v1
PDF	http://arxiv.org/pdf/1607.07987v1.pdf
PWC	https://paperswithcode.com/paper/a-multiple-kernel-learning-approach-for-human
Repo
Framework

NodIO, a JavaScript framework for volunteer-based evolutionary algorithms : first results


Title	NodIO, a JavaScript framework for volunteer-based evolutionary algorithms : first results
Authors	Juan-J. Merelo, Mario García-Valdez, Pedro A. Castillo, Pablo García-Sánchez, P. de las Cuevas, Nuria Rico
Abstract	JavaScript is an interpreted language mainly known for its inclusion in web browsers, making them a container for rich Internet based applications. This has inspired its use, for a long time, as a tool for evolutionary algorithms, mainly so in browser-based volunteer computing environments. Several libraries have also been published so far and are in use. However, the last years have seen a resurgence of interest in the language, becoming one of the most popular and thus spawning the improvement of its implementations, which are now the foundation of many new client-server applications. We present such an application for running distributed volunteer-based evolutionary algorithm experiments, and we make a series of measurements to establish the speed of JavaScript in evolutionary algorithms that can serve as a baseline for comparison with other distributed computing experiments. These experiments use different integer and floating point problems, and prove that the speed of JavaScript is actually competitive with other languages commonly used by the evolutionary algorithm practitioner.
Tasks
Published	2016-01-07
URL	http://arxiv.org/abs/1601.01607v1
PDF	http://arxiv.org/pdf/1601.01607v1.pdf
PWC	https://paperswithcode.com/paper/nodio-a-javascript-framework-for-volunteer
Repo
Framework

LSTM-Based System-Call Language Modeling and Robust Ensemble Method for Designing Host-Based Intrusion Detection Systems


Title	LSTM-Based System-Call Language Modeling and Robust Ensemble Method for Designing Host-Based Intrusion Detection Systems
Authors	Gyuwan Kim, Hayoon Yi, Jangho Lee, Yunheung Paek, Sungroh Yoon
Abstract	In computer security, designing a robust intrusion detection system is one of the most fundamental and important problems. In this paper, we propose a system-call language-modeling approach for designing anomaly-based host intrusion detection systems. To remedy the issue of high false-alarm rates commonly arising in conventional methods, we employ a novel ensemble method that blends multiple thresholding classifiers into a single one, making it possible to accumulate ‘highly normal’ sequences. The proposed system-call language model has various advantages leveraged by the fact that it can learn the semantic meaning and interactions of each system call that existing methods cannot effectively consider. Through diverse experiments on public benchmark datasets, we demonstrate the validity and effectiveness of the proposed method. Moreover, we show that our model possesses high portability, which is one of the key aspects of realizing successful intrusion detection systems.
Tasks	Intrusion Detection, Language Modelling
Published	2016-11-06
URL	http://arxiv.org/abs/1611.01726v1
PDF	http://arxiv.org/pdf/1611.01726v1.pdf
PWC	https://paperswithcode.com/paper/lstm-based-system-call-language-modeling-and
Repo
Framework

Attribute Learning for Network Intrusion Detection


Title	Attribute Learning for Network Intrusion Detection
Authors	Jorge Luis Rivero Pérez, Bernardete Ribeiro
Abstract	Network intrusion detection is one of the most visible uses for Big Data analytics. One of the main problems in this application is the constant rise of new attacks. This scenario, characterized by the fact that not enough labeled examples are available for the new classes of attacks is hardly addressed by traditional machine learning approaches. New findings on the capabilities of Zero-Shot learning (ZSL) approach makes it an interesting solution for this problem because it has the ability to classify instances of unseen classes. ZSL has inherently two stages: the attribute learning and the inference stage. In this paper we propose a new algorithm for the attribute learning stage of ZSL. The idea is to learn new values for the attributes based on decision trees (DT). Our results show that based on the rules extracted from the DT a better distribution for the attribute values can be found. We also propose an experimental setup for the evaluation of ZSL on network intrusion detection (NID).
Tasks	Intrusion Detection, Network Intrusion Detection, Zero-Shot Learning
Published	2016-07-28
URL	http://arxiv.org/abs/1607.08634v1
PDF	http://arxiv.org/pdf/1607.08634v1.pdf
PWC	https://paperswithcode.com/paper/attribute-learning-for-network-intrusion
Repo
Framework

Fourier Analysis and q-Gaussian Functions: Analytical and Numerical Results


Title	Fourier Analysis and q-Gaussian Functions: Analytical and Numerical Results
Authors	Paulo Sérgio Silva Rodrigues, Gilson Antonio Giraldi
Abstract	It is a consensus in signal processing that the Gaussian kernel and its partial derivatives enable the development of robust algorithms for feature detection. Fourier analysis and convolution theory have central role in such development. In this paper we collect theoretical elements to follow this avenue but using the q-Gaussian kernel that is a nonextensive generalization of the Gaussian one. Firstly, we review some theoretical elements behind the one-dimensional q-Gaussian and its Fourier transform. Then, we consider the two-dimensional q-Gaussian and we highlight the issues behind its analytical Fourier transform computation. We analyze the q-Gaussian kernel in the space and Fourier domains using the concepts of space window, cut-off frequency, and the Heisenberg inequality.
Tasks
Published	2016-05-02
URL	http://arxiv.org/abs/1605.00452v1
PDF	http://arxiv.org/pdf/1605.00452v1.pdf
PWC	https://paperswithcode.com/paper/fourier-analysis-and-q-gaussian-functions
Repo
Framework

Joint Event Detection and Entity Resolution: a Virtuous Cycle


Title	Joint Event Detection and Entity Resolution: a Virtuous Cycle
Authors	Matthias Galle, Jean-Michel Renders, Guillaume Jacquet
Abstract	Clustering web documents has numerous applications, such as aggregating news articles into meaningful events, detecting trends and hot topics on the Web, preserving diversity in search results, etc. At the same time, the importance of named entities and, in particular, the ability to recognize them and to solve the associated co-reference resolution problem are widely recognized as key enabling factors when mining, aggregating and comparing content on the Web. Instead of considering these two problems separately, we propose in this paper a method that tackles jointly the problem of clustering news articles into events and cross-document co-reference resolution of named entities. The co-occurrence of named entities in the same clusters is used as an additional signal to decide whether two referents should be merged into one entity. These refined entities can in turn be used as enhanced features to re-cluster the documents and then be refined again, entering into a virtuous cycle that improves simultaneously the performances of both tasks. We implemented a prototype system and report results using the TDT5 collection of news articles, demonstrating the potential of our approach.
Tasks	Entity Resolution
Published	2016-07-18
URL	http://arxiv.org/abs/1607.05142v1
PDF	http://arxiv.org/pdf/1607.05142v1.pdf
PWC	https://paperswithcode.com/paper/joint-event-detection-and-entity-resolution-a
Repo
Framework

Towards Better Analysis of Deep Convolutional Neural Networks


Title	Towards Better Analysis of Deep Convolutional Neural Networks
Authors	Mengchen Liu, Jiaxin Shi, Zhen Li, Chongxuan Li, Jun Zhu, Shixia Liu
Abstract	Deep convolutional neural networks (CNNs) have achieved breakthrough performance in many pattern recognition tasks such as image classification. However, the development of high-quality deep models typically relies on a substantial amount of trial-and-error, as there is still no clear understanding of when and why a deep model works. In this paper, we present a visual analytics approach for better understanding, diagnosing, and refining deep CNNs. We formulate a deep CNN as a directed acyclic graph. Based on this formulation, a hybrid visualization is developed to disclose the multiple facets of each neuron and the interactions between them. In particular, we introduce a hierarchical rectangle packing algorithm and a matrix reordering algorithm to show the derived features of a neuron cluster. We also propose a biclustering-based edge bundling method to reduce visual clutter caused by a large number of connections between neurons. We evaluated our method on a set of CNNs and the results are generally favorable.
Tasks	Image Classification
Published	2016-04-24
URL	http://arxiv.org/abs/1604.07043v3
PDF	http://arxiv.org/pdf/1604.07043v3.pdf
PWC	https://paperswithcode.com/paper/towards-better-analysis-of-deep-convolutional
Repo
Framework

Robust Downbeat Tracking Using an Ensemble of Convolutional Networks


Title	Robust Downbeat Tracking Using an Ensemble of Convolutional Networks
Authors	S. Durand, J. P. Bello, B. David, G. Richard
Abstract	In this paper, we present a novel state of the art system for automatic downbeat tracking from music signals. The audio signal is first segmented in frames which are synchronized at the tatum level of the music. We then extract different kind of features based on harmony, melody, rhythm and bass content to feed convolutional neural networks that are adapted to take advantage of each feature characteristics. This ensemble of neural networks is combined to obtain one downbeat likelihood per tatum. The downbeat sequence is finally decoded with a flexible and efficient temporal model which takes advantage of the metrical continuity of a song. We then perform an evaluation of our system on a large base of 9 datasets, compare its performance to 4 other published algorithms and obtain a significant increase of 16.8 percent points compared to the second best system, for altogether a moderate cost in test and training. The influence of each step of the method is studied to show its strengths and shortcomings.
Tasks
Published	2016-05-26
URL	http://arxiv.org/abs/1605.08396v1
PDF	http://arxiv.org/pdf/1605.08396v1.pdf
PWC	https://paperswithcode.com/paper/robust-downbeat-tracking-using-an-ensemble-of
Repo
Framework

Coarse2Fine: Two-Layer Fusion For Image Retrieval


Title	Coarse2Fine: Two-Layer Fusion For Image Retrieval
Authors	Gaipeng Kong, Le Dong, Wenpu Dong, Liang Zheng, Qi Tian
Abstract	This paper addresses the problem of large-scale image retrieval. We propose a two-layer fusion method which takes advantage of global and local cues and ranks database images from coarse to fine (C2F). Departing from the previous methods fusing multiple image descriptors simultaneously, C2F is featured by a layered procedure composed by filtering and refining. In particular, C2F consists of three components. 1) Distractor filtering. With holistic representations, noise images are filtered out from the database, so the number of candidate images to be used for comparison with the query can be greatly reduced. 2) Adaptive weighting. For a certain query, the similarity of candidate images can be estimated by holistic similarity scores in complementary to the local ones. 3) Candidate refining. Accurate retrieval is conducted via local features, combining the pre-computed adaptive weights. Experiments are presented on two benchmarks, \emph{i.e.,} Holidays and Ukbench datasets. We show that our method outperforms recent fusion methods in terms of storage consumption and computation complexity, and that the accuracy is competitive to the state-of-the-arts.
Tasks	Image Retrieval
Published	2016-07-04
URL	http://arxiv.org/abs/1607.00719v1
PDF	http://arxiv.org/pdf/1607.00719v1.pdf
PWC	https://paperswithcode.com/paper/coarse2fine-two-layer-fusion-for-image
Repo
Framework