Paper Group ANR 462
Spatiotemporal Prediction of Ambulance Demand using Gaussian Process Regression. Video Description: A Survey of Methods, Datasets and Evaluation Metrics. Variational Selection of Features for Molecular Kinetics. Redundancy Coefficient Gradual Up-weighting-based Mutual Information Feature Selection Technique for Crypto-ransomware Early Detection. Un …
Spatiotemporal Prediction of Ambulance Demand using Gaussian Process Regression
Title | Spatiotemporal Prediction of Ambulance Demand using Gaussian Process Regression |
Authors | Seth Nabarro, Tristan Fletcher, John Shawe-Taylor |
Abstract | Accurately predicting when and where ambulance call-outs occur can reduce response times and ensure the patient receives urgent care sooner. Here we present a novel method for ambulance demand prediction using Gaussian process regression (GPR) in time and geographic space. The method exhibits superior accuracy to MEDIC, a method which has been used in industry. The use of GPR has additional benefits such as the quantification of uncertainty with each prediction, the choice of kernel functions to encode prior knowledge and the ability to capture spatial correlation. Measures to increase the utility of GPR in the current context, with large training sets and a Poisson-distributed output, are outlined. |
Tasks | |
Published | 2018-06-28 |
URL | http://arxiv.org/abs/1806.10873v1 |
http://arxiv.org/pdf/1806.10873v1.pdf | |
PWC | https://paperswithcode.com/paper/spatiotemporal-prediction-of-ambulance-demand |
Repo | |
Framework | |
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Title | Video Description: A Survey of Methods, Datasets and Evaluation Metrics |
Authors | Nayyer Aafaq, Ajmal Mian, Wei Liu, Syed Zulqarnain Gilani, Mubarak Shah |
Abstract | Video description is the automatic generation of natural language sentences that describe the contents of a given video. It has applications in human-robot interaction, helping the visually impaired and video subtitling. The past few years have seen a surge of research in this area due to the unprecedented success of deep learning in computer vision and natural language processing. Numerous methods, datasets and evaluation metrics have been proposed in the literature, calling the need for a comprehensive survey to focus research efforts in this flourishing new direction. This paper fills the gap by surveying the state of the art approaches with a focus on deep learning models; comparing benchmark datasets in terms of their domains, number of classes, and repository size; and identifying the pros and cons of various evaluation metrics like SPICE, CIDEr, ROUGE, BLEU, METEOR, and WMD. Classical video description approaches combined subject, object and verb detection with template based language models to generate sentences. However, the release of large datasets revealed that these methods can not cope with the diversity in unconstrained open domain videos. Classical approaches were followed by a very short era of statistical methods which were soon replaced with deep learning, the current state of the art in video description. Our survey shows that despite the fast-paced developments, video description research is still in its infancy due to the following reasons. Analysis of video description models is challenging because it is difficult to ascertain the contributions, towards accuracy or errors, of the visual features and the adopted language model in the final description. Existing datasets neither contain adequate visual diversity nor complexity of linguistic structures. Finally, current evaluation metrics … |
Tasks | Language Modelling, Video Description |
Published | 2018-06-01 |
URL | https://arxiv.org/abs/1806.00186v4 |
https://arxiv.org/pdf/1806.00186v4.pdf | |
PWC | https://paperswithcode.com/paper/video-description-a-survey-of-methods |
Repo | |
Framework | |
Variational Selection of Features for Molecular Kinetics
Title | Variational Selection of Features for Molecular Kinetics |
Authors | Martin K. Scherer, Brooke E. Husic, Moritz Hoffmann, Fabian Paul, Hao Wu, Frank Noé |
Abstract | The modeling of atomistic biomolecular simulations using kinetic models such as Markov state models (MSMs) has had many notable algorithmic advances in recent years. The variational principle has opened the door for a nearly fully automated toolkit for selecting models that predict the long-time kinetics from molecular dynamics simulations. However, one yet-unoptimized step of the pipeline involves choosing the features, or collective variables, from which the model should be constructed. In order to build intuitive models, these collective variables are often sought to be interpretable and familiar features, such as torsional angles or contact distances in a protein structure. However, previous approaches for evaluating the chosen features rely on constructing a full MSM, which in turn requires additional hyperparameters to be chosen, and hence leads to a computationally expensive framework. Here, we present a method to optimize the feature choice directly, without requiring the construction of the final kinetic model. We demonstrate our rigorous preprocessing algorithm on a canonical set of twelve fast-folding protein simulations, and show that our procedure leads to more efficient model selection. |
Tasks | Model Selection |
Published | 2018-11-28 |
URL | http://arxiv.org/abs/1811.11714v2 |
http://arxiv.org/pdf/1811.11714v2.pdf | |
PWC | https://paperswithcode.com/paper/variational-selection-of-features-for |
Repo | |
Framework | |
Redundancy Coefficient Gradual Up-weighting-based Mutual Information Feature Selection Technique for Crypto-ransomware Early Detection
Title | Redundancy Coefficient Gradual Up-weighting-based Mutual Information Feature Selection Technique for Crypto-ransomware Early Detection |
Authors | Bander Ali Saleh Al-rimy, Mohd Aizaini Maarof, Syed Zainudeen Mohd Shaid |
Abstract | Crypto-ransomware is characterized by its irreversible effect even after the detection and removal. As such, the early detection is crucial to protect user data and files of being held to ransom. Several solutions have proposed utilizing the data extracted during the initial phases of the attacks before the encryption takes place. However, the lack of enough data at the early phases of the attack along with high dimensional features space renders the model prone to overfitting which decreases its detection accuracy. To this end, this paper proposed a novel redundancy coefficient gradual up-weighting approach that was incorporated to the calculation of redundancy term of mutual information to improve the feature selection process and enhance the accuracy of the detection model. Several machine learning classifiers were used to evaluate the detection performance of the proposed techniques. The experimental results show that the accuracy of proposed techniques achieved higher detection accuracy. Those results demonstrate the efficacy of the proposed techniques for the early detection tasks. |
Tasks | Feature Selection |
Published | 2018-07-22 |
URL | http://arxiv.org/abs/1807.09574v1 |
http://arxiv.org/pdf/1807.09574v1.pdf | |
PWC | https://paperswithcode.com/paper/redundancy-coefficient-gradual-up-weighting |
Repo | |
Framework | |
Uncertainty Propagation in Deep Neural Networks Using Extended Kalman Filtering
Title | Uncertainty Propagation in Deep Neural Networks Using Extended Kalman Filtering |
Authors | Jessica S. Titensky, Hayden Jananthan, Jeremy Kepner |
Abstract | Extended Kalman Filtering (EKF) can be used to propagate and quantify input uncertainty through a Deep Neural Network (DNN) assuming mild hypotheses on the input distribution. This methodology yields results comparable to existing methods of uncertainty propagation for DNNs while lowering the computational overhead considerably. Additionally, EKF allows model error to be naturally incorporated into the output uncertainty. |
Tasks | |
Published | 2018-09-17 |
URL | http://arxiv.org/abs/1809.06009v1 |
http://arxiv.org/pdf/1809.06009v1.pdf | |
PWC | https://paperswithcode.com/paper/uncertainty-propagation-in-deep-neural |
Repo | |
Framework | |
Tunneling Neural Perception and Logic Reasoning through Abductive Learning
Title | Tunneling Neural Perception and Logic Reasoning through Abductive Learning |
Authors | Wang-Zhou Dai, Qiu-Ling Xu, Yang Yu, Zhi-Hua Zhou |
Abstract | Perception and reasoning are basic human abilities that are seamlessly connected as part of human intelligence. However, in current machine learning systems, the perception and reasoning modules are incompatible. Tasks requiring joint perception and reasoning ability are difficult to accomplish autonomously and still demand human intervention. Inspired by the way language experts decoded Mayan scripts by joining two abilities in an abductive manner, this paper proposes the abductive learning framework. The framework learns perception and reasoning simultaneously with the help of a trial-and-error abductive process. We present the Neural-Logical Machine as an implementation of this novel learning framework. We demonstrate that–using human-like abductive learning–the machine learns from a small set of simple hand-written equations and then generalizes well to complex equations, a feat that is beyond the capability of state-of-the-art neural network models. The abductive learning framework explores a new direction for approaching human-level learning ability. |
Tasks | |
Published | 2018-02-04 |
URL | http://arxiv.org/abs/1802.01173v2 |
http://arxiv.org/pdf/1802.01173v2.pdf | |
PWC | https://paperswithcode.com/paper/tunneling-neural-perception-and-logic |
Repo | |
Framework | |
A Novel ILP Framework for Summarizing Content with High Lexical Variety
Title | A Novel ILP Framework for Summarizing Content with High Lexical Variety |
Authors | Wencan Luo, Fei Liu, Zitao Liu, Diane Litman |
Abstract | Summarizing content contributed by individuals can be challenging, because people make different lexical choices even when describing the same events. However, there remains a significant need to summarize such content. Examples include the student responses to post-class reflective questions, product reviews, and news articles published by different news agencies related to the same events. High lexical diversity of these documents hinders the system’s ability to effectively identify salient content and reduce summary redundancy. In this paper, we overcome this issue by introducing an integer linear programming-based summarization framework. It incorporates a low-rank approximation to the sentence-word co-occurrence matrix to intrinsically group semantically-similar lexical items. We conduct extensive experiments on datasets of student responses, product reviews, and news documents. Our approach compares favorably to a number of extractive baselines as well as a neural abstractive summarization system. The paper finally sheds light on when and why the proposed framework is effective at summarizing content with high lexical variety. |
Tasks | Abstractive Text Summarization |
Published | 2018-07-25 |
URL | http://arxiv.org/abs/1807.09671v1 |
http://arxiv.org/pdf/1807.09671v1.pdf | |
PWC | https://paperswithcode.com/paper/a-novel-ilp-framework-for-summarizing-content |
Repo | |
Framework | |
Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition
Title | Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition |
Authors | Yang Liu, Zhaoyang Lu, Jing Li, Tao Yang |
Abstract | Recognizing human actions from varied views is challenging due to huge appearance variations in different views. The key to this problem is to learn discriminant view-invariant representations generalizing well across views. In this paper, we address this problem by learning view-invariant representations hierarchically using a novel method, referred to as Joint Sparse Representation and Distribution Adaptation (JSRDA). To obtain robust and informative feature representations, we first incorporate a sample-affinity matrix into the marginalized stacked denoising Autoencoder (mSDA) to obtain shared features, which are then combined with the private features. In order to make the feature representations of videos across views transferable, we then learn a transferable dictionary pair simultaneously from pairs of videos taken at different views to encourage each action video across views to have the same sparse representation. However, the distribution difference across views still exists because a unified subspace where the sparse representations of one action across views are the same may not exist when the view difference is large. Therefore, we propose a novel unsupervised distribution adaptation method that learns a set of projections that project the source and target views data into respective low-dimensional subspaces where the marginal and conditional distribution differences are reduced simultaneously. Therefore, the finally learned feature representation is view-invariant and robust for substantial distribution difference across views even the view difference is large. Experimental results on four multiview datasets show that our approach outperforms the state-ofthe-art approaches. |
Tasks | Denoising, Temporal Action Localization |
Published | 2018-09-03 |
URL | https://arxiv.org/abs/1809.00421v2 |
https://arxiv.org/pdf/1809.00421v2.pdf | |
PWC | https://paperswithcode.com/paper/hierarchically-learned-view-invariant |
Repo | |
Framework | |
PIVETed-Granite: Computational Phenotypes through Constrained Tensor Factorization
Title | PIVETed-Granite: Computational Phenotypes through Constrained Tensor Factorization |
Authors | Jette Henderson, Bradley A. Malin, Joyce C. Ho, Joydeep Ghosh |
Abstract | It has been recently shown that sparse, nonnegative tensor factorization of multi-modal electronic health record data is a promising approach to high-throughput computational phenotyping. However, such approaches typically do not leverage available domain knowledge while extracting the phenotypes; hence, some of the suggested phenotypes may not map well to clinical concepts or may be very similar to other suggested phenotypes. To address these issues, we present a novel, automatic approach called PIVETed-Granite that mines existing biomedical literature (PubMed) to obtain cannot-link constraints that are then used as side-information during a tensor-factorization based computational phenotyping process. The resulting improvements are clearly observed in experiments using a large dataset from VUMC to identify phenotypes for hypertensive patients. |
Tasks | Computational Phenotyping |
Published | 2018-08-08 |
URL | http://arxiv.org/abs/1808.02602v1 |
http://arxiv.org/pdf/1808.02602v1.pdf | |
PWC | https://paperswithcode.com/paper/piveted-granite-computational-phenotypes |
Repo | |
Framework | |
Optimization of neural networks via finite-value quantum fluctuations
Title | Optimization of neural networks via finite-value quantum fluctuations |
Authors | Masayuki Ohzeki, Shuntaro Okada, Masayoshi Terabe, Shinichiro Taguchi |
Abstract | We numerically test an optimization method for deep neural networks (DNNs) using quantum fluctuations inspired by quantum annealing. For efficient optimization, our method utilizes the quantum tunneling effect beyond the potential barriers. The path integral formulation of the DNN optimization generates an attracting force to simulate the quantum tunneling effect. In the standard quantum annealing method, the quantum fluctuations will vanish at the last stage of optimization. In this study, we propose a learning protocol that utilizes a finite value for quantum fluctuations strength to obtain higher generalization performance, which is a type of robustness. We demonstrate the performance of our method using two well-known open datasets: the MNIST dataset and the Olivetti face dataset. Although computational costs prevent us from testing our method on large datasets with high-dimensional data, results show that our method can enhance generalization performance by induction of the finite value for quantum fluctuations. |
Tasks | |
Published | 2018-07-01 |
URL | http://arxiv.org/abs/1807.00414v1 |
http://arxiv.org/pdf/1807.00414v1.pdf | |
PWC | https://paperswithcode.com/paper/optimization-of-neural-networks-via-finite |
Repo | |
Framework | |
AdaFlow: Domain-Adaptive Density Estimator with Application to Anomaly Detection and Unpaired Cross-Domain Translation
Title | AdaFlow: Domain-Adaptive Density Estimator with Application to Anomaly Detection and Unpaired Cross-Domain Translation |
Authors | Masataka Yamaguchi, Yuma Koizumi, Noboru Harada |
Abstract | We tackle unsupervised anomaly detection (UAD), a problem of detecting data that significantly differ from normal data. UAD is typically solved by using density estimation. Recently, deep neural network (DNN)-based density estimators, such as Normalizing Flows, have been attracting attention. However, one of their drawbacks is the difficulty in adapting them to the change in the normal data’s distribution. To address this difficulty, we propose AdaFlow, a new DNN-based density estimator that can be easily adapted to the change of the distribution. AdaFlow is a unified model of a Normalizing Flow and Adaptive Batch-Normalizations, a module that enables DNNs to adapt to new distributions. AdaFlow can be adapted to a new distribution by just conducting forward propagation once per sample; hence, it can be used on devices that have limited computational resources. We have confirmed the effectiveness of the proposed model through an anomaly detection in a sound task. We also propose a method of applying AdaFlow to the unpaired cross-domain translation problem, in which one has to train a cross-domain translation model with only unpaired samples. We have confirmed that our model can be used for the cross-domain translation problem through experiments on image datasets. |
Tasks | Anomaly Detection, Density Estimation, Unsupervised Anomaly Detection |
Published | 2018-12-14 |
URL | http://arxiv.org/abs/1812.05796v2 |
http://arxiv.org/pdf/1812.05796v2.pdf | |
PWC | https://paperswithcode.com/paper/adaflow-domain-adaptive-density-estimator |
Repo | |
Framework | |
Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data
Title | Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data |
Authors | Hyunwoo Jung, Moonsu Han, Minki Kang, Sungju Hwang |
Abstract | Current generation of memory-augmented neural networks has limited scalability as they cannot efficiently process data that are too large to fit in the external memory storage. One example of this is lifelong learning scenario where the model receives unlimited length of data stream as an input which contains vast majority of uninformative entries. We tackle this problem by proposing a memory network fit for long-term lifelong learning scenario, which we refer to as Long-term Episodic Memory Networks (LEMN), that features a RNN-based retention agent that learns to replace less important memory entries based on the retention probability generated on each entry that is learned to identify data instances of generic importance relative to other memory entries, as well as its historical importance. Such learning of retention agent allows our long-term episodic memory network to retain memory entries of generic importance for a given task. We validate our model on a path-finding task as well as synthetic and real question answering tasks, on which our model achieves significant improvements over the memory augmented networks with rule-based memory scheduling as well as an RL-based baseline that does not consider relative or historical importance of the memory. |
Tasks | Question Answering |
Published | 2018-12-11 |
URL | http://arxiv.org/abs/1812.04227v1 |
http://arxiv.org/pdf/1812.04227v1.pdf | |
PWC | https://paperswithcode.com/paper/learning-what-to-remember-long-term-episodic |
Repo | |
Framework | |
Real-Time Shape Tracking of Facial Landmarks
Title | Real-Time Shape Tracking of Facial Landmarks |
Authors | Hyungjoon Kim, Hyeonwoo Kim, Eenjun Hwang |
Abstract | Detection of facial landmarks and accurate tracking of their shape are essential in real-time virtual makeup applications, where users can see the makeups effect by moving their face in different directions. Typical face tracking techniques detect diverse facial landmarks and track them using a point tracker such as the Kanade-Lucas-Tomasi (KLT) point tracker. Typically, 5 or 64 points are used for tracking a face. Even though these points are sufficient to track the approximate locations of facial landmarks, they are not sufficient to track the exact shape of facial landmarks. In this paper, we propose a method that can track the exact shape of facial landmarks in real-time by combining a deep learning technique and a point tracker. We detect facial landmarks accurately using SegNet, which performs semantic segmentation based on deep learning. Edge points of detected landmarks are tracked using the KLT point tracker. In spite of its popularity, the KLT point tracker suffers from the point loss problem. We solve this problem by executing SegNet periodically to calculate the shape of facial landmarks. That is, by combining the two techniques, we can avoid the computational overhead of SegNet for real-time shape tracking and the point loss problem of the KLT point tracker. We performed several experiments to evaluate the performance of our method and report some of the results herein. |
Tasks | Semantic Segmentation |
Published | 2018-07-14 |
URL | http://arxiv.org/abs/1807.05333v1 |
http://arxiv.org/pdf/1807.05333v1.pdf | |
PWC | https://paperswithcode.com/paper/real-time-shape-tracking-of-facial-landmarks |
Repo | |
Framework | |
Similarity based hierarchical clustering of physiological parameters for the identification of health states - a feasibility study
Title | Similarity based hierarchical clustering of physiological parameters for the identification of health states - a feasibility study |
Authors | Fabian Schrumpf, Gerold Bausch, Matthias Sturm, Mirco Fuchs |
Abstract | This paper introduces a new unsupervised method for the clustering of physiological data into health states based on their similarity. We propose an iterative hierarchical clustering approach that combines health states according to a similarity constraint to new arbitrary health states. We applied method to experimental data in which the physical strain of subjects was systematically varied. We derived health states based on parameters extracted from ECG data. The occurrence of health states shows a high temporal correlation to the experimental phases of the physical exercise. We compared our method to other clustering algorithms and found a significantly higher accuracy with respect to the identification of health states. |
Tasks | |
Published | 2018-03-26 |
URL | http://arxiv.org/abs/1803.09592v1 |
http://arxiv.org/pdf/1803.09592v1.pdf | |
PWC | https://paperswithcode.com/paper/similarity-based-hierarchical-clustering-of |
Repo | |
Framework | |
A Scalable Machine Learning Approach for Inferring Probabilistic US-LI-RADS Categorization
Title | A Scalable Machine Learning Approach for Inferring Probabilistic US-LI-RADS Categorization |
Authors | Imon Banerjee, Hailey H. Choi, Terry Desser, Daniel L. Rubin |
Abstract | We propose a scalable computerized approach for large-scale inference of Liver Imaging Reporting and Data System (LI-RADS) final assessment categories in narrative ultrasound (US) reports. Although our model was trained on reports created using a LI-RADS template, it was also able to infer LI-RADS scoring for unstructured reports that were created before the LI-RADS guidelines were established. No human-labelled data was required in any step of this study; for training, LI-RADS scores were automatically extracted from those reports that contained structured LI-RADS scores, and it translated the derived knowledge to reasoning on unstructured radiology reports. By providing automated LI-RADS categorization, our approach may enable standardizing screening recommendations and treatment planning of patients at risk for hepatocellular carcinoma, and it may facilitate AI-based healthcare research with US images by offering large scale text mining and data gathering opportunities from standard hospital clinical data repositories. |
Tasks | |
Published | 2018-06-15 |
URL | http://arxiv.org/abs/1806.07346v1 |
http://arxiv.org/pdf/1806.07346v1.pdf | |
PWC | https://paperswithcode.com/paper/a-scalable-machine-learning-approach-for |
Repo | |
Framework | |