January 30, 2020

2688 words 13 mins read

Paper Group ANR 297

A Framework for Evaluating Agricultural Ontologies. Equations Derivation of VINS-Mono. Hyperlink Regression via Bregman Divergence. Identifying Adversarial Sentences by Analyzing Text Complexity. Pólygamma Data Augmentation to address Non-conjugacy in the Bayesian Estimation of Mixed Multinomial Logit Models. Predicting Louisiana Public High School …

A Framework for Evaluating Agricultural Ontologies


Title	A Framework for Evaluating Agricultural Ontologies
Authors	Anat Goldstein, Lior Fink, Gilad Ravid
Abstract	An ontology is a formal representation of domain knowledge, which can be interpreted by machines. In recent years, ontologies have become a major tool for domain knowledge representation and a core component of many knowledge management systems, decision support systems and other intelligent systems, inter alia, in the context of agriculture. A review of the existing literature on agricultural ontologies, however, reveals that most of the studies, which propose agricultural ontologies, are lacking an explicit evaluation procedure. This is undesired because without well-structured evaluation processes, it is difficult to consider the value of ontologies to research and practice. Moreover, it is difficult to rely on such ontologies and share them on the Semantic Web or between semantic aware applications. With the growing number of ontology-based agricultural systems and the increasing popularity of the Semantic Web, it becomes essential that such development and evaluation methods are put forward to guide future efforts of ontology development. Our work contributes to the literature on agricultural ontologies, by presenting a method for evaluating agricultural ontologies, which seems to be missing from most existing studies on agricultural ontologies. The framework supports the matching of appropriate evaluation methods for a given ontology based on the ontology’s purpose.
Tasks
Published	2019-06-25
URL	https://arxiv.org/abs/1906.10450v2
PDF	https://arxiv.org/pdf/1906.10450v2.pdf
PWC	https://paperswithcode.com/paper/a-framework-for-evaluating-agricultural
Repo
Framework

Equations Derivation of VINS-Mono


Title	Equations Derivation of VINS-Mono
Authors	Yibin Wu
Abstract	The VINS-Mono is a monocular visual-inertial 6 DOF state estimator proposed by Aerial Robotics Group at HKUST in 2017, which can be performed on MAVs, smartphones and many other intelligent platforms. It is a state-of-the-art visual-inertial odometry algorithms which has gained extensive attention worldwide. The main equations including IMU preintegration, visual/inertial co-initialization and tightly-coupled nonlinear optimization are derived and analyzed in this manuscript.
Tasks
Published	2019-12-16
URL	https://arxiv.org/abs/1912.11986v1
PDF	https://arxiv.org/pdf/1912.11986v1.pdf
PWC	https://paperswithcode.com/paper/equations-derivation-of-vins-mono
Repo
Framework

Hyperlink Regression via Bregman Divergence


Title	Hyperlink Regression via Bregman Divergence
Authors	Akifumi Okuno, Hidetoshi Shimodaira
Abstract	A collection of $U : (\in \mathbb{N})$ data vectors is called a $U$-tuple, and the association strength among the vectors of a tuple is termed as the \emph{hyperlink weight}, that is assumed to be symmetric with respect to permutation of the entries in the index. We herein propose Bregman hyperlink regression (BHLR), which learns a user-specified symmetric similarity function such that it predicts the tuple’s hyperlink weight from data vectors stored in the $U$-tuple. BHLR is a simple and general framework for hyper-relational learning, that minimizes Bregman-divergence (BD) between the hyperlink weights and estimated similarities defined for the corresponding tuples; BHLR encompasses various existing methods, such as logistic regression ($U=1$), Poisson regression ($U=1$), link prediction ($U=2$), and those for representation learning, such as graph embedding ($U=2$), matrix factorization ($U=2$), tensor factorization ($U \geq 2$), and their variants equipped with arbitrary BD. Nonlinear functions (e.g., neural networks), can be employed for the similarity functions. However, there are theoretical challenges such that some of different tuples of BHLR may share data vectors therein, unlike the i.i.d. setting of classical regression. We address these theoretical issues, and proved that BHLR equipped with arbitrary BD and $U \in \mathbb{N}$ is (P-1) statistically consistent, that is, it asymptotically recovers the underlying true conditional expectation of hyperlink weights given data vectors, and (P-2) computationally tractable, that is, it is efficiently computed by stochastic optimization algorithms using a novel generalized minibatch sampling procedure for hyper-relational data. Consequently, theoretical guarantees for BHLR including several existing methods, that have been examined experimentally, are provided in a unified manner.
Tasks	Graph Embedding, Link Prediction, Relational Reasoning, Representation Learning, Stochastic Optimization
Published	2019-07-22
URL	https://arxiv.org/abs/1908.02573v2
PDF	https://arxiv.org/pdf/1908.02573v2.pdf
PWC	https://paperswithcode.com/paper/hyperlink-regression-via-bregman-divergence
Repo
Framework

Identifying Adversarial Sentences by Analyzing Text Complexity


Title	Identifying Adversarial Sentences by Analyzing Text Complexity
Authors	Hoang-Quoc Nguyen-Son, Tran Phuong Thao, Seira Hidano, Shinsaku Kiyomoto
Abstract	Attackers create adversarial text to deceive both human perception and the current AI systems to perform malicious purposes such as spam product reviews and fake political posts. We investigate the difference between the adversarial and the original text to prevent the risk. We prove that the text written by a human is more coherent and fluent. Moreover, the human can express the idea through the flexible text with modern words while a machine focuses on optimizing the generated text by the simple and common words. We also suggest a method to identify the adversarial text by extracting the features related to our findings. The proposed method achieves high performance with 82.0% of accuracy and 18.4% of equal error rate, which is better than the existing methods whose the best accuracy is 77.0% corresponding to the error rate 22.8%.
Tasks	Adversarial Text
Published	2019-12-19
URL	https://arxiv.org/abs/1912.08981v1
PDF	https://arxiv.org/pdf/1912.08981v1.pdf
PWC	https://paperswithcode.com/paper/identifying-adversarial-sentences-by
Repo
Framework

Pólygamma Data Augmentation to address Non-conjugacy in the Bayesian Estimation of Mixed Multinomial Logit Models


Title	Pólygamma Data Augmentation to address Non-conjugacy in the Bayesian Estimation of Mixed Multinomial Logit Models
Authors	Prateek Bansal, Rico Krueger, Michel Bierlaire, Ricardo A. Daziano, Taha H. Rashidi
Abstract	The standard Gibbs sampler of Mixed Multinomial Logit (MMNL) models involves sampling from conditional densities of utility parameters using Metropolis-Hastings (MH) algorithm due to unavailability of conjugate prior for logit kernel. To address this non-conjugacy concern, we propose the application of P'olygamma data augmentation (PG-DA) technique for the MMNL estimation. The posterior estimates of the augmented and the default Gibbs sampler are similar for two-alternative scenario (binary choice), but we encounter empirical identification issues in the case of more alternatives ($J \geq 3$).
Tasks	Data Augmentation
Published	2019-04-13
URL	http://arxiv.org/abs/1904.07688v1
PDF	http://arxiv.org/pdf/1904.07688v1.pdf
PWC	https://paperswithcode.com/paper/polygamma-data-augmentation-to-address-non
Repo
Framework

Predicting Louisiana Public High School Dropout through Imbalanced Learning Techniques


Title	Predicting Louisiana Public High School Dropout through Imbalanced Learning Techniques
Authors	Marmar Orooji, Jianhua Chen
Abstract	This study is motivated by the magnitude of the problem of Louisiana high school dropout and its negative impacts on individual and public well-being. Our goal is to predict students who are at risk of high school dropout, by examining Louisiana administrative dataset. Due to the imbalanced nature of the dataset, imbalanced learning techniques including resampling, case weighting, and cost-sensitive learning have been applied to enhance the prediction performance on the rare class. Performance metrics used in this study are F-measure, recall and precision of the rare class. We compare the performance of several machine learning algorithms such as neural networks, decision trees and bagging trees in combination with the imbalanced learning approaches using an administrative dataset of size of 366k+ from Louisiana Department of Education. Experiments show that application of imbalanced learning methods produces good results on recall but decreases precision, whereas base classifiers without regard of imbalanced data handling gives better precision but poor recall. Overall application of imbalanced learning techniques is beneficial, yet more studies are desired to improve precision.
Tasks
Published	2019-10-29
URL	https://arxiv.org/abs/1910.13018v1
PDF	https://arxiv.org/pdf/1910.13018v1.pdf
PWC	https://paperswithcode.com/paper/predicting-louisiana-public-high-school
Repo
Framework

Snomed2Vec: Random Walk and Poincaré Embeddings of a Clinical Knowledge Base for Healthcare Analytics


Title	Snomed2Vec: Random Walk and Poincaré Embeddings of a Clinical Knowledge Base for Healthcare Analytics
Authors	Khushbu Agarwal, Tome Eftimov, Raghavendra Addanki, Sutanay Choudhury, Suzanne Tamang, Robert Rallo
Abstract	Representation learning methods that transform encoded data (e.g., diagnosis and drug codes) into continuous vector spaces (i.e., vector embeddings) are critical for the application of deep learning in healthcare. Initial work in this area explored the use of variants of the word2vec algorithm to learn embeddings for medical concepts from electronic health records or medical claims datasets. We propose learning embeddings for medical concepts by using graph-based representation learning methods on SNOMED-CT, a widely popular knowledge graph in the healthcare domain with numerous operational and research applications. Current work presents an empirical analysis of various embedding methods, including the evaluation of their performance on multiple tasks of biomedical relevance (node classification, link prediction, and patient state prediction). Our results show that concept embeddings derived from the SNOMED-CT knowledge graph significantly outperform state-of-the-art embeddings, showing 5-6x improvement in ``concept similarity” and 6-20% improvement in patient diagnosis. \|
Tasks	Link Prediction, Node Classification, Representation Learning
Published	2019-07-19
URL	https://arxiv.org/abs/1907.08650v1
PDF	https://arxiv.org/pdf/1907.08650v1.pdf
PWC	https://paperswithcode.com/paper/snomed2vec-random-walk-and-poincare
Repo
Framework

Domain Constraint Approximation based Semi Supervision


Title	Domain Constraint Approximation based Semi Supervision
Authors	Yifu Wu, Jin Wei, Rigoberto Roche
Abstract	Deep learning for supervised learning has achieved astonishing performance in various machine learning applications. However, annotated data is expensive and rare. In practice, only a small portion of data samples are annotated. Pseudo-ensembling-based approaches have achieved state-of-the-art results in computer vision related tasks. However, it still relies on the quality of an initial model built by labeled data. Less labeled data may degrade model performance a lot. Domain constraint is another way regularize the posterior but has some limitation. In this paper, we proposed a fuzzy domain-constraint-based framework which loses the requirement of traditional constraint learning and enhances the model quality for semi supervision. Simulations results show the effectiveness of our design.
Tasks
Published	2019-02-11
URL	https://arxiv.org/abs/1902.04177v2
PDF	https://arxiv.org/pdf/1902.04177v2.pdf
PWC	https://paperswithcode.com/paper/domain-constraint-approximation-based-semi
Repo
Framework

Distantly Supervised Question Parsing


Title	Distantly Supervised Question Parsing
Authors	Hamid Zafar, Maryam Tavakol, Jens Lehmann
Abstract	The emergence of structured databases for Question Answering (QA) systems has led to developing methods, in which the problem of learning the correct answer efficiently is based on a linking task between the constituents of the question and the corresponding entries in the database. As a result, parsing the questions in order to determine their main elements, which are required for answer retrieval, becomes crucial. However, most datasets for QA systems lack gold annotations for parsing, i.e., labels are only available in the form of (question, formal-query, answer). In this paper, we propose a distantly supervised learning framework based on reinforcement learning to learn the mentions of entities and relations in questions. We leverage the provided formal queries to characterize delayed rewards for optimizing a policy gradient objective for the parsing model. An empirical evaluation of our approach shows a significant improvement in the performance of entity and relation linking compared to the state of the art. We also demonstrate that a more accurate parsing component enhances the overall performance of QA systems.
Tasks	Knowledge Graphs, Question Answering
Published	2019-09-27
URL	https://arxiv.org/abs/1909.12566v2
PDF	https://arxiv.org/pdf/1909.12566v2.pdf
PWC	https://paperswithcode.com/paper/mdp-based-shallow-parsing-in-distantly
Repo
Framework

Action Guidance with MCTS for Deep Reinforcement Learning


Title	Action Guidance with MCTS for Deep Reinforcement Learning
Authors	Bilal Kartal, Pablo Hernandez-Leal, Matthew E. Taylor
Abstract	Deep reinforcement learning has achieved great successes in recent years, however, one main challenge is the sample inefficiency. In this paper, we focus on how to use action guidance by means of a non-expert demonstrator to improve sample efficiency in a domain with sparse, delayed, and possibly deceptive rewards: the recently-proposed multi-agent benchmark of Pommerman. We propose a new framework where even a non-expert simulated demonstrator, e.g., planning algorithms such as Monte Carlo tree search with a small number rollouts, can be integrated within asynchronous distributed deep reinforcement learning methods. Compared to a vanilla deep RL algorithm, our proposed methods both learn faster and converge to better policies on a two-player mini version of the Pommerman game.
Tasks
Published	2019-07-25
URL	https://arxiv.org/abs/1907.11703v1
PDF	https://arxiv.org/pdf/1907.11703v1.pdf
PWC	https://paperswithcode.com/paper/action-guidance-with-mcts-for-deep
Repo
Framework

3D Organ Shape Reconstruction from Topogram Images


Title	3D Organ Shape Reconstruction from Topogram Images
Authors	Elena Balashova, Jiangping Wang, Vivek Singh, Bogdan Georgescu, Brian Teixeira, Ankur Kapoor
Abstract	Automatic delineation and measurement of main organs such as liver is one of the critical steps for assessment of hepatic diseases, planning and postoperative or treatment follow-up. However, addressing this problem typically requires performing computed tomography (CT) scanning and complicated postprocessing of the resulting scans using slice-by-slice techniques. In this paper, we show that 3D organ shape can be automatically predicted directly from topogram images, which are easier to acquire and have limited exposure to radiation during acquisition, compared to CT scans. We evaluate our approach on the challenging task of predicting liver shape using a generative model. We also demonstrate that our method can be combined with user annotations, such as a 2D mask, for improved prediction accuracy. We show compelling results on 3D liver shape reconstruction and volume estimation on 2129 CT scans.
Tasks	Computed Tomography (CT)
Published	2019-03-29
URL	http://arxiv.org/abs/1904.00073v1
PDF	http://arxiv.org/pdf/1904.00073v1.pdf
PWC	https://paperswithcode.com/paper/3d-organ-shape-reconstruction-from-topogram
Repo
Framework

Think Again Networks and the Delta Loss


Title	Think Again Networks and the Delta Loss
Authors	Alexandre Salle, Marcelo Prates
Abstract	This short paper introduces an abstraction called Think Again Networks (ThinkNet) which can be applied to any state-dependent function (such as a recurrent neural network).
Tasks	Language Modelling
Published	2019-04-26
URL	http://arxiv.org/abs/1904.11816v2
PDF	http://arxiv.org/pdf/1904.11816v2.pdf
PWC	https://paperswithcode.com/paper/think-again-networks-the-delta-loss-and-an
Repo
Framework

A Robust Transferable Deep Learning Framework for Cross-sectional Investment Strategy


Title	A Robust Transferable Deep Learning Framework for Cross-sectional Investment Strategy
Authors	Kei Nakagawa, Masaya Abe, Junpei Komiyama
Abstract	Stock return predictability is an important research theme as it reflects our economic and social organization, and significant efforts are made to explain the dynamism therein. Statistics of strong explanative power, called “factor” have been proposed to summarize the essence of predictive stock returns. Although machine learning methods are increasingly popular in stock return prediction, an inference of the stock returns is highly elusive, and still most investors, if partly, rely on their intuition to build a better decision making. The challenge here is to make an investment strategy that is consistent over a reasonably long period, with the minimum human decision on the entire process. To this end, we propose a new stock return prediction framework that we call Ranked Information Coefficient Neural Network (RIC-NN). RIC-NN is a deep learning approach and includes the following three novel ideas: (1) nonlinear multi-factor approach, (2) stopping criteria with ranked information coefficient (rank IC), and (3) deep transfer learning among multiple regions. Experimental comparison with the stocks in the Morgan Stanley Capital International (MSCI) indices shows that RIC-NN outperforms not only off-the-shelf machine learning methods but also the average return of major equity investment funds in the last fourteen years.
Tasks	Decision Making, Transfer Learning
Published	2019-10-02
URL	https://arxiv.org/abs/1910.01491v1
PDF	https://arxiv.org/pdf/1910.01491v1.pdf
PWC	https://paperswithcode.com/paper/a-robust-transferable-deep-learning-framework
Repo
Framework

A Unified Analysis of Extra-gradient and Optimistic Gradient Methods for Saddle Point Problems: Proximal Point Approach


Title	A Unified Analysis of Extra-gradient and Optimistic Gradient Methods for Saddle Point Problems: Proximal Point Approach
Authors	Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil
Abstract	In this paper we consider solving saddle point problems using two variants of Gradient Descent-Ascent algorithms, Extra-gradient (EG) and Optimistic Gradient Descent Ascent (OGDA) methods. We show that both of these algorithms admit a unified analysis as approximations of the classical proximal point method for solving saddle point problems. This viewpoint enables us to develop a new framework for analyzing EG and OGDA for bilinear and strongly convex-strongly concave settings. Moreover, we use the proximal point approximation interpretation to generalize the results for OGDA for a wide range of parameters.
Tasks
Published	2019-01-24
URL	https://arxiv.org/abs/1901.08511v4
PDF	https://arxiv.org/pdf/1901.08511v4.pdf
PWC	https://paperswithcode.com/paper/a-unified-analysis-of-extra-gradient-and
Repo
Framework

Fraud detection in telephone conversations for financial services using linguistic features


Title	Fraud detection in telephone conversations for financial services using linguistic features
Authors	Nikesh Bajaj, Tracy Goodluck Constance, Marvin Rajwadi, Julie Wall, Mansour Moniri, Cornelius Glackin, Nigel Cannings, Chris Woodruff, James Laird
Abstract	Detecting the elements of deception in a conversation is one of the most challenging problems for the AI community. It becomes even more difficult to design a transparent system, which is fully explainable and satisfies the need for financial and legal services to be deployed. This paper presents an approach for fraud detection in transcribed telephone conversations using linguistic features. The proposed approach exploits the syntactic and semantic information of the transcription to extract both the linguistic markers and the sentiment of the customer’s response. We demonstrate the results on real-world financial services data using simple, robust and explainable classifiers such as Naive Bayes, Decision Tree, Nearest Neighbours, and Support Vector Machines.
Tasks	Fraud Detection
Published	2019-12-10
URL	https://arxiv.org/abs/1912.04748v1
PDF	https://arxiv.org/pdf/1912.04748v1.pdf
PWC	https://paperswithcode.com/paper/fraud-detection-in-telephone-conversations
Repo
Framework