October 17, 2019

2530 words 12 mins read

Paper Group ANR 950

Deep Diabetologist: Learning to Prescribe Hyperglycemia Medications with Hierarchical Recurrent Neural Networks. Error Detection in a Large-Scale Lexical Taxonomy. Cleaning up the neighborhood: A full classification for adversarial partial monitoring. The Dynamics of Learning: A Random Matrix Approach. User-Guided Deep Anime Line Art Colorization w …

Deep Diabetologist: Learning to Prescribe Hyperglycemia Medications with Hierarchical Recurrent Neural Networks


Title	Deep Diabetologist: Learning to Prescribe Hyperglycemia Medications with Hierarchical Recurrent Neural Networks
Authors	Jing Mei, Shiwan Zhao, Feng Jin, Eryu Xia, Haifeng Liu, Xiang Li
Abstract	In healthcare, applying deep learning models to electronic health records (EHRs) has drawn considerable attention. EHR data consist of a sequence of medical visits, i.e. a multivariate time series of diagnosis, medications, physical examinations, lab tests, etc. This sequential nature makes EHR well matching the power of Recurrent Neural Network (RNN). In this paper, we propose “Deep Diabetologist” - using RNNs for EHR sequential data modelling, to provide the personalized hyperglycemia medication prediction for diabetic patients. Particularly, we develop a hierarchical RNN to capture the heterogeneous sequential information in the EHR data. Our experimental results demonstrate the improved performance, compared with a baseline classifier using logistic regression. Moreover, hierarchical RNN models outperform basic ones, providing deeper data insights for clinical decision support.
Tasks	Time Series
Published	2018-10-17
URL	http://arxiv.org/abs/1810.07692v1
PDF	http://arxiv.org/pdf/1810.07692v1.pdf
PWC	https://paperswithcode.com/paper/deep-diabetologist-learning-to-prescribe
Repo
Framework

Error Detection in a Large-Scale Lexical Taxonomy


Title	Error Detection in a Large-Scale Lexical Taxonomy
Authors	Sifan Liu, Hongzhi Wang
Abstract	Knowledge base (KB) is an important aspect in artificial intelligence. One significant challenge faced by KB construction is that it contains many noises, which prevents its effective usage. Even though some KB cleansing algorithms have been proposed, they focus on the structure of the knowledge graph and neglect the relation between the concepts, which could be helpful to discover wrong relations in KB. Motived by this, we measure the relation of two concepts by the distance between their corresponding instances and detect errors within the intersection of the conflicting concept sets. For efficient and effective knowledge base cleansing, we first apply a distance-based Model to determine the conflicting concept sets using two different methods. Then, we propose and analyze several algorithms on how to detect and repairing the errors based on our model, where we use hash method for an efficient way to calculate distance. Experimental results demonstrate that the proposed approaches could cleanse the knowledge bases efficiently and effectively.
Tasks
Published	2018-08-05
URL	http://arxiv.org/abs/1808.01690v1
PDF	http://arxiv.org/pdf/1808.01690v1.pdf
PWC	https://paperswithcode.com/paper/error-detection-in-a-large-scale-lexical
Repo
Framework

Cleaning up the neighborhood: A full classification for adversarial partial monitoring


Title	Cleaning up the neighborhood: A full classification for adversarial partial monitoring
Authors	Tor Lattimore, Csaba Szepesvari
Abstract	Partial monitoring is a generalization of the well-known multi-armed bandit framework where the loss is not directly observed by the learner. We complete the classification of finite adversarial partial monitoring to include all games, solving an open problem posed by Bartok et al. [2014]. Along the way we simplify and improve existing algorithms and correct errors in previous analyses. Our second contribution is a new algorithm for the class of games studied by Bartok [2013] where we prove upper and lower regret bounds that shed more light on the dependence of the regret on the game structure.
Tasks
Published	2018-05-23
URL	http://arxiv.org/abs/1805.09247v1
PDF	http://arxiv.org/pdf/1805.09247v1.pdf
PWC	https://paperswithcode.com/paper/cleaning-up-the-neighborhood-a-full
Repo
Framework

The Dynamics of Learning: A Random Matrix Approach


Title	The Dynamics of Learning: A Random Matrix Approach
Authors	Zhenyu Liao, Romain Couillet
Abstract	Understanding the learning dynamics of neural networks is one of the key issues for the improvement of optimization algorithms as well as for the theoretical comprehension of why deep neural nets work so well today. In this paper, we introduce a random matrix-based framework to analyze the learning dynamics of a single-layer linear network on a binary classification problem, for data of simultaneously large dimension and size, trained by gradient descent. Our results provide rich insights into common questions in neural nets, such as overfitting, early stopping and the initialization of training, thereby opening the door for future studies of more elaborate structures and models appearing in today’s neural networks.
Tasks
Published	2018-05-30
URL	http://arxiv.org/abs/1805.11917v2
PDF	http://arxiv.org/pdf/1805.11917v2.pdf
PWC	https://paperswithcode.com/paper/the-dynamics-of-learning-a-random-matrix
Repo
Framework

User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks


Title	User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks
Authors	Yuanzheng Ci, Xinzhu Ma, Zhihui Wang, Haojie Li, Zhongxuan Luo
Abstract	Scribble colors based line art colorization is a challenging computer vision problem since neither greyscale values nor semantic information is presented in line arts, and the lack of authentic illustration-line art training pairs also increases difficulty of model generalization. Recently, several Generative Adversarial Nets (GANs) based methods have achieved great success. They can generate colorized illustrations conditioned on given line art and color hints. However, these methods fail to capture the authentic illustration distributions and are hence perceptually unsatisfying in the sense that they often lack accurate shading. To address these challenges, we propose a novel deep conditional adversarial architecture for scribble based anime line art colorization. Specifically, we integrate the conditional framework with WGAN-GP criteria as well as the perceptual loss to enable us to robustly train a deep network that makes the synthesized images more natural and real. We also introduce a local features network that is independent of synthetic data. With GANs conditioned on features from such network, we notably increase the generalization capability over “in the wild” line arts. Furthermore, we collect two datasets that provide high-quality colorful illustrations and authentic line arts for training and benchmarking. With the proposed model trained on our illustration dataset, we demonstrate that images synthesized by the presented approach are considerably more realistic and precise than alternative approaches.
Tasks	Colorization, Line Art Colorization
Published	2018-08-09
URL	http://arxiv.org/abs/1808.03240v2
PDF	http://arxiv.org/pdf/1808.03240v2.pdf
PWC	https://paperswithcode.com/paper/user-guided-deep-anime-line-art-colorization
Repo
Framework

Linear and Nonlinear Identification of Dryer System Using Artificial Intelligence and Neural Networks


Title	Linear and Nonlinear Identification of Dryer System Using Artificial Intelligence and Neural Networks
Authors	Mostafa Darvishi
Abstract	As you read these words you are using a complex biological neural network. You have a highly interconnected set of some neurons to facilitate your reading, breathing, motion and thinking. Each of your biological neurons, a rich assembly of tissue and chemistry, has the complexity, if not the speed, of a microprocessor. Some of your neural structure was with you at birth. Other parts have been established by experience.
Tasks
Published	2018-11-16
URL	http://arxiv.org/abs/1811.11534v2
PDF	http://arxiv.org/pdf/1811.11534v2.pdf
PWC	https://paperswithcode.com/paper/linear-and-nonlinear-identification-of-dryer
Repo
Framework

Machine Learning for semi linear PDEs


Title	Machine Learning for semi linear PDEs
Authors	Quentin Chan-Wai-Nam, Joseph Mikael, Xavier Warin
Abstract	Recent machine learning algorithms dedicated to solving semi-linear PDEs are improved by using different neural network architectures and different parameterizations. These algorithms are compared to a new one that solves a fixed point problem by using deep learning techniques. This new algorithm appears to be competitive in terms of accuracy with the best existing algorithms.
Tasks
Published	2018-09-20
URL	http://arxiv.org/abs/1809.07609v2
PDF	http://arxiv.org/pdf/1809.07609v2.pdf
PWC	https://paperswithcode.com/paper/machine-learning-for-semi-linear-pdes
Repo
Framework

Quantum Neural Network and Soft Quantum Computing


Title	Quantum Neural Network and Soft Quantum Computing
Authors	Zeng-Bing Chen
Abstract	A new paradigm of quantum computing, namely, soft quantum computing, is proposed for nonclassical computation using real world quantum systems with naturally occurring environment-induced decoherence and dissipation. As a specific example of soft quantum computing, we suggest a quantum neural network, where the neurons connect pairwise via the “controlled Kraus operations”, hoping to pave an easier and more realistic way to quantum artificial intelligence and even to better understanding certain functioning of the human brain. Our quantum neuron model mimics as much as possible the realistic neurons and meanwhile, uses quantum laws for processing information. The quantum features of the noisy neural network are uncovered by the presence of quantum discord and by non-commutability of quantum operations. We believe that our model puts quantum computing into a wider context and inspires the hope to build a soft quantum computer much earlier than the standard one.
Tasks
Published	2018-10-10
URL	http://arxiv.org/abs/1810.05025v1
PDF	http://arxiv.org/pdf/1810.05025v1.pdf
PWC	https://paperswithcode.com/paper/quantum-neural-network-and-soft-quantum
Repo
Framework

Learning to Globally Edit Images with Textual Description


Title	Learning to Globally Edit Images with Textual Description
Authors	Hai Wang, Jason D. Williams, SingBing Kang
Abstract	We show how we can globally edit images using textual instructions: given a source image and a textual instruction for the edit, generate a new image transformed under this instruction. To tackle this novel problem, we develop three different trainable models based on RNN and Generative Adversarial Network (GAN). The models (bucket, filter bank, and end-to-end) differ in how much expert knowledge is encoded, with the most general version being purely end-to-end. To train these systems, we use Amazon Mechanical Turk to collect textual descriptions for around 2000 image pairs sampled from several datasets. Experimental results evaluated on our dataset validate our approaches. In addition, given that the filter bank model is a good compromise between generality and performance, we investigate it further by replacing RNN with Graph RNN, and show that Graph RNN improves performance. To the best of our knowledge, this is the first computational photography work on global image editing that is purely based on free-form textual instructions.
Tasks
Published	2018-10-13
URL	http://arxiv.org/abs/1810.05786v1
PDF	http://arxiv.org/pdf/1810.05786v1.pdf
PWC	https://paperswithcode.com/paper/learning-to-globally-edit-images-with-textual
Repo
Framework

Global Pose Estimation with an Attention-based Recurrent Network


Title	Global Pose Estimation with an Attention-based Recurrent Network
Authors	Emilio Parisotto, Devendra Singh Chaplot, Jian Zhang, Ruslan Salakhutdinov
Abstract	The ability for an agent to localize itself within an environment is crucial for many real-world applications. For unknown environments, Simultaneous Localization and Mapping (SLAM) enables incremental and concurrent building of and localizing within a map. We present a new, differentiable architecture, Neural Graph Optimizer, progressing towards a complete neural network solution for SLAM by designing a system composed of a local pose estimation model, a novel pose selection module, and a novel graph optimization process. The entire architecture is trained in an end-to-end fashion, enabling the network to automatically learn domain-specific features relevant to the visual odometry and avoid the involved process of feature engineering. We demonstrate the effectiveness of our system on a simulated 2D maze and the 3D ViZ-Doom environment.
Tasks	Feature Engineering, Pose Estimation, Simultaneous Localization and Mapping, Visual Odometry
Published	2018-02-19
URL	http://arxiv.org/abs/1802.06857v1
PDF	http://arxiv.org/pdf/1802.06857v1.pdf
PWC	https://paperswithcode.com/paper/global-pose-estimation-with-an-attention
Repo
Framework

Evaluating Gammatone Frequency Cepstral Coefficients with Neural Networks for Emotion Recognition from Speech


Title	Evaluating Gammatone Frequency Cepstral Coefficients with Neural Networks for Emotion Recognition from Speech
Authors	Gabrielle K. Liu
Abstract	Current approaches to speech emotion recognition focus on speech features that can capture the emotional content of a speech signal. Mel Frequency Cepstral Coefficients (MFCCs) are one of the most commonly used representations for audio speech recognition and classification. This paper proposes Gammatone Frequency Cepstral Coefficients (GFCCs) as a potentially better representation of speech signals for emotion recognition. The effectiveness of MFCC and GFCC representations are compared and evaluated over emotion and intensity classification tasks with fully connected and recurrent neural network architectures. The results provide evidence that GFCCs outperform MFCCs in speech emotion recognition.
Tasks	Emotion Recognition, Speech Emotion Recognition, Speech Recognition
Published	2018-06-23
URL	http://arxiv.org/abs/1806.09010v1
PDF	http://arxiv.org/pdf/1806.09010v1.pdf
PWC	https://paperswithcode.com/paper/evaluating-gammatone-frequency-cepstral
Repo
Framework


Title	Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
Authors	Soo-Whan Chung, Joon Son Chung, Hong-Goo Kang
Abstract	This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to find the most relevant audio segment given a short video clip. The method builds on the recent advances in learning representations from cross-modal self-supervision. The main contributions of this paper are as follows: (1) we propose a new learning strategy where the embeddings are learnt via a multi-way matching problem, as opposed to a binary classification (matching or non-matching) problem as proposed by recent papers; (2) we demonstrate that performance of this method far exceeds the existing baselines on the synchronization task; (3) we use the learnt embeddings for visual speech recognition in self-supervision, and show that the performance matches the representations learnt end-to-end in a fully-supervised manner.
Tasks	Cross-Modal Retrieval, Speech Recognition, Video Synchronization, Visual Speech Recognition
Published	2018-09-21
URL	http://arxiv.org/abs/1809.08001v2
PDF	http://arxiv.org/pdf/1809.08001v2.pdf
PWC	https://paperswithcode.com/paper/perfect-match-improved-cross-modal-embeddings
Repo
Framework

Improving Explorability in Variational Inference with Annealed Variational Objectives


Title	Improving Explorability in Variational Inference with Annealed Variational Objectives
Authors	Chin-Wei Huang, Shawn Tan, Alexandre Lacoste, Aaron Courville
Abstract	Despite the advances in the representational capacity of approximate distributions for variational inference, the optimization process can still limit the density that is ultimately learned. We demonstrate the drawbacks of biasing the true posterior to be unimodal, and introduce Annealed Variational Objectives (AVO) into the training of hierarchical variational methods. Inspired by Annealed Importance Sampling, the proposed method facilitates learning by incorporating energy tempering into the optimization objective. In our experiments, we demonstrate our method’s robustness to deterministic warm up, and the benefits of encouraging exploration in the latent space.
Tasks
Published	2018-09-06
URL	http://arxiv.org/abs/1809.01818v3
PDF	http://arxiv.org/pdf/1809.01818v3.pdf
PWC	https://paperswithcode.com/paper/improving-explorability-in-variational
Repo
Framework

Object Sorting Using a Global Texture-Shape 3D Feature Descriptor


Title	Object Sorting Using a Global Texture-Shape 3D Feature Descriptor
Authors	Zhun Fan, Zhongxing Li, Benzhang Qiu, Wenji Li, Jianye Hu, Alex Noel Josephraj, Heping Chen
Abstract	Object recognition and grasping plays a key role in robotic systems, especially for the autonomous robots to implement object sorting tasks in a warehouse. In this paper, we present a global texture-shape 3D feature descriptor which can be utilized in a system of object recognition and grasping, and can perform object sorting tasks well. Our proposed descriptor stems from the clustered viewpoint feature histogram (CVFH), which relies on the geometrical information of the whole 3D object surface only, and can not perform well in recognizing the objects with similar geometrical information. Therefore, we extend the CVFH descriptor with texture and color information to generate a new global 3D feature descriptor. The proposed descriptor is evaluated in tasks of recognizing and classifying 3D objects by applying multi-class support vector machines (SVM) in both public 3D image dataset and real scenes. The results of evaluation show that the proposed descriptor achieves a significant better performance for object recognition compared with the original CVFH. Then, the proposed descriptor is applied in our object recognition and grasping system, showing that the proposed descriptor helps the system implement the object recognition, object grasping and object sorting tasks well.
Tasks	Object Detection, Object Recognition
Published	2018-02-04
URL	https://arxiv.org/abs/1802.01116v3
PDF	https://arxiv.org/pdf/1802.01116v3.pdf
PWC	https://paperswithcode.com/paper/object-detection-and-sorting-by-using-a
Repo
Framework

FrameNet automatic analysis : a study on a French corpus of encyclopedic texts


Title	FrameNet automatic analysis : a study on a French corpus of encyclopedic texts
Authors	Gabriel Marzinotto, Géraldine Damnati, Frederic Bechet
Abstract	This article presents an automatic frame analysis system evaluated on a corpus of French encyclopedic history texts annotated according to the FrameNet formalism. The chosen approach relies on an integrated sequence labeling model which jointly optimizes frame identification and semantic role segmentation and identification. The purpose of this study is to analyze the task complexity from several dimensions. Hence we provide detailed evaluations from a feature selection point of view and from the data point of view.
Tasks	Feature Selection
Published	2018-12-19
URL	http://arxiv.org/abs/1812.08044v1
PDF	http://arxiv.org/pdf/1812.08044v1.pdf
PWC	https://paperswithcode.com/paper/framenet-automatic-analysis-a-study-on-a
Repo
Framework