October 15, 2019

2640 words 13 mins read

Paper Group NANR 60

Paper Group NANR 60

Deep Reinforcement Learning for NLP. 3D Human Sensing, Action and Emotion Recognition in Robot Assisted Therapy of Children With Autism. Proceedings of the BioNLP 2018 workshop. A Detailed Evaluation of Neural Sequence-to-Sequence Models for In-domain and Cross-domain Text Simplification. Testing for Families of Distributions via the Fourier Transf …

Deep Reinforcement Learning for NLP

Title Deep Reinforcement Learning for NLP
Authors William Yang Wang, Jiwei Li, Xiaodong He
Abstract Many Natural Language Processing (NLP) tasks (including generation, language grounding, reasoning, information extraction, coreference resolution, and dialog) can be formulated as deep reinforcement learning (DRL) problems. However, since language is often discrete and the space for all sentences is infinite, there are many challenges for formulating reinforcement learning problems of NLP tasks. In this tutorial, we provide a gentle introduction to the foundation of deep reinforcement learning, as well as some practical DRL solutions in NLP. We describe recent advances in designing deep reinforcement learning for NLP, with a special focus on generation, dialogue, and information extraction. Finally, we discuss why they succeed, and when they may fail, aiming at providing some practical advice about deep reinforcement learning for solving real-world NLP problems.
Tasks Atari Games, Coreference Resolution, Decision Making, Hierarchical Reinforcement Learning, Representation Learning, Text Classification, Video Captioning
Published 2018-07-01
URL https://www.aclweb.org/anthology/P18-5007/
PDF https://www.aclweb.org/anthology/P18-5007
PWC https://paperswithcode.com/paper/deep-reinforcement-learning-for-nlp
Repo
Framework

3D Human Sensing, Action and Emotion Recognition in Robot Assisted Therapy of Children With Autism

Title 3D Human Sensing, Action and Emotion Recognition in Robot Assisted Therapy of Children With Autism
Authors Elisabeta Marinoiu, Mihai Zanfir, Vlad Olaru, Cristian Sminchisescu
Abstract We introduce new, fine-grained action and emotion recognition tasks defined on non-staged videos, recorded during robot-assisted therapy sessions of children with autism. The tasks present several challenges: a large dataset with long videos, a large number of highly variable actions, children that are only partially visible, have different ages and may show unpredictable behaviour, as well as non-standard camera viewpoints. We investigate how state-of-the-art 3d human pose reconstruction methods perform on the newly introduced tasks and propose extensions to adapt them to deal with these challenges. We also analyze multiple approaches in action and emotion recognition from 3d human pose data, establish several baselines, and discuss results and their implications in the context of child-robot interaction.
Tasks Emotion Recognition
Published 2018-06-01
URL http://openaccess.thecvf.com/content_cvpr_2018/html/Marinoiu_3D_Human_Sensing_CVPR_2018_paper.html
PDF http://openaccess.thecvf.com/content_cvpr_2018/papers/Marinoiu_3D_Human_Sensing_CVPR_2018_paper.pdf
PWC https://paperswithcode.com/paper/3d-human-sensing-action-and-emotion
Repo
Framework

Proceedings of the BioNLP 2018 workshop

Title Proceedings of the BioNLP 2018 workshop
Authors
Abstract
Tasks
Published 2018-07-01
URL https://www.aclweb.org/anthology/W18-2300/
PDF https://www.aclweb.org/anthology/W18-2300
PWC https://paperswithcode.com/paper/proceedings-of-the-bionlp-2018-workshop
Repo
Framework

A Detailed Evaluation of Neural Sequence-to-Sequence Models for In-domain and Cross-domain Text Simplification

Title A Detailed Evaluation of Neural Sequence-to-Sequence Models for In-domain and Cross-domain Text Simplification
Authors Sanja {\v{S}}tajner, Sergiu Nisioi
Abstract
Tasks Machine Translation, Text Simplification
Published 2018-05-01
URL https://www.aclweb.org/anthology/L18-1479/
PDF https://www.aclweb.org/anthology/L18-1479
PWC https://paperswithcode.com/paper/a-detailed-evaluation-of-neural-sequence-to
Repo
Framework

Testing for Families of Distributions via the Fourier Transform

Title Testing for Families of Distributions via the Fourier Transform
Authors Alistair Stewart, Ilias Diakonikolas, Clement Canonne
Abstract We study the general problem of testing whether an unknown discrete distribution belongs to a specified family of distributions. More specifically, given a distribution family P and sample access to an unknown discrete distribution D , we want to distinguish (with high probability) between the case that D in P and the case that D is ε-far, in total variation distance, from every distribution in P . This is the prototypical hypothesis testing problem that has received significant attention in statistics and, more recently, in computer science. The main contribution of this work is a simple and general testing technique that is applicable to all distribution families whose Fourier spectrum satisfies a certain approximate sparsity property. We apply our Fourier-based framework to obtain near sample-optimal and computationally efficient testers for the following fundamental distribution families: Sums of Independent Integer Random Variables (SIIRVs), Poisson Multinomial Distributions (PMDs), and Discrete Log-Concave Distributions. For the first two, ours are the first non-trivial testers in the literature, vastly generalizing previous work on testing Poisson Binomial Distributions. For the third, our tester improves on prior work in both sample and time complexity.
Tasks
Published 2018-12-01
URL http://papers.nips.cc/paper/8210-testing-for-families-of-distributions-via-the-fourier-transform
PDF http://papers.nips.cc/paper/8210-testing-for-families-of-distributions-via-the-fourier-transform.pdf
PWC https://paperswithcode.com/paper/testing-for-families-of-distributions-via-the
Repo
Framework

A single channel sleep-spindle detector based on multivariate classification of EEG epochs: MUSSDET.

Title A single channel sleep-spindle detector based on multivariate classification of EEG epochs: MUSSDET.
Authors DanielLachner-Piza, Nino Epitashvili, Andreas Schulze-Bonhage, Thomas Stieglitz, Julia Jacobs, Matthias Dümpelmann
Abstract BACKGROUND: Studies on sleep-spindles are typically based on visual-marks performed by experts, however this process is time consuming and presents a low inter-expert agreement, causing the data to be limited in quantity and prone to bias. An automatic detector would tackle these issues by generating large amounts of objectively marked data. NEW METHOD: Our goal was to develop a sensitive, precise and robust sleep-spindle detection method. Emphasis has been placed on achieving a consistent performance across heterogeneous recordings and without the need for further parameter fine tuning. The developed detector runs on a single channel and is based on multivariate classification using a support vector machine. Scalp-electroencephalogram recordings were segmented into epochs which were then characterized by a selection of relevant and non-redundant features. The training and validation data came from the Medical Center-University of Freiburg, the test data consisted of 27 records coming from 2 public databases. RESULTS: Using a sample based assessment, 53% sensitivity, 37% precision and 96% specificity was achieved on the DREAMS database. On the MASS database, 77% sensitivity, 46% precision and 96% specificity was achieved. The developed detector performed favorably when compared to previous detectors. The classification of normalized EEG epochs in a multidimensional space, as well as the use of a validation set, allowed to objectively define a single detection threshold for all databases and participants. CONCLUSIONS: The use of the developed tool will allow increasing the data-size and statistical significance of research studies on the role of sleep-spindles.
Tasks EEG, Spindle Detection
Published 2018-03-01
URL https://doi.org/10.1016/j.jneumeth.2017.12.023
PDF https://www.deepdyve.com/lp/elsevier/a-single-channel-sleep-spindle-detector-based-on-multivariate-grFKFTd9gR#bsSignUpModal
PWC https://paperswithcode.com/paper/a-single-channel-sleep-spindle-detector-based
Repo
Framework

Information-based Adaptive Stimulus Selection to Optimize Communication Efficiency in Brain-Computer Interfaces

Title Information-based Adaptive Stimulus Selection to Optimize Communication Efficiency in Brain-Computer Interfaces
Authors Boyla Mainsah, Dmitry Kalika, Leslie Collins, Siyuan Liu, Chandra Throckmorton
Abstract Stimulus-driven brain-computer interfaces (BCIs), such as the P300 speller, rely on using a sequence of sensory stimuli to elicit specific neural responses as control signals, while a user attends to relevant target stimuli that occur within the sequence. In current BCIs, the stimulus presentation schedule is typically generated in a pseudo-random fashion. Given the non-stationarity of brain electrical signals, a better strategy could be to adapt the stimulus presentation schedule in real-time by selecting the optimal stimuli that will maximize the signal-to-noise ratios of the elicited neural responses and provide the most information about the user’s intent based on the uncertainties of the data being measured. However, the high-dimensional stimulus space limits the development of algorithms with tractable solutions for optimized stimulus selection to allow for real-time decision-making within the stringent time requirements of BCI processing. We derive a simple analytical solution of an information-based objective function for BCI stimulus selection by transforming the high-dimensional stimulus space into a one-dimensional space that parameterizes the objective function - the prior probability mass of the stimulus under consideration, irrespective of its contents. We demonstrate the utility of our adaptive stimulus selection algorithm in improving BCI performance with results from simulation and real-time human experiments.
Tasks Decision Making
Published 2018-12-01
URL http://papers.nips.cc/paper/7731-information-based-adaptive-stimulus-selection-to-optimize-communication-efficiency-in-brain-computer-interfaces
PDF http://papers.nips.cc/paper/7731-information-based-adaptive-stimulus-selection-to-optimize-communication-efficiency-in-brain-computer-interfaces.pdf
PWC https://paperswithcode.com/paper/information-based-adaptive-stimulus-selection
Repo
Framework

LanguageNet: Learning to Find Sense Relevant Example Sentences

Title LanguageNet: Learning to Find Sense Relevant Example Sentences
Authors Shang-Chien Cheng, Jhih-Jie Chen, Chingyu Yang, Jason Chang
Abstract In this paper, we present a system, LanguageNet, which can help second language learners to search for different meanings and usages of a word. We disambiguate word senses based on the pairs of an English word and its corresponding Chinese translations in a parallel corpus, UM-Corpus. The process involved performing word alignment, learning vector space representations of words and training a classifier to distinguish words into groups of senses. LanguageNet directly shows the definition of a sense, bilingual synonyms and sense relevant examples.
Tasks Word Alignment
Published 2018-08-01
URL https://www.aclweb.org/anthology/C18-2022/
PDF https://www.aclweb.org/anthology/C18-2022
PWC https://paperswithcode.com/paper/languagenet-learning-to-find-sense-relevant
Repo
Framework

Face Recognition with Contrastive Convolution

Title Face Recognition with Contrastive Convolution
Authors Chunrui Han, Shiguang Shan, Meina Kan, Shuzhe Wu, Xilin Chen
Abstract In current face recognition approaches with convolutional neural network (CNN), a pair of faces to compare are independently fed into the CNN for feature extraction. For both faces the same kernels are applied and hence the representation of a face stays fixed regardless of who it is compared with. As for us humans, however, one generally focuses on varied characteristics of a face when comparing it with distinct persons. Inspired, we propose a novel CNN structure with what we referred to as contrastive convolution, which specifically focuses on the distinct characteristics between the two faces to compare, i.e., those contrastive characteristics. Extensive experiments on the challenging LFW, and IJB-A show that our proposed contrastive convolution significantly improves the vanilla CNN and achieves quite promising performance in face verification task.
Tasks Face Recognition, Face Verification
Published 2018-09-01
URL http://openaccess.thecvf.com/content_ECCV_2018/html/Chunrui_Han_Face_Recognition_with_ECCV_2018_paper.html
PDF http://openaccess.thecvf.com/content_ECCV_2018/papers/Chunrui_Han_Face_Recognition_with_ECCV_2018_paper.pdf
PWC https://paperswithcode.com/paper/face-recognition-with-contrastive-convolution
Repo
Framework

Conditional Generative Adversarial Network for Structured Domain Adaptation

Title Conditional Generative Adversarial Network for Structured Domain Adaptation
Authors Weixiang Hong, Zhenzhen Wang, Ming Yang, Junsong Yuan
Abstract In recent years, deep neural nets have triumphed over many computer vision problems, including semantic segmentation, which is a critical task in emerging autonomous driving and medical image diagnostics applications. In general, training deep neural nets requires a humongous amount of labeled data, which is laborious and costly to collect and annotate. Recent advances in computer graphics shed light on utilizing photo-realistic synthetic data with computer generated annotations to train neural nets. Nevertheless, the domain mismatch between real images and synthetic ones is the major challenge against harnessing the generated data and labels. In this paper, we propose a principled way to conduct structured domain adaption for semantic segmentation, i.e., integrating GAN into the FCN framework to mitigate the gap between source and target domains. Specifically, we learn a conditional generator to transform features of synthetic images to real-image like features, and a discriminator to distinguish them. For each training batch, the conditional generator and the discriminator compete against each other so that the generator learns to produce real-image like features to fool the discriminator; afterwards, the FCN parameters are updated to accommodate the changes of GAN. In experiments, without using labels of real image data, our method significantly outperforms the baselines as well as state-of-the-art methods by 12% ∼ 20% mean IoU on the Cityscapes dataset.
Tasks Autonomous Driving, Domain Adaptation, Semantic Segmentation
Published 2018-06-01
URL http://openaccess.thecvf.com/content_cvpr_2018/html/Hong_Conditional_Generative_Adversarial_CVPR_2018_paper.html
PDF http://openaccess.thecvf.com/content_cvpr_2018/papers/Hong_Conditional_Generative_Adversarial_CVPR_2018_paper.pdf
PWC https://paperswithcode.com/paper/conditional-generative-adversarial-network
Repo
Framework

Regret Bounds for Online Portfolio Selection with a Cardinality Constraint

Title Regret Bounds for Online Portfolio Selection with a Cardinality Constraint
Authors Shinji Ito, Daisuke Hatano, Sumita Hanna, Akihiro Yabe, Takuro Fukunaga, Naonori Kakimura, Ken-Ichi Kawarabayashi
Abstract Online portfolio selection is a sequential decision-making problem in which a learner repetitively selects a portfolio over a set of assets, aiming to maximize long-term return. In this paper, we study the problem with the cardinality constraint that the number of assets in a portfolio is restricted to be at most k, and consider two scenarios: (i) in the full-feedback setting, the learner can observe price relatives (rates of return to cost) for all assets, and (ii) in the bandit-feedback setting, the learner can observe price relatives only for invested assets. We propose efficient algorithms for these scenarios that achieve sublinear regrets. We also provide regret (statistical) lower bounds for both scenarios which nearly match the upper bounds when k is a constant. In addition, we give a computational lower bound which implies that no algorithm maintains both computational efficiency, as well as a small regret upper bound.
Tasks Decision Making
Published 2018-12-01
URL http://papers.nips.cc/paper/8258-regret-bounds-for-online-portfolio-selection-with-a-cardinality-constraint
PDF http://papers.nips.cc/paper/8258-regret-bounds-for-online-portfolio-selection-with-a-cardinality-constraint.pdf
PWC https://paperswithcode.com/paper/regret-bounds-for-online-portfolio-selection
Repo
Framework

Improving Unsupervised Keyphrase Extraction using Background Knowledge

Title Improving Unsupervised Keyphrase Extraction using Background Knowledge
Authors Yang Yu, Vincent Ng
Abstract
Tasks
Published 2018-05-01
URL https://www.aclweb.org/anthology/L18-1588/
PDF https://www.aclweb.org/anthology/L18-1588
PWC https://paperswithcode.com/paper/improving-unsupervised-keyphrase-extraction
Repo
Framework

Interpretable Neural Architectures for Attributing an Ad’s Performance to its Writing Style

Title Interpretable Neural Architectures for Attributing an Ad’s Performance to its Writing Style
Authors Reid Pryzant, Sugato Basu, Kazoo Sone
Abstract How much does {``}free shipping!{''} help an advertisement{'}s ability to persuade? This paper presents two methods for \textit{performance attribution}: finding the degree to which an outcome can be attributed to parts of a text while controlling for potential confounders. Both algorithms are based on interpreting the behaviors and parameters of trained neural networks. One method uses a CNN to encode the text, an adversarial objective function to control for confounders, and projects its weights onto its activations to interpret the importance of each phrase towards each output class. The other method leverages residualization to control for confounds and performs interpretation by aggregating over learned word vectors. We demonstrate these algorithms{'} efficacy on 118,000 internet search advertisements and outcomes, finding language indicative of high and low click through rate (CTR) regardless of who the ad is by or what it is for. Our results suggest the proposed algorithms are high performance and data efficient, able to glean actionable insights from fewer than 10,000 data points. We find that quick, easy, and authoritative language is associated with success, while lackluster embellishment is related to failure. These findings agree with the advertising industry{'}s emperical wisdom, automatically revealing insights which previously required manual A/B testing to discover. |
Tasks Interpretable Machine Learning
Published 2018-11-01
URL https://www.aclweb.org/anthology/W18-5415/
PDF https://www.aclweb.org/anthology/W18-5415
PWC https://paperswithcode.com/paper/interpretable-neural-architectures-for
Repo
Framework

Leveraging Gloss Knowledge in Neural Word Sense Disambiguation by Hierarchical Co-Attention

Title Leveraging Gloss Knowledge in Neural Word Sense Disambiguation by Hierarchical Co-Attention
Authors Fuli Luo, Tianyu Liu, Zexue He, Qiaolin Xia, Zhifang Sui, Baobao Chang
Abstract The goal of Word Sense Disambiguation (WSD) is to identify the correct meaning of a word in the particular context. Traditional supervised methods only use labeled data (context), while missing rich lexical knowledge such as the gloss which defines the meaning of a word sense. Recent studies have shown that incorporating glosses into neural networks for WSD has made significant improvement. However, the previous models usually build the context representation and gloss representation separately. In this paper, we find that the learning for the context and gloss representation can benefit from each other. Gloss can help to highlight the important words in the context, thus building a better context representation. Context can also help to locate the key words in the gloss of the correct word sense. Therefore, we introduce a co-attention mechanism to generate co-dependent representations for the context and gloss. Furthermore, in order to capture both word-level and sentence-level information, we extend the attention mechanism in a hierarchical fashion. Experimental results show that our model achieves the state-of-the-art results on several standard English all-words WSD test datasets.
Tasks Word Sense Disambiguation
Published 2018-10-01
URL https://www.aclweb.org/anthology/D18-1170/
PDF https://www.aclweb.org/anthology/D18-1170
PWC https://paperswithcode.com/paper/leveraging-gloss-knowledge-in-neural-word
Repo
Framework

Deep Parametric Continuous Convolutional Neural Networks

Title Deep Parametric Continuous Convolutional Neural Networks
Authors Shenlong Wang, Simon Suo, Wei-Chiu Ma, Andrei Pokrovsky, Raquel Urtasun
Abstract Standard convolutional neural networks assume a grid structured input is available and exploit discrete convolutions as their fundamental building blocks. This limits their applicability to many real-world applications. In this paper we propose Parametric Continuous Convolution, a new learnable operator that operates over non-grid structured data. The key idea is to exploit parameterized kernel functions that span the full continuous vector space. This generalization allows us to learn over arbitrary data structures as long as their support relationship is computable. Our experiments show significant improvement over the state-of-the-art in point cloud segmentation of indoor and outdoor scenes, and lidar motion estimation of driving scenes.
Tasks Motion Estimation
Published 2018-06-01
URL http://openaccess.thecvf.com/content_cvpr_2018/html/Wang_Deep_Parametric_Continuous_CVPR_2018_paper.html
PDF http://openaccess.thecvf.com/content_cvpr_2018/papers/Wang_Deep_Parametric_Continuous_CVPR_2018_paper.pdf
PWC https://paperswithcode.com/paper/deep-parametric-continuous-convolutional
Repo
Framework
comments powered by Disqus