January 25, 2020

2448 words 12 mins read

Paper Group NANR 87

Paper Group NANR 87

Talk The Walk: Navigating Grids in New York City through Grounded Dialogue. Serial Recall Effects in Neural Language Modeling. Learning Morphosyntactic Analyzers from the Bible via Iterative Annotation Projection across 26 Languages. Neural Machine Translation with Reordering Embeddings. Auditing Deep Learning processes through Kernel-based Explana …

Talk The Walk: Navigating Grids in New York City through Grounded Dialogue

Title Talk The Walk: Navigating Grids in New York City through Grounded Dialogue
Authors Harm de Vries, Kurt Shuster, Dhruv Batra, Devi Parikh, Jason Weston, Douwe Kiela
Abstract We introduce `"Talk The Walk”, the first large-scale dialogue dataset grounded in action and perception. The task involves two agents (a ‘guide’ and a ‘tourist’) that communicate via natural language in order to achieve a common goal: having the tourist navigate to a given target location. The task and dataset, which are described in detail, are challenging and their full solution is an open problem that we pose to the community. We (i) focus on the task of tourist localization and develop the novel Masked Attention for Spatial Convolutions (MASC) mechanism that allows for grounding tourist utterances into the guide’s map, (ii) show it yields significant improvements for both emergent and natural language communication, and (iii) using this method, we establish non-trivial baselines on the full task. |
Tasks
Published 2019-05-01
URL https://openreview.net/forum?id=HyxhusA9Fm
PDF https://openreview.net/pdf?id=HyxhusA9Fm
PWC https://paperswithcode.com/paper/talk-the-walk-navigating-grids-in-new-york
Repo
Framework

Serial Recall Effects in Neural Language Modeling

Title Serial Recall Effects in Neural Language Modeling
Authors Hassan Hajipoor, Hadi Amiri, Maseud Rahgozar, Farhad Oroumchian
Abstract Serial recall experiments study the ability of humans to recall words in the order in which they occurred. The following serial recall effects are generally investigated in studies with humans: word length and frequency, primacy and recency, semantic confusion, repetition, and transposition effects. In this research, we investigate neural language models in the context of these serial recall effects. Our work provides a framework to better understand and analyze neural language models and opens a new window to develop accurate language models.
Tasks Language Modelling
Published 2019-06-01
URL https://www.aclweb.org/anthology/N19-1073/
PDF https://www.aclweb.org/anthology/N19-1073
PWC https://paperswithcode.com/paper/serial-recall-effects-in-neural-language
Repo
Framework

Learning Morphosyntactic Analyzers from the Bible via Iterative Annotation Projection across 26 Languages

Title Learning Morphosyntactic Analyzers from the Bible via Iterative Annotation Projection across 26 Languages
Authors Garrett Nicolai, David Yarowsky
Abstract A large percentage of computational tools are concentrated in a very small subset of the planet{'}s languages. Compounding the issue, many languages lack the high-quality linguistic annotation necessary for the construction of such tools with current machine learning methods. In this paper, we address both issues simultaneously: leveraging the high accuracy of English taggers and parsers, we project morphological information onto translations of the Bible in 26 varied test languages. Using an iterative discovery, constraint, and training process, we build inflectional lexica in the target languages. Through a combination of iteration, ensembling, and reranking, we see double-digit relative error reductions in lemmatization and morphological analysis over a strong initial system.
Tasks Lemmatization, Morphological Analysis
Published 2019-07-01
URL https://www.aclweb.org/anthology/P19-1172/
PDF https://www.aclweb.org/anthology/P19-1172
PWC https://paperswithcode.com/paper/learning-morphosyntactic-analyzers-from-the
Repo
Framework

Neural Machine Translation with Reordering Embeddings

Title Neural Machine Translation with Reordering Embeddings
Authors Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita
Abstract The reordering model plays an important role in phrase-based statistical machine translation. However, there are few works that exploit the reordering information in neural machine translation. In this paper, we propose a reordering mechanism to learn the reordering embedding of a word based on its contextual information. These learned reordering embeddings are stacked together with self-attention networks to learn sentence representation for machine translation. The reordering mechanism can be easily integrated into both the encoder and the decoder in the Transformer translation system. Experimental results on WMT{'}14 English-to-German, NIST Chinese-to-English, and WAT Japanese-to-English translation tasks demonstrate that the proposed methods can significantly improve the performance of the Transformer.
Tasks Machine Translation
Published 2019-07-01
URL https://www.aclweb.org/anthology/P19-1174/
PDF https://www.aclweb.org/anthology/P19-1174
PWC https://paperswithcode.com/paper/neural-machine-translation-with-reordering
Repo
Framework

Auditing Deep Learning processes through Kernel-based Explanatory Models

Title Auditing Deep Learning processes through Kernel-based Explanatory Models
Authors Danilo Croce, Daniele Rossini, Roberto Basili
Abstract While NLP systems become more pervasive, their accountability gains value as a focal point of effort. Epistemological opaqueness of nonlinear learning methods, such as deep learning models, can be a major drawback for their adoptions. In this paper, we discuss the application of Layerwise Relevance Propagation over a linguistically motivated neural architecture, the Kernel-based Deep Architecture, in order to trace back connections between linguistic properties of input instances and system decisions. Such connections then guide the construction of argumentations on network{'}s inferences, i.e., explanations based on real examples, semantically related to the input. We propose here a methodology to evaluate the transparency and coherence of analogy-based explanations modeling an audit stage for the system. Quantitative analysis on two semantic tasks, i.e., question classification and semantic role labeling, show that the explanatory capabilities (native in KDAs) are effective and they pave the way to more complex argumentation methods.
Tasks Semantic Role Labeling
Published 2019-11-01
URL https://www.aclweb.org/anthology/D19-1415/
PDF https://www.aclweb.org/anthology/D19-1415
PWC https://paperswithcode.com/paper/auditing-deep-learning-processes-through
Repo
Framework

Study on Unsupervised Statistical Machine Translation for Backtranslation

Title Study on Unsupervised Statistical Machine Translation for Backtranslation
Authors Anush Kumar, Nihal V. Nayak, Ch, Aditya ra, Mydhili K. Nair
Abstract Machine Translation systems have drastically improved over the years for several language pairs. Monolingual data is often used to generate synthetic sentences to augment the training data which has shown to improve the performance of machine translation models. In our paper, we make use of an Unsupervised Statistical Machine Translation (USMT) to generate synthetic sentences. Our study compares the performance improvements in Neural Machine Translation model when using synthetic sentences from supervised and unsupervised Machine Translation models. Our approach of using USMT for backtranslation shows promise in low resource conditions and achieves an improvement of 3.2 BLEU score over the Neural Machine Translation model.
Tasks Machine Translation, Unsupervised Machine Translation
Published 2019-09-01
URL https://www.aclweb.org/anthology/R19-1068/
PDF https://www.aclweb.org/anthology/R19-1068
PWC https://paperswithcode.com/paper/study-on-unsupervised-statistical-machine
Repo
Framework

Generating Diverse Translations with Sentence Codes

Title Generating Diverse Translations with Sentence Codes
Authors Raphael Shu, Hideki Nakayama, Kyunghyun Cho
Abstract Users of machine translation systems may desire to obtain multiple candidates translated in different ways. In this work, we attempt to obtain diverse translations by using sentence codes to condition the sentence generation. We describe two methods to extract the codes, either with or without the help of syntax information. For diverse generation, we sample multiple candidates, each of which conditioned on a unique code. Experiments show that the sampled translations have much higher diversity scores when using reasonable sentence codes, where the translation quality is still on par with the baselines even under strong constraint imposed by the codes. In qualitative analysis, we show that our method is able to generate paraphrase translations with drastically different structures. The proposed approach can be easily adopted to existing translation systems as no modification to the model is required.
Tasks Machine Translation
Published 2019-07-01
URL https://www.aclweb.org/anthology/P19-1177/
PDF https://www.aclweb.org/anthology/P19-1177
PWC https://paperswithcode.com/paper/generating-diverse-translations-with-sentence
Repo
Framework

Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks

Title Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks
Authors Ziyi Liu, Le Wang, Qilin Zhang, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua
Abstract Weakly-supervised temporal action localization (WS-TAL) is a promising but challenging task with only video-level action categorical labels available during training. Without requiring temporal action boundary annotations in training data, WS-TAL could possibly exploit automatically retrieved video tags as video-level labels. However, such coarse video-level supervision inevitably incurs confusions, especially in untrimmed videos containing multiple action instances. To address this challenge, we propose the Contrast-based Localization EvaluAtioN Network (CleanNet) with our new action proposal evaluator, which provides pseudo-supervision by leveraging the temporal contrast in snippet-level action classification predictions. Essentially, the new action proposal evaluator enforces an additional temporal contrast constraint so that high-evaluation-score action proposals are more likely to coincide with true action instances. Moreover, the new action localization module is an integral part of CleanNet which enables end-to-end training. This is in contrast to many existing WS-TAL methods where action localization is merely a post-processing step. Experiments on THUMOS14 and ActivityNet datasets validate the efficacy of CleanNet against existing state-ofthe- art WS-TAL algorithms.
Tasks Action Classification, Action Localization, Temporal Action Localization, Weakly Supervised Action Localization, Weakly-supervised Temporal Action Localization
Published 2019-10-01
URL http://openaccess.thecvf.com/content_ICCV_2019/html/Liu_Weakly_Supervised_Temporal_Action_Localization_Through_Contrast_Based_Evaluation_Networks_ICCV_2019_paper.html
PDF http://openaccess.thecvf.com/content_ICCV_2019/papers/Liu_Weakly_Supervised_Temporal_Action_Localization_Through_Contrast_Based_Evaluation_Networks_ICCV_2019_paper.pdf
PWC https://paperswithcode.com/paper/weakly-supervised-temporal-action
Repo
Framework

Dynamic PET Image Reconstruction Using Nonnegative Matrix Factorization Incorporated With Deep Image Prior

Title Dynamic PET Image Reconstruction Using Nonnegative Matrix Factorization Incorporated With Deep Image Prior
Authors Tatsuya Yokota, Kazuya Kawai, Muneyuki Sakata, Yuichi Kimura, Hidekata Hontani
Abstract We propose a method that reconstructs dynamic positron emission tomography (PET) images from given sinograms by using non-negative matrix factorization (NMF) incorporated with a deep image prior (DIP) for appropriately constraining the spatial patterns of resultant images. The proposed method can reconstruct dynamic PET images with higher signal-to-noise ratio (SNR) and blindly decompose an image matrix into pairs of spatial and temporal factors. The former represent homogeneous tissues with different kinetic parameters and the latter represent the time activity curves that are observed in the corresponding homogeneous tissues. We employ U-Nets combined in parallel for DIP and each of the U-nets is used to extract each spatial factor decomposed from the data matrix. Experimental results show that the proposed method outperforms conventional methods and can extract spatial factors that represent the homogeneous tissues.
Tasks Image Reconstruction
Published 2019-10-01
URL http://openaccess.thecvf.com/content_ICCV_2019/html/Yokota_Dynamic_PET_Image_Reconstruction_Using_Nonnegative_Matrix_Factorization_Incorporated_With_ICCV_2019_paper.html
PDF http://openaccess.thecvf.com/content_ICCV_2019/papers/Yokota_Dynamic_PET_Image_Reconstruction_Using_Nonnegative_Matrix_Factorization_Incorporated_With_ICCV_2019_paper.pdf
PWC https://paperswithcode.com/paper/dynamic-pet-image-reconstruction-using
Repo
Framework

CIIDefence: Defeating Adversarial Attacks by Fusing Class-Specific Image Inpainting and Image Denoising

Title CIIDefence: Defeating Adversarial Attacks by Fusing Class-Specific Image Inpainting and Image Denoising
Authors Puneet Gupta, Esa Rahtu
Abstract This paper presents a novel approach for protecting deep neural networks from adversarial attacks, i.e., methods that add well-crafted imperceptible modifications to the original inputs such that they are incorrectly classified with high confidence. The proposed defence mechanism is inspired by the recent works mitigating the adversarial disturbances by the means of image reconstruction and denoising. However, unlike the previous works, we apply the reconstruction only for small and carefully selected image areas that are most influential to the current classification outcome. The selection process is guided by the class activation map responses obtained for multiple top-ranking class labels. The same regions are also the most prominent for the adversarial perturbations and hence most important to purify. The resulting inpainting task is substantially more tractable than the full image reconstruction, while still being able to prevent the adversarial attacks. Furthermore, we combine the selective image inpainting with wavelet based image denoising to produce a non differentiable layer that prevents attacker from using gradient backpropagation. Moreover, the proposed nonlinearity cannot be easily approximated with simple differentiable alternative as demonstrated in the experiments with Backward Pass Differentiable Approximation (BPDA) attack. Finally, we experimentally show that the proposed Class-specific Image Inpainting Defence (CIIDefence) is able to withstand several powerful adversarial attacks including the BPDA. The obtained results are consistently better compared to the other recent defence approaches.
Tasks Denoising, Image Denoising, Image Inpainting, Image Reconstruction
Published 2019-10-01
URL http://openaccess.thecvf.com/content_ICCV_2019/html/Gupta_CIIDefence_Defeating_Adversarial_Attacks_by_Fusing_Class-Specific_Image_Inpainting_and_ICCV_2019_paper.html
PDF http://openaccess.thecvf.com/content_ICCV_2019/papers/Gupta_CIIDefence_Defeating_Adversarial_Attacks_by_Fusing_Class-Specific_Image_Inpainting_and_ICCV_2019_paper.pdf
PWC https://paperswithcode.com/paper/ciidefence-defeating-adversarial-attacks-by
Repo
Framework

Hyperspectral Image Reconstruction Using Deep External and Internal Learning

Title Hyperspectral Image Reconstruction Using Deep External and Internal Learning
Authors Tao Zhang, Ying Fu, Lizhi Wang, Hua Huang
Abstract To solve the low spatial and/or temporal resolution problem which the conventional hypelrspectral cameras often suffer from, coded snapshot hyperspectral imaging systems have attracted more attention recently. Recovering a hyperspectral image (HSI) from its corresponding coded image is an ill-posed inverse problem, and learning accurate prior of HSI is essential to solve this inverse problem. In this paper, we present an effective convolutional neural network (CNN) based method for coded HSI reconstruction, which learns the deep prior from the external dataset as well as the internal information of input coded image with spatial-spectral constraint. Our method can effectively exploit spatial-spectral correlation and sufficiently represent the variety nature of HSIs. Experimental results show our method outperforms the state-of-the-art methods under both comprehensive quantitative metrics and perceptive quality.
Tasks Image Reconstruction
Published 2019-10-01
URL http://openaccess.thecvf.com/content_ICCV_2019/html/Zhang_Hyperspectral_Image_Reconstruction_Using_Deep_External_and_Internal_Learning_ICCV_2019_paper.html
PDF http://openaccess.thecvf.com/content_ICCV_2019/papers/Zhang_Hyperspectral_Image_Reconstruction_Using_Deep_External_and_Internal_Learning_ICCV_2019_paper.pdf
PWC https://paperswithcode.com/paper/hyperspectral-image-reconstruction-using-deep
Repo
Framework

Computational Hyperspectral Imaging Based on Dimension-Discriminative Low-Rank Tensor Recovery

Title Computational Hyperspectral Imaging Based on Dimension-Discriminative Low-Rank Tensor Recovery
Authors Shipeng Zhang, Lizhi Wang, Ying Fu, Xiaoming Zhong, Hua Huang
Abstract Exploiting the prior information is fundamental for the image reconstruction in computational hyperspectral imaging. Existing methods usually unfold the 3D signal as a 1D vector and treat the prior information within different dimensions in an indiscriminative manner, which ignores the high-dimensionality nature of hyperspectral image (HSI) and thus results in poor quality reconstruction. In this paper, we propose to make full use of the high-dimensionality structure of the desired HSI to boost the reconstruction quality. We first build a high-order tensor by exploiting the nonlocal similarity in HSI. Then, we propose a dimension-discriminative low-rank tensor recovery (DLTR) model to characterize the structure prior adaptively in each dimension. By integrating the structure prior in DLTR with the system imaging process, we develop an optimization framework for HSI reconstruction, which is finally solved via the alternating minimization algorithm. Extensive experiments implemented with both synthetic and real data demonstrate that our method outperforms state-of-the-art methods.
Tasks Image Reconstruction
Published 2019-10-01
URL http://openaccess.thecvf.com/content_ICCV_2019/html/Zhang_Computational_Hyperspectral_Imaging_Based_on_Dimension-Discriminative_Low-Rank_Tensor_Recovery_ICCV_2019_paper.html
PDF http://openaccess.thecvf.com/content_ICCV_2019/papers/Zhang_Computational_Hyperspectral_Imaging_Based_on_Dimension-Discriminative_Low-Rank_Tensor_Recovery_ICCV_2019_paper.pdf
PWC https://paperswithcode.com/paper/computational-hyperspectral-imaging-based-on
Repo
Framework

Downsampling leads to Image Memorization in Convolutional Autoencoders

Title Downsampling leads to Image Memorization in Convolutional Autoencoders
Authors Adityanarayanan Radhakrishnan, Caroline Uhler, Mikhail Belkin
Abstract Memorization of data in deep neural networks has become a subject of significant research interest. In this paper, we link memorization of images in deep convolutional autoencoders to downsampling through strided convolution. To analyze this mechanism in a simpler setting, we train linear convolutional autoencoders and show that linear combinations of training data are stored as eigenvectors in the linear operator corresponding to the network when downsampling is used. On the other hand, networks without downsampling do not memorize training data. We provide further evidence that the same effect happens in nonlinear networks. Moreover, downsampling in nonlinear networks causes the model to not only memorize just linear combinations of images, but individual training images. Since convolutional autoencoder components are building blocks of deep convolutional networks, we envision that our findings will shed light on the important phenomenon of memorization in over-parameterized deep networks.
Tasks
Published 2019-05-01
URL https://openreview.net/forum?id=ByGUFsAqYm
PDF https://openreview.net/pdf?id=ByGUFsAqYm
PWC https://paperswithcode.com/paper/downsampling-leads-to-image-memorization-in
Repo
Framework

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics

Title Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Authors
Abstract
Tasks
Published 2019-07-01
URL https://www.aclweb.org/anthology/P19-1000/
PDF https://www.aclweb.org/anthology/P19-1000
PWC https://paperswithcode.com/paper/proceedings-of-the-57th-conference-of-the
Repo
Framework

Writing Styles of Salwa and Al-Qarni

Title Writing Styles of Salwa and Al-Qarni
Authors Ahmed Omer, Michael Oakes
Abstract
Tasks
Published 2019-07-01
URL https://www.aclweb.org/anthology/W19-5603/
PDF https://www.aclweb.org/anthology/W19-5603
PWC https://paperswithcode.com/paper/writing-styles-of-salwa-and-al-qarni
Repo
Framework
comments powered by Disqus