October 15, 2019

2366 words 12 mins read

Paper Group NANR 101

Paper Group NANR 101

Statistical Machine Transliteration Baselines for NEWS 2018. Named-Entity Tagging and Domain adaptation for Better Customized Translation. Leveraging Meta-Embeddings for Bilingual Lexicon Extraction from Specialized Comparable Corpora. Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control. Conditions on abruptness in a gradient-ascent M …

Statistical Machine Transliteration Baselines for NEWS 2018

Title Statistical Machine Transliteration Baselines for NEWS 2018
Authors Snigdha Singhania, Minh Nguyen, Gia H. Ngo, Nancy Chen
Abstract This paper reports the results of our trans-literation experiments conducted on NEWS 2018 Shared Task dataset. We focus on creating the baseline systems trained using two open-source, statistical transliteration tools, namely Sequitur and Moses. We discuss the pre-processing steps performed on this dataset for both the systems. We also provide a re-ranking system which uses top hypotheses from Sequitur and Moses to create a consolidated list of transliterations. The results obtained from each of these models can be used to present a good starting point for the participating teams.
Tasks Information Retrieval, Machine Translation, Transliteration
Published 2018-07-01
URL https://www.aclweb.org/anthology/W18-2410/
PDF https://www.aclweb.org/anthology/W18-2410
PWC https://paperswithcode.com/paper/statistical-machine-transliteration-baselines
Repo
Framework

Named-Entity Tagging and Domain adaptation for Better Customized Translation

Title Named-Entity Tagging and Domain adaptation for Better Customized Translation
Authors Zhongwei Li, Xuancong Wang, Ai Ti Aw, Eng Siong Chng, Haizhou Li
Abstract Customized translation need pay spe-cial attention to the target domain ter-minology especially the named-entities for the domain. Adding linguistic features to neural machine translation (NMT) has been shown to benefit translation in many studies. In this paper, we further demonstrate that adding named-entity (NE) feature with named-entity recognition (NER) into the source language produces better translation with NMT. Our experiments show that by just including the different NE classes and boundary tags, we can increase the BLEU score by around 1 to 2 points using the standard test sets from WMT2017. We also show that adding NE tags using NER and applying in-domain adaptation can be combined to further improve customized machine translation.
Tasks Domain Adaptation, Machine Translation, Named Entity Recognition, Sentence Embedding
Published 2018-07-01
URL https://www.aclweb.org/anthology/W18-2407/
PDF https://www.aclweb.org/anthology/W18-2407
PWC https://paperswithcode.com/paper/named-entity-tagging-and-domain-adaptation
Repo
Framework

Leveraging Meta-Embeddings for Bilingual Lexicon Extraction from Specialized Comparable Corpora

Title Leveraging Meta-Embeddings for Bilingual Lexicon Extraction from Specialized Comparable Corpora
Authors Amir Hazem, Emmanuel Morin
Abstract Recent evaluations on bilingual lexicon extraction from specialized comparable corpora have shown contrasted performance while using word embedding models. This can be partially explained by the lack of large specialized comparable corpora to build efficient representations. Within this context, we try to answer the following questions: First, (i) among the state-of-the-art embedding models, whether trained on specialized corpora or pre-trained on large general data sets, which one is the most appropriate model for bilingual terminology extraction? Second (ii) is it worth it to combine multiple embeddings trained on different data sets? For that purpose, we propose the first systematic evaluation of different word embedding models for bilingual terminology extraction from specialized comparable corpora. We emphasize how the character-based embedding model outperforms other models on the quality of the extracted bilingual lexicons. Further more, we propose a new efficient way to combine different embedding models learned from specialized and general-domain data sets. Our approach leads to higher performance than the best individual embedding model.
Tasks Information Retrieval, Machine Translation
Published 2018-08-01
URL https://www.aclweb.org/anthology/C18-1080/
PDF https://www.aclweb.org/anthology/C18-1080
PWC https://paperswithcode.com/paper/leveraging-meta-embeddings-for-bilingual
Repo
Framework

Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control

Title Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control
Authors Fereshteh Sadeghi, Alexander Toshev, Eric Jang, Sergey Levine
Abstract Humans are remarkably proficient at controlling their limbs and tools from a wide range of viewpoints. In robotics, this ability is referred to as visual servoing: moving a tool or end-point to a desired location using primarily visual feedback. In this paper, we propose learning viewpoint invariant visual servoing skills in a robot manipulation task. We train a deep recurrent controller that can automatically determine which actions move the end-effector of a robotic arm to a desired object. This problem is fundamentally ambiguous: under severe variation in viewpoint, it may be impossible to determine the actions in a single feedforward operation. Instead, our visual servoing approach uses its memory of past movements to understand how the actions affect the robot motion from the current viewpoint, correcting mistakes and gradually moving closer to the target. This ability is in stark contrast to previous visual servoing methods, which assume known dynamics or require a calibration phase. We learn our recurrent controller using simulated data, synthetic demonstrations and reinforcement learning. We then describe how the resulting model can be transferred to a real-world robot by disentangling perception from control and only adapting the visual layers. The adapted model can servo to previously unseen objects from novel viewpoints on a real-world Kuka IIWA robotic arm. For supplementary videos, see: href{https://www.youtube.com/watch?v=oLgM2Bnb7fo}{https://www.youtube.com/watch?v=oLgM2Bnb7fo}
Tasks Calibration
Published 2018-06-01
URL http://openaccess.thecvf.com/content_cvpr_2018/html/Sadeghi_Sim2Real_Viewpoint_Invariant_CVPR_2018_paper.html
PDF http://openaccess.thecvf.com/content_cvpr_2018/papers/Sadeghi_Sim2Real_Viewpoint_Invariant_CVPR_2018_paper.pdf
PWC https://paperswithcode.com/paper/sim2real-viewpoint-invariant-visual-servoing
Repo
Framework

Conditions on abruptness in a gradient-ascent Maximum Entropy learner

Title Conditions on abruptness in a gradient-ascent Maximum Entropy learner
Authors Elliott Moreton
Abstract
Tasks
Published 2018-01-01
URL https://www.aclweb.org/anthology/W18-0312/
PDF https://www.aclweb.org/anthology/W18-0312
PWC https://paperswithcode.com/paper/conditions-on-abruptness-in-a-gradient-ascent
Repo
Framework

The University of Edinburgh’s Submissions to the WMT18 News Translation Task

Title The University of Edinburgh’s Submissions to the WMT18 News Translation Task
Authors Barry Haddow, Nikolay Bogoychev, Denis Emelin, Ulrich Germann, Roman Grundkiewicz, Kenneth Heafield, Antonio Valerio Miceli Barone, Rico Sennrich
Abstract The University of Edinburgh made submissions to all 14 language pairs in the news translation task, with strong performances in most pairs. We introduce new RNN-variant, mixed RNN/Transformer ensembles, data selection and weighting, and extensions to back-translation.
Tasks Machine Translation
Published 2018-10-01
URL https://www.aclweb.org/anthology/W18-6412/
PDF https://www.aclweb.org/anthology/W18-6412
PWC https://paperswithcode.com/paper/the-university-of-edinburghas-submissions-to
Repo
Framework

Inducing a lexicon of sociolinguistic variables from code-mixed text

Title Inducing a lexicon of sociolinguistic variables from code-mixed text
Authors Philippa Shoemark, James Kirby, Sharon Goldwater
Abstract Sociolinguistics is often concerned with how variants of a linguistic item (e.g., \textit{nothing} vs. \textit{nothin{'}}) are used by different groups or in different situations. We introduce the task of inducing lexical variables from code-mixed text: that is, identifying equivalence pairs such as (\textit{football}, \textit{fitba}) along with their linguistic code (\textit{football}→British, \textit{fitba}→Scottish). We adapt a framework for identifying gender-biased word pairs to this new task, and present results on three different pairs of English dialects, using tweets as the code-mixed text. Our system achieves precision of over 70{%} for two of these three datasets, and produces useful results even without extensive parameter tuning. Our success in adapting this framework from gender to language variety suggests that it could be used to discover other types of analogous pairs as well.
Tasks
Published 2018-11-01
URL https://www.aclweb.org/anthology/W18-6101/
PDF https://www.aclweb.org/anthology/W18-6101
PWC https://paperswithcode.com/paper/inducing-a-lexicon-of-sociolinguistic
Repo
Framework

A Corpus of Corporate Annual and Social Responsibility Reports: 280 Million Tokens of Balanced Organizational Writing

Title A Corpus of Corporate Annual and Social Responsibility Reports: 280 Million Tokens of Balanced Organizational Writing
Authors Sebastian G.M. H{"a}ndschke, Sven Buechel, Jan Goldenstein, Philipp Poschmann, Tinghui Duan, Peter Walgenbach, Udo Hahn
Abstract We introduce JOCo, a novel text corpus for NLP analytics in the field of economics, business and management. This corpus is composed of corporate annual and social responsibility reports of the top 30 US, UK and German companies in the major (DJIA, FTSE 100, DAX), middle-sized (S{&}P 500, FTSE 250, MDAX) and technology (NASDAQ, FTSE AIM 100, TECDAX) stock indices, respectively. Altogether, this adds up to 5,000 reports from 270 companies headquartered in three of the world{'}s most important economies. The corpus spans a time frame from 2000 up to 2015 and contains, in total, 282M tokens. We also feature JOCo in a small-scale experiment to demonstrate its potential for NLP-fueled studies in economics, business and management research.
Tasks Tokenization
Published 2018-07-01
URL https://www.aclweb.org/anthology/W18-3103/
PDF https://www.aclweb.org/anthology/W18-3103
PWC https://paperswithcode.com/paper/a-corpus-of-corporate-annual-and-social
Repo
Framework

Context-Aware Neural Model for Temporal Information Extraction

Title Context-Aware Neural Model for Temporal Information Extraction
Authors Yuanliang Meng, Anna Rumshisky
Abstract We propose a context-aware neural network model for temporal information extraction. This model has a uniform architecture for event-event, event-timex and timex-timex pairs. A Global Context Layer (GCL), inspired by Neural Turing Machine (NTM), stores processed temporal relations in narrative order, and retrieves them for use when relevant entities come in. Relations are then classified in context. The GCL model has long-term memory and attention mechanisms to resolve irregular long-distance dependencies that regular RNNs such as LSTM cannot recognize. It does not require any new input features, while outperforming the existing models in literature. To our knowledge it is also the first model to use NTM-like architecture to process the information from global context in discourse-scale natural text processing. We are going to release the source code in the future.
Tasks Information Retrieval, Question Answering, Temporal Information Extraction
Published 2018-07-01
URL https://www.aclweb.org/anthology/P18-1049/
PDF https://www.aclweb.org/anthology/P18-1049
PWC https://paperswithcode.com/paper/context-aware-neural-model-for-temporal
Repo
Framework

A Graph-theoretic Summary Evaluation for ROUGE

Title A Graph-theoretic Summary Evaluation for ROUGE
Authors Elaheh ShafieiBavani, Mohammad Ebrahimi, Raymond Wong, Fang Chen
Abstract ROUGE is one of the first and most widely used evaluation metrics for text summarization. However, its assessment merely relies on surface similarities between peer and model summaries. Consequently, ROUGE is unable to fairly evaluate summaries including lexical variations and paraphrasing. We propose a graph-based approach adopted into ROUGE to evaluate summaries based on both lexical and semantic similarities. Experiment results over TAC AESOP datasets show that exploiting the lexico-semantic similarity of the words used in summaries would significantly help ROUGE correlate better with human judgments.
Tasks Abstractive Text Summarization, Semantic Similarity, Semantic Textual Similarity, Text Summarization
Published 2018-10-01
URL https://www.aclweb.org/anthology/D18-1085/
PDF https://www.aclweb.org/anthology/D18-1085
PWC https://paperswithcode.com/paper/a-graph-theoretic-summary-evaluation-for
Repo
Framework

Language-Guided Adaptive Perception for Efficient Grounded Communication with Robotic Manipulators in Cluttered Environments

Title Language-Guided Adaptive Perception for Efficient Grounded Communication with Robotic Manipulators in Cluttered Environments
Authors Siddharth Patki, Thomas Howard
Abstract The utility of collaborative manipulators for shared tasks is highly dependent on the speed and accuracy of communication between the human and the robot. The run-time of recently developed probabilistic inference models for situated symbol grounding of natural language instructions depends on the complexity of the representation of the environment in which they reason. As we move towards more complex bi-directional interactions, tasks, and environments, we need intelligent perception models that can selectively infer precise pose, semantics, and affordances of the objects when inferring exhaustively detailed world models is inefficient and prohibits real-time interaction with these robots. In this paper we propose a model of language and perception for the problem of adapting the configuration of the robot perception pipeline for tasks where constructing exhaustively detailed models of the environment is inefficient and inconsequential for symbol grounding. We present experimental results from a synthetic corpus of natural language instructions for robot manipulation in example environments. The results demonstrate that by adapting perception we get significant gains in terms of run-time for perception and situated symbol grounding of the language instructions without a loss in the accuracy of the latter.
Tasks
Published 2018-07-01
URL https://www.aclweb.org/anthology/W18-5016/
PDF https://www.aclweb.org/anthology/W18-5016
PWC https://paperswithcode.com/paper/language-guided-adaptive-perception-for
Repo
Framework

An Evaluation of Two Vocabulary Reduction Methods for Neural Machine Translation

Title An Evaluation of Two Vocabulary Reduction Methods for Neural Machine Translation
Authors Duygu Ataman, Marcello Federico
Abstract
Tasks Machine Translation
Published 2018-03-01
URL https://www.aclweb.org/anthology/W18-1810/
PDF https://www.aclweb.org/anthology/W18-1810
PWC https://paperswithcode.com/paper/an-evaluation-of-two-vocabulary-reduction
Repo
Framework

Distantly Supervised Attribute Detection from Reviews

Title Distantly Supervised Attribute Detection from Reviews
Authors Lisheng Fu, Pablo Barrio
Abstract This work aims to detect specific attributes of a place (e.g., if it has a romantic atmosphere, or if it offers outdoor seating) from its user reviews via distant supervision: without direct annotation of the review text, we use the crowdsourced attribute labels of the place as labels of the review text. We then use review-level attention to pay more attention to those reviews related to the attributes. The experimental results show that our attention-based model predicts attributes for places from reviews with over 98{%} accuracy. The attention weights assigned to each review provide explanation of capturing relevant reviews.
Tasks Entity Extraction, Sentiment Analysis
Published 2018-11-01
URL https://www.aclweb.org/anthology/W18-6110/
PDF https://www.aclweb.org/anthology/W18-6110
PWC https://paperswithcode.com/paper/distantly-supervised-attribute-detection-from
Repo
Framework

Multimodal image alignment through a multiscale chain of neural networks with application to remote sensing

Title Multimodal image alignment through a multiscale chain of neural networks with application to remote sensing
Authors Armand Zampieri, Guillaume Charpiat, Nicolas Girard, Yuliya Tarabalka
Abstract We tackle here the problem of multimodal image non-rigid registration, which is of prime importance in remote sensing and medical imaging. The difficulties encountered by classical registration approaches include feature design and slow optimization by gradient descent. By analyzing these methods, we note the significance of the notion of scale. We design easy-to-train, fully-convolutional neural networks able to learn scale-specific features. Once chained appropriately, they perform global registration in linear time, getting rid of gradient descent schemes by predicting directly the deformation. We show their performance in terms of quality and speed through various tasks of remote sensing multimodal image alignment. In particular, we are able to register correctly cadastral maps of buildings as well as road polylines onto RGB images, and outperform current keypoint matching methods.
Tasks
Published 2018-09-01
URL http://openaccess.thecvf.com/content_ECCV_2018/html/Armand_Zampieri_Multimodal_image_alignment_ECCV_2018_paper.html
PDF http://openaccess.thecvf.com/content_ECCV_2018/papers/Armand_Zampieri_Multimodal_image_alignment_ECCV_2018_paper.pdf
PWC https://paperswithcode.com/paper/multimodal-image-alignment-through-a
Repo
Framework

Automated Essay Scoring in the Presence of Biased Ratings

Title Automated Essay Scoring in the Presence of Biased Ratings
Authors Evelin Amorim, Marcia Can{\c{c}}ado, Adriano Veloso
Abstract Studies in Social Sciences have revealed that when people evaluate someone else, their evaluations often reflect their biases. As a result, rater bias may introduce highly subjective factors that make their evaluations inaccurate. This may affect automated essay scoring models in many ways, as these models are typically designed to model (potentially biased) essay raters. While there is sizeable literature on rater effects in general settings, it remains unknown how rater bias affects automated essay scoring. To this end, we present a new annotated corpus containing essays and their respective scores. Different from existing corpora, our corpus also contains comments provided by the raters in order to ground their scores. We present features to quantify rater bias based on their comments, and we found that rater bias plays an important role in automated essay scoring. We investigated the extent to which rater bias affects models based on hand-crafted features. Finally, we propose to rectify the training set by removing essays associated with potentially biased scores while learning the scoring model.
Tasks
Published 2018-06-01
URL https://www.aclweb.org/anthology/N18-1021/
PDF https://www.aclweb.org/anthology/N18-1021
PWC https://paperswithcode.com/paper/automated-essay-scoring-in-the-presence-of
Repo
Framework
comments powered by Disqus