May 5, 2019

1393 words 7 mins read

Paper Group NANR 71

Paper Group NANR 71

ParFDA for Instance Selection for Statistical Machine Translation. The Effect of Multiple Grammatical Errors on Processing Non-Native Writing. Unsupervised Modeling of Topical Relevance in L2 Learner Text. Collective Entity Resolution with Multi-Focal Attention. A Japanese Chess Commentary Corpus. Account Deletion Prediction on RuNet: A Case Study …

ParFDA for Instance Selection for Statistical Machine Translation

Title ParFDA for Instance Selection for Statistical Machine Translation
Authors Ergun Bicici
Abstract
Tasks Language Modelling, Machine Translation
Published 2016-08-01
URL https://www.aclweb.org/anthology/papers/W16-2306/w16-2306
PDF https://www.aclweb.org/anthology/W16-2306v2
PWC https://paperswithcode.com/paper/parfda-for-instance-selection-for-statistical
Repo
Framework

The Effect of Multiple Grammatical Errors on Processing Non-Native Writing

Title The Effect of Multiple Grammatical Errors on Processing Non-Native Writing
Authors Courtney Napoles, Aoife Cahill, Nitin Madnani
Abstract
Tasks Dependency Parsing
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0501/
PDF https://www.aclweb.org/anthology/W16-0501
PWC https://paperswithcode.com/paper/the-effect-of-multiple-grammatical-errors-on
Repo
Framework

Unsupervised Modeling of Topical Relevance in L2 Learner Text

Title Unsupervised Modeling of Topical Relevance in L2 Learner Text
Authors Ronan Cummins, Helen Yannakoudakis, Ted Briscoe
Abstract
Tasks Information Retrieval
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0510/
PDF https://www.aclweb.org/anthology/W16-0510
PWC https://paperswithcode.com/paper/unsupervised-modeling-of-topical-relevance-in
Repo
Framework

Collective Entity Resolution with Multi-Focal Attention

Title Collective Entity Resolution with Multi-Focal Attention
Authors Amir Globerson, Nevena Lazic, Soumen Chakrabarti, Amarnag Subramanya, Michael Ringgaard, Fern Pereira, o
Abstract
Tasks Entity Resolution, Topic Models
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1059/
PDF https://www.aclweb.org/anthology/P16-1059
PWC https://paperswithcode.com/paper/collective-entity-resolution-with-multi-focal
Repo
Framework

A Japanese Chess Commentary Corpus

Title A Japanese Chess Commentary Corpus
Authors Shinsuke Mori, John Richardson, Atsushi Ushiku, Tetsuro Sasada, Hirotaka Kameko, Yoshimasa Tsuruoka
Abstract In recent years there has been a surge of interest in the natural language prosessing related to the real world, such as symbol grounding, language generation, and nonlinguistic data search by natural language queries. In order to concentrate on language ambiguities, we propose to use a well-defined {``}real world,{''} that is game states. We built a corpus consisting of pairs of sentences and a game state. The game we focus on is shogi (Japanese chess). We collected 742,286 commentary sentences in Japanese. They are spontaneously generated contrary to natural language annotations in many image datasets provided by human workers on Amazon Mechanical Turk. We defined domain specific named entities and we segmented 2,508 sentences into words manually and annotated each word with a named entity tag. We describe a detailed definition of named entities and show some statistics of our game commentary corpus. We also show the results of the experiments of word segmentation and named entity recognition. The accuracies are as high as those on general domain texts indicating that we are ready to tackle various new problems related to the real world. |
Tasks Named Entity Recognition, Text Generation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1225/
PDF https://www.aclweb.org/anthology/L16-1225
PWC https://paperswithcode.com/paper/a-japanese-chess-commentary-corpus
Repo
Framework

Account Deletion Prediction on RuNet: A Case Study of Suspicious Twitter Accounts Active During the Russian-Ukrainian Crisis

Title Account Deletion Prediction on RuNet: A Case Study of Suspicious Twitter Accounts Active During the Russian-Ukrainian Crisis
Authors Svitlana Volkova, Eric Bell
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0801/
PDF https://www.aclweb.org/anthology/W16-0801
PWC https://paperswithcode.com/paper/account-deletion-prediction-on-runet-a-case
Repo
Framework

Bitext Name Tagging for Cross-lingual Entity Annotation Projection

Title Bitext Name Tagging for Cross-lingual Entity Annotation Projection
Authors Dongxu Zhang, Boliang Zhang, Xiaoman Pan, Xiaocheng Feng, Heng Ji, Weiran Xu
Abstract Annotation projection is a practical method to deal with the low resource problem in incident languages (IL) processing. Previous methods on annotation projection mainly relied on word alignment results without any training process, which led to noise propagation caused by word alignment errors. In this paper, we focus on the named entity recognition (NER) task and propose a weakly-supervised framework to project entity annotations from English to IL through bitexts. Instead of directly relying on word alignment results, this framework combines advantages of rule-based methods and deep learning methods by implementing two steps: First, generates a high-confidence entity annotation set on IL side with strict searching methods; Second, uses this high-confidence set to weakly supervise the model training. The model is finally used to accomplish the projecting process. Experimental results on two low-resource ILs show that the proposed method can generate better annotations projected from English-IL parallel corpora. The performance of IL name tagger can also be improved significantly by training on the newly projected IL annotation set.
Tasks Named Entity Recognition, Word Alignment
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1045/
PDF https://www.aclweb.org/anthology/C16-1045
PWC https://paperswithcode.com/paper/bitext-name-tagging-for-cross-lingual-entity
Repo
Framework

Brundlefly at SemEval-2016 Task 12: Recurrent Neural Networks vs. Joint Inference for Clinical Temporal Information Extraction

Title Brundlefly at SemEval-2016 Task 12: Recurrent Neural Networks vs. Joint Inference for Clinical Temporal Information Extraction
Authors Jason Fries
Abstract
Tasks Entity Extraction, Structured Prediction, Temporal Information Extraction, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1198/
PDF https://www.aclweb.org/anthology/S16-1198
PWC https://paperswithcode.com/paper/brundlefly-at-semeval-2016-task-12-recurrent-1
Repo
Framework

A Tag-based English Math Word Problem Solver with Understanding, Reasoning and Explanation

Title A Tag-based English Math Word Problem Solver with Understanding, Reasoning and Explanation
Authors Chao-Chun Liang, Kuang-Yi Hsu, Chien-Tsung Huang, Chung-Min Li, Shen-Yu Miao, Keh-Yih Su
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-3014/
PDF https://www.aclweb.org/anthology/N16-3014
PWC https://paperswithcode.com/paper/a-tag-based-english-math-word-problem-solver
Repo
Framework

Tools and Guidelines for Principled Machine Translation Development

Title Tools and Guidelines for Principled Machine Translation Development
Authors Nora Aranberri, Eleftherios Avramidis, Aljoscha Burchardt, Ond{\v{r}}ej Klejch, Martin Popel, Maja Popovi{'c}
Abstract This work addresses the need to aid Machine Translation (MT) development cycles with a complete workflow of MT evaluation methods. Our aim is to assess, compare and improve MT system variants. We hereby report on novel tools and practices that support various measures, developed in order to support a principled and informed approach of MT development. Our toolkit for automatic evaluation showcases quick and detailed comparison of MT system variants through automatic metrics and n-gram feedback, along with manual evaluation via edit-distance, error annotation and task-based feedback.
Tasks Machine Translation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1296/
PDF https://www.aclweb.org/anthology/L16-1296
PWC https://paperswithcode.com/paper/tools-and-guidelines-for-principled-machine
Repo
Framework

Token-Level Metaphor Detection using Neural Networks

Title Token-Level Metaphor Detection using Neural Networks
Authors Erik-L{^a}n Do Dinh, Iryna Gurevych
Abstract
Tasks Feature Engineering, Machine Translation, Topic Models, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1104/
PDF https://www.aclweb.org/anthology/W16-1104
PWC https://paperswithcode.com/paper/token-level-metaphor-detection-using-neural
Repo
Framework

Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation

Title Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation
Authors Chenhui Chu, Sadao Kurohashi
Abstract Out-of-vocabulary (OOV) word is a crucial problem in statistical machine translation (SMT) with low resources. OOV paraphrasing that augments the translation model for the OOV words by using the translation knowledge of their paraphrases has been proposed to address the OOV problem. In this paper, we propose using word embeddings and semantic lexicons for OOV paraphrasing. Experiments conducted on a low resource setting of the OLYMPICS task of IWSLT 2012 verify the effectiveness of our proposed method.
Tasks Machine Translation, Word Embeddings
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1101/
PDF https://www.aclweb.org/anthology/L16-1101
PWC https://paperswithcode.com/paper/paraphrasing-out-of-vocabulary-words-with
Repo
Framework

Evaluation of acoustic word embeddings

Title Evaluation of acoustic word embeddings
Authors Sahar Ghannay, Yannick Est{`e}ve, Nathalie Camelin, Paul Deleglise
Abstract
Tasks Speech Recognition, Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2511/
PDF https://www.aclweb.org/anthology/W16-2511
PWC https://paperswithcode.com/paper/evaluation-of-acoustic-word-embeddings
Repo
Framework

Subsumption Preservation as a Comparative Measure for Evaluating Sense-Directed Embeddings

Title Subsumption Preservation as a Comparative Measure for Evaluating Sense-Directed Embeddings
Authors Ali Seyed
Abstract
Tasks Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2516/
PDF https://www.aclweb.org/anthology/W16-2516
PWC https://paperswithcode.com/paper/subsumption-preservation-as-a-comparative
Repo
Framework

Extracting Interlinear Glossed Text from LaTeX Documents

Title Extracting Interlinear Glossed Text from LaTeX Documents
Authors Mathias Schenner, Sebastian Nordhoff
Abstract We present texigt, a command-line tool for the extraction of structured linguistic data from LaTeX source documents, and a language resource that has been generated using this tool: a corpus of interlinear glossed text (IGT) extracted from open access books published by Language Science Press. Extracted examples are represented in a simple XML format that is easy to process and can be used to validate certain aspects of interlinear glossed text. The main challenge involved is the parsing of TeX and LaTeX documents. We review why this task is impossible in general and how the texhs Haskell library uses a layered architecture and selective early evaluation (expansion) during lexing and parsing in order to provide access to structured representations of LaTeX documents at several levels. In particular, its parsing modules generate an abstract syntax tree for LaTeX documents after expansion of all user-defined macros and lexer-level commands that serves as an ideal interface for the extraction of interlinear glossed text by texigt. This architecture can easily be adapted to extract other types of linguistic data structures from LaTeX source documents.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1638/
PDF https://www.aclweb.org/anthology/L16-1638
PWC https://paperswithcode.com/paper/extracting-interlinear-glossed-text-from
Repo
Framework
comments powered by Disqus