July 26, 2019

2026 words 10 mins read

Paper Group NANR 135

Paper Group NANR 135

Combining Multiple Corpora for Readability Assessment for People with Cognitive Disabilities. Connecting the Dots: Towards Human-Level Grammatical Error Correction. Neural Relation Extraction with Multi-lingual Attention. Log-linear Models for Uyghur Segmentation in Spoken Language Translation. TWINA at SemEval-2017 Task 4: Twitter Sentiment Analys …

Combining Multiple Corpora for Readability Assessment for People with Cognitive Disabilities

Title Combining Multiple Corpora for Readability Assessment for People with Cognitive Disabilities
Authors Victoria Yaneva, Constantin Or{\u{a}}san, Richard Evans, Omid Rohanian
Abstract Given the lack of large user-evaluated corpora in disability-related NLP research (e.g. text simplification or readability assessment for people with cognitive disabilities), the question of choosing suitable training data for NLP models is not straightforward. The use of large generic corpora may be problematic because such data may not reflect the needs of the target population. The use of the available user-evaluated corpora may be problematic because these datasets are not large enough to be used as training data. In this paper we explore a third approach, in which a large generic corpus is combined with a smaller population-specific corpus to train a classifier which is evaluated using two sets of unseen user-evaluated data. One of these sets, the ASD Comprehension corpus, is developed for the purposes of this study and made freely available. We explore the effects of the size and type of the training data used on the performance of the classifiers, and the effects of the type of the unseen test datasets on the classification performance.
Tasks Text Simplification
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-5013/
PDF https://www.aclweb.org/anthology/W17-5013
PWC https://paperswithcode.com/paper/combining-multiple-corpora-for-readability
Repo
Framework

Connecting the Dots: Towards Human-Level Grammatical Error Correction

Title Connecting the Dots: Towards Human-Level Grammatical Error Correction
Authors Shamil Chollampatt, Hwee Tou Ng
Abstract We build a grammatical error correction (GEC) system primarily based on the state-of-the-art statistical machine translation (SMT) approach, using task-specific features and tuning, and further enhance it with the modeling power of neural network joint models. The SMT-based system is weak in generalizing beyond patterns seen during training and lacks granularity below the word level. To address this issue, we incorporate a character-level SMT component targeting the misspelled words that the original SMT-based system fails to correct. Our final system achieves 53.14{%} F 0.5 score on the benchmark CoNLL-2014 test set, an improvement of 3.62{%} F 0.5 over the best previous published score.
Tasks Grammatical Error Correction, Language Modelling, Machine Translation
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-5037/
PDF https://www.aclweb.org/anthology/W17-5037
PWC https://paperswithcode.com/paper/connecting-the-dots-towards-human-level
Repo
Framework

Neural Relation Extraction with Multi-lingual Attention

Title Neural Relation Extraction with Multi-lingual Attention
Authors Yankai Lin, Zhiyuan Liu, Maosong Sun
Abstract Relation extraction has been widely used for finding unknown relational facts from plain text. Most existing methods focus on exploiting mono-lingual data for relation extraction, ignoring massive information from the texts in various languages. To address this issue, we introduce a multi-lingual neural relation extraction framework, which employs mono-lingual attention to utilize the information within mono-lingual texts and further proposes cross-lingual attention to consider the information consistency and complementarity among cross-lingual texts. Experimental results on real-world datasets show that, our model can take advantage of multi-lingual texts and consistently achieve significant improvements on relation extraction as compared with baselines.
Tasks Information Retrieval, Question Answering, Relation Extraction
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-1004/
PDF https://www.aclweb.org/anthology/P17-1004
PWC https://paperswithcode.com/paper/neural-relation-extraction-with-multi-lingual
Repo
Framework

Log-linear Models for Uyghur Segmentation in Spoken Language Translation

Title Log-linear Models for Uyghur Segmentation in Spoken Language Translation
Authors Chenggang Mi, Yating Yang, Rui Dong, Xi Zhou, Lei Wang, Xiao Li, Tonghai Jiang
Abstract To alleviate data sparsity in spoken Uyghur machine translation, we proposed a log-linear based morphological segmentation approach. Instead of learning model only from monolingual annotated corpus, this approach optimizes Uyghur segmentation for spoken translation based on both bilingual and monolingual corpus. Our approach relies on several features such as traditional conditional random field (CRF) feature, bilingual word alignment feature and monolingual suffixword co-occurrence feature. Experimental results shown that our proposed segmentation model for Uyghur spoken translation achieved 1.6 BLEU score improvements compared with the state-of-the-art baseline.
Tasks Machine Translation, Word Alignment
Published 2017-09-01
URL https://www.aclweb.org/anthology/R17-1065/
PDF https://doi.org/10.26615/978-954-452-049-6_065
PWC https://paperswithcode.com/paper/log-linear-models-for-uyghur-segmentation-in
Repo
Framework

TWINA at SemEval-2017 Task 4: Twitter Sentiment Analysis with Ensemble Gradient Boost Tree Classifier

Title TWINA at SemEval-2017 Task 4: Twitter Sentiment Analysis with Ensemble Gradient Boost Tree Classifier
Authors Naveen Kumar Laskari, Suresh Kumar Sanampudi
Abstract This paper describes the TWINA system, with which we participated in SemEval-2017 Task 4B (Topic Based Message Polarity Classification {–} Two point scale) and 4D (two-point scale Tweet quantification). We implemented ensemble based Gradient Boost Trees classification method for both the tasks. Our system could perform well for the task 4D and ranked 13th among 15 teams, for the task 4B our model ranked 23rd position.
Tasks Information Retrieval, Sentiment Analysis, Twitter Sentiment Analysis
Published 2017-08-01
URL https://www.aclweb.org/anthology/S17-2109/
PDF https://www.aclweb.org/anthology/S17-2109
PWC https://paperswithcode.com/paper/twina-at-semeval-2017-task-4-twitter
Repo
Framework

Processo de constru\cc~ao de um corpus anotado com Entidades Geol'ogicas visando REN (Building an annotated corpus with geological entities for NER)[In Portuguese]

Title Processo de constru\cc~ao de um corpus anotado com Entidades Geol'ogicas visando REN (Building an annotated corpus with geological entities for NER)[In Portuguese]
Authors Daniela Amaral, S Collovini, ra, Anny Figueira, Renata Vieira, Renata Vieira, Marco Gonzalez
Abstract
Tasks Named Entity Recognition
Published 2017-10-01
URL https://www.aclweb.org/anthology/W17-6609/
PDF https://www.aclweb.org/anthology/W17-6609
PWC https://paperswithcode.com/paper/processo-de-construaao-de-um-corpus-anotado
Repo
Framework

Proceedings of the Third Arabic Natural Language Processing Workshop

Title Proceedings of the Third Arabic Natural Language Processing Workshop
Authors
Abstract
Tasks
Published 2017-04-01
URL https://www.aclweb.org/anthology/W17-1300/
PDF https://www.aclweb.org/anthology/W17-1300
PWC https://paperswithcode.com/paper/proceedings-of-the-third-arabic-natural
Repo
Framework

Interactive Visual Analysis of Transcribed Multi-Party Discourse

Title Interactive Visual Analysis of Transcribed Multi-Party Discourse
Authors Mennatallah El-Assady, Annette Hautli-Janisz, Valentin Gold, Miriam Butt, Katharina Holzinger, Daniel Keim
Abstract
Tasks
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-4009/
PDF https://www.aclweb.org/anthology/P17-4009
PWC https://paperswithcode.com/paper/interactive-visual-analysis-of-transcribed
Repo
Framework

Demographic-aware word associations

Title Demographic-aware word associations
Authors Aparna Garimella, Carmen Banea, Rada Mihalcea
Abstract Variations of word associations across different groups of people can provide insights into people{'}s psychologies and their world views. To capture these variations, we introduce the task of demographic-aware word associations. We build a new gold standard dataset consisting of word association responses for approximately 300 stimulus words, collected from more than 800 respondents of different gender (male/female) and from different locations (India/United States), and show that there are significant variations in the word associations made by these groups. We also introduce a new demographic-aware word association model based on a neural net skip-gram architecture, and show how computational methods for measuring word associations that specifically account for writer demographics can outperform generic methods that are agnostic to such information.
Tasks Information Retrieval, Keyword Extraction, Relation Extraction
Published 2017-09-01
URL https://www.aclweb.org/anthology/D17-1242/
PDF https://www.aclweb.org/anthology/D17-1242
PWC https://paperswithcode.com/paper/demographic-aware-word-associations
Repo
Framework

Learning Multimodal Gender Profile using Neural Networks

Title Learning Multimodal Gender Profile using Neural Networks
Authors Carlos P{'e}rez Estruch, Roberto Paredes Palacios, Paolo Rosso
Abstract Gender identification in social networks is one of the most popular aspects of user profile learning. Traditionally it has been linked to author profiling, a difficult problem to solve because of the little difference in the use of language between genders. This situation has led to the need of taking into account other information apart from textual data, favoring the emergence of multimodal data. The aim of this paper is to apply neural networks to perform data fusion, using an existing multimodal corpus, the NUS-MSS data set, that (not only) contains text data, but also image and location information. We improved previous results in terms of macro accuracy (87.8{%}) obtaining the state-of-the-art performance of 91.3{%}.
Tasks
Published 2017-09-01
URL https://www.aclweb.org/anthology/R17-1075/
PDF https://doi.org/10.26615/978-954-452-049-6_075
PWC https://paperswithcode.com/paper/learning-multimodal-gender-profile-using
Repo
Framework

Distinguishing Japanese Non-standard Usages from Standard Ones

Title Distinguishing Japanese Non-standard Usages from Standard Ones
Authors Tatsuya Aoki, Ryohei Sasano, Hiroya Takamura, Manabu Okumura
Abstract We focus on non-standard usages of common words on social media. In the context of social media, words sometimes have other usages that are totally different from their original. In this study, we attempt to distinguish non-standard usages on social media from standard ones in an unsupervised manner. Our basic idea is that non-standardness can be measured by the inconsistency between the expected meaning of the target word and the given context. For this purpose, we use context embeddings derived from word embeddings. Our experimental results show that the model leveraging the context embedding outperforms other methods and provide us with findings, for example, on how to construct context embeddings and which corpus to use.
Tasks Machine Translation, Word Embeddings
Published 2017-09-01
URL https://www.aclweb.org/anthology/D17-1246/
PDF https://www.aclweb.org/anthology/D17-1246
PWC https://paperswithcode.com/paper/distinguishing-japanese-non-standard-usages
Repo
Framework

Structured Learning for Context-aware Spoken Language Understanding of Robotic Commands

Title Structured Learning for Context-aware Spoken Language Understanding of Robotic Commands
Authors Andrea Vanzo, Danilo Croce, Roberto Basili, Daniele Nardi
Abstract Service robots are expected to operate in specific environments, where the presence of humans plays a key role. A major feature of such robotics platforms is thus the ability to react to spoken commands. This requires the understanding of the user utterance with an accuracy able to trigger the robot reaction. Such correct interpretation of linguistic exchanges depends on physical, cognitive and language-dependent aspects related to the environment. In this work, we present the empirical evaluation of an adaptive Spoken Language Understanding chain for robotic commands, that explicitly depends on the operational environment during both the learning and recognition stages. The effectiveness of such a context-sensitive command interpretation is tested against an extension of an already existing corpus of commands, that introduced explicit perceptual knowledge: this enabled deeper measures proving that more accurate disambiguation capabilities can be actually obtained.
Tasks Spoken Language Understanding
Published 2017-08-01
URL https://www.aclweb.org/anthology/W17-2804/
PDF https://www.aclweb.org/anthology/W17-2804
PWC https://paperswithcode.com/paper/structured-learning-for-context-aware-spoken
Repo
Framework

Joint, Incremental Disfluency Detection and Utterance Segmentation from Speech

Title Joint, Incremental Disfluency Detection and Utterance Segmentation from Speech
Authors Julian Hough, David Schlangen
Abstract We present the joint task of incremental disfluency detection and utterance segmentation and a simple deep learning system which performs it on transcripts and ASR results. We show how the constraints of the two tasks interact. Our joint-task system outperforms the equivalent individual task systems, provides competitive results and is suitable for future use in conversation agents in the psychiatric domain.
Tasks Speech Recognition
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-1031/
PDF https://www.aclweb.org/anthology/E17-1031
PWC https://paperswithcode.com/paper/joint-incremental-disfluency-detection-and
Repo
Framework

Inter-Weighted Alignment Network for Sentence Pair Modeling

Title Inter-Weighted Alignment Network for Sentence Pair Modeling
Authors Gehui Shen, Yunlun Yang, Zhi-Hong Deng
Abstract Sentence pair modeling is a crucial problem in the field of natural language processing. In this paper, we propose a model to measure the similarity of a sentence pair focusing on the interaction information. We utilize the word level similarity matrix to discover fine-grained alignment of two sentences. It should be emphasized that each word in a sentence has a different importance from the perspective of semantic composition, so we exploit two novel and efficient strategies to explicitly calculate a weight for each word. Although the proposed model only use a sequential LSTM for sentence modeling without any external resource such as syntactic parser tree and additional lexicon features, experimental results show that our model achieves state-of-the-art performance on three datasets of two tasks.
Tasks Machine Translation, Natural Language Inference, Paraphrase Identification, Question Answering, Relation Classification, Semantic Composition, Semantic Textual Similarity, Sentence Pair Modeling, Word Embeddings
Published 2017-09-01
URL https://www.aclweb.org/anthology/D17-1122/
PDF https://www.aclweb.org/anthology/D17-1122
PWC https://paperswithcode.com/paper/inter-weighted-alignment-network-for-sentence
Repo
Framework

KeLP at SemEval-2017 Task 3: Learning Pairwise Patterns in Community Question Answering

Title KeLP at SemEval-2017 Task 3: Learning Pairwise Patterns in Community Question Answering
Authors Simone Filice, Giovanni Da San Martino, Aless Moschitti, ro
Abstract This paper describes the KeLP system participating in the SemEval-2017 community Question Answering (cQA) task. The system is a refinement of the kernel-based sentence pair modeling we proposed for the previous year challenge. It is implemented within the Kernel-based Learning Platform called KeLP, from which we inherit the team{'}s name. Our primary submission ranked first in subtask A, and third in subtasks B and C, being the only systems appearing in the top-3 ranking for all the English subtasks. This shows that the proposed framework, which has minor variations among the three subtasks, is extremely flexible and effective in tackling learning tasks defined on sentence pairs.
Tasks Community Question Answering, Question Answering, Relational Reasoning, Sentence Pair Modeling
Published 2017-08-01
URL https://www.aclweb.org/anthology/S17-2053/
PDF https://www.aclweb.org/anthology/S17-2053
PWC https://paperswithcode.com/paper/kelp-at-semeval-2017-task-3-learning-pairwise
Repo
Framework
comments powered by Disqus