May 4, 2019

2181 words 11 mins read

Paper Group NANR 215

Paper Group NANR 215

DSIC-ELIRF at SemEval-2016 Task 4: Message Polarity Classification in Twitter using a Support Vector Machine Approach. ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling. An Empirical Evaluation of Noise Contrastive Estimation for the Neural Network Joint Model of Translation. Generating Video Description …

DSIC-ELIRF at SemEval-2016 Task 4: Message Polarity Classification in Twitter using a Support Vector Machine Approach

Title DSIC-ELIRF at SemEval-2016 Task 4: Message Polarity Classification in Twitter using a Support Vector Machine Approach
Authors V{'\i}ctor Martinez Morant, LLu{'\i}s-F. Hurtado, Ferran Pla
Abstract
Tasks Opinion Mining, Sentiment Analysis, Tokenization
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1029/
PDF https://www.aclweb.org/anthology/S16-1029
PWC https://paperswithcode.com/paper/dsic-elirf-at-semeval-2016-task-4-message
Repo
Framework

ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling

Title ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling
Authors Forough Poursabzi-Sangdeh, Jordan Boyd-Graber, Leah Findlater, Kevin Seppi
Abstract
Tasks Active Learning, Text Classification, Topic Models
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1110/
PDF https://www.aclweb.org/anthology/P16-1110
PWC https://paperswithcode.com/paper/alto-active-learning-with-topic-overviews-for
Repo
Framework

An Empirical Evaluation of Noise Contrastive Estimation for the Neural Network Joint Model of Translation

Title An Empirical Evaluation of Noise Contrastive Estimation for the Neural Network Joint Model of Translation
Authors Colin Cherry
Abstract
Tasks Domain Adaptation, Language Modelling, Machine Translation
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1006/
PDF https://www.aclweb.org/anthology/N16-1006
PWC https://paperswithcode.com/paper/an-empirical-evaluation-of-noise-contrastive
Repo
Framework

Generating Video Description using Sequence-to-sequence Model with Temporal Attention

Title Generating Video Description using Sequence-to-sequence Model with Temporal Attention
Authors Natsuda Laokulrat, Sang Phan, Noriki Nishida, Raphael Shu, Yo Ehara, Naoaki Okazaki, Yusuke Miyao, Hideki Nakayama
Abstract Automatic video description generation has recently been getting attention after rapid advancement in image caption generation. Automatically generating description for a video is more challenging than for an image due to its temporal dynamics of frames. Most of the work relied on Recurrent Neural Network (RNN) and recently attentional mechanisms have also been applied to make the model learn to focus on some frames of the video while generating each word in a describing sentence. In this paper, we focus on a sequence-to-sequence approach with temporal attention mechanism. We analyze and compare the results from different attention model configuration. By applying the temporal attention mechanism to the system, we can achieve a METEOR score of 0.310 on Microsoft Video Description dataset, which outperformed the state-of-the-art system so far.
Tasks Video Classification, Video Description
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1005/
PDF https://www.aclweb.org/anthology/C16-1005
PWC https://paperswithcode.com/paper/generating-video-description-using-sequence
Repo
Framework

MarsaGram: an excursion in the forests of parsing trees

Title MarsaGram: an excursion in the forests of parsing trees
Authors Philippe Blache, St{'e}phane Rauzy, Gr{'e}goire Montcheuil
Abstract The question of how to compare languages and more generally the domain of linguistic typology, relies on the study of different linguistic properties or phenomena. Classically, such a comparison is done semi-manually, for example by extracting information from databases such as the WALS. However, it remains difficult to identify precisely regular parameters, available for different languages, that can be used as a basis towards modeling. We propose in this paper, focusing on the question of syntactic typology, a method for automatically extracting such parameters from treebanks, bringing them into a typology perspective. We present the method and the tools for inferring such information and navigating through the treebanks. The approach has been applied to 10 languages of the Universal Dependencies Treebank. We approach is evaluated by showing how automatic classification correlates with language families.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1370/
PDF https://www.aclweb.org/anthology/L16-1370
PWC https://paperswithcode.com/paper/marsagram-an-excursion-in-the-forests-of
Repo
Framework

On-line Multilingual Linguistic Services

Title On-line Multilingual Linguistic Services
Authors Eric Wehrli, Yves Scherrer, Luka Nerima
Abstract In this demo, we present our free on-line multilingual linguistic services which allow to analyze sentences or to extract collocations from a corpus directly on-line, or by uploading a corpus. They are available for 8 European languages (English, French, German, Greek, Italian, Portuguese, Romanian, Spanish) and can also be accessed as web services by programs. While several open systems are available for POS-tagging and dependency parsing or terminology extraction, their integration into an application requires some computational competence. Furthermore, none of the parsers/taggers handles MWEs very satisfactorily, in particular when the two terms of the collocation are distant from each other or in reverse order. Our tools, on the other hand, are specifically designed for users with no particular computational literacy. They do not require from the user any download, installation or adaptation if used on-line, and their integration in an application, using one the scripts described below is quite easy. Furthermore, by default, the parser handles collocations and other MWEs, as well as anaphora resolution (limited to 3rd person personal pronouns). When used in the tagger mode, it can be set to display grammatical functions and collocations.
Tasks Dependency Parsing
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2019/
PDF https://www.aclweb.org/anthology/C16-2019
PWC https://paperswithcode.com/paper/on-line-multilingual-linguistic-services
Repo
Framework

Word Order Sensitive Embedding Features/Conditional Random Field-based Chinese Grammatical Error Detection

Title Word Order Sensitive Embedding Features/Conditional Random Field-based Chinese Grammatical Error Detection
Authors Wei-Chieh Chou, Chin-Kui Lin, Yuan-Fu Liao, Yih-Ru Wang
Abstract This paper discusses how to adapt two new word embedding features to build a more efficient Chinese Grammatical Error Diagnosis (CGED) systems to assist Chinese foreign learners (CFLs) in improving their written essays. The major idea is to apply word order sensitive Word2Vec approaches including (1) structured skip-gram and (2) continuous window (CWindow) models, because they are more suitable for solving syntax-based problems. The proposed new features were evaluated on the Test of Chinese as a Foreign Language (TOCFL) learner database provided by NLP-TEA-3{&}CGED shared task. Experimental results showed that the new features did work better than the traditional word order insensitive Word2Vec approaches. Moreover, according to the official evaluation results, our system achieved the lowest (0.1362) false positive (FA) and the highest precision rates in all three measurements.
Tasks Grammatical Error Detection, Language Modelling
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4910/
PDF https://www.aclweb.org/anthology/W16-4910
PWC https://paperswithcode.com/paper/word-order-sensitive-embedding
Repo
Framework

Can Text Simplification Help Machine Translation?

Title Can Text Simplification Help Machine Translation?
Authors Sanja {\v{S}}tajner, Maja Popovic
Abstract
Tasks Machine Translation, Text Simplification
Published 2016-01-01
URL https://www.aclweb.org/anthology/W16-3411/
PDF https://www.aclweb.org/anthology/W16-3411
PWC https://paperswithcode.com/paper/can-text-simplification-help-machine
Repo
Framework

Phrase-based Machine Translation using Multiple Preordering Candidates

Title Phrase-based Machine Translation using Multiple Preordering Candidates
Authors Yusuke Oda, Taku Kudo, Tetsuji Nakagawa, Taro Watanabe
Abstract In this paper, we propose a new decoding method for phrase-based statistical machine translation which directly uses multiple preordering candidates as a graph structure. Compared with previous phrase-based decoding methods, our method is based on a simple left-to-right dynamic programming in which no decoding-time reordering is performed. As a result, its runtime is very fast and implementing the algorithm becomes easy. Our system does not depend on specific preordering methods as long as they output multiple preordering candidates, and it is trivial to employ existing preordering methods into our system. In our experiments for translating diverse 11 languages into English, the proposed method outperforms conventional phrase-based decoder in terms of translation qualities under comparable or faster decoding time.
Tasks Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1134/
PDF https://www.aclweb.org/anthology/C16-1134
PWC https://paperswithcode.com/paper/phrase-based-machine-translation-using
Repo
Framework

Corpus Resources for Dispute Mediation Discourse

Title Corpus Resources for Dispute Mediation Discourse
Authors Mathilde Janier, Chris Reed
Abstract Dispute mediation is a growing activity in the resolution of conflicts, and more and more research emerge to enhance and better understand this (until recently) understudied practice. Corpus analyses are necessary to study discourse in this context; yet, little data is available, mainly because of its confidentiality principle. After proposing hints and avenues to acquire transcripts of mediation sessions, this paper presents the Dispute Mediation Corpus, which gathers annotated excerpts of mediation dialogues. Although developed as part of a project on argumentation, it is freely available and the text data can be used by anyone. This first-ever open corpus of mediation interactions can be of interest to scholars studying discourse, but also conflict resolution, argumentation, linguistics, communication, etc. We advocate for using and extending this resource that may be valuable to a large variety of domains of research, particularly those striving to enhance the study of the rapidly growing activity of dispute mediation.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1161/
PDF https://www.aclweb.org/anthology/L16-1161
PWC https://paperswithcode.com/paper/corpus-resources-for-dispute-mediation
Repo
Framework

Chinese Grammatical Error Diagnosis Using Single Word Embedding

Title Chinese Grammatical Error Diagnosis Using Single Word Embedding
Authors Jinnan Yang, Bo Peng, Jin Wang, Jixian Zhang, Xuejie Zhang
Abstract Abstract Automatic grammatical error detection for Chinese has been a big challenge for NLP researchers. Due to the formal and strict grammar rules in Chinese, it is hard for foreign students to master Chinese. A computer-assisted learning tool which can automatically detect and correct Chinese grammatical errors is necessary for those foreign students. Some of the previous works have sought to identify Chinese grammatical errors using template- and learning-based methods. In contrast, this study introduced convolutional neural network (CNN) and long-short term memory (LSTM) for the shared task of Chinese Grammatical Error Diagnosis (CGED). Different from traditional word-based embedding, single word embedding was used as input of CNN and LSTM. The proposed single word embedding can capture both semantic and syntactic information to detect those four type grammatical error. In experimental evaluation, the recall and f1-score of our submitted results Run1 of the TOCFL testing data ranked the fourth place in all submissions in detection-level.
Tasks Grammatical Error Detection, Language Modelling, Multi-Label Classification
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4920/
PDF https://www.aclweb.org/anthology/W16-4920
PWC https://paperswithcode.com/paper/chinese-grammatical-error-diagnosis-using-2
Repo
Framework

Capturing Pragmatic Knowledge in Article Usage Prediction using LSTMs

Title Capturing Pragmatic Knowledge in Article Usage Prediction using LSTMs
Authors Jad Kabbara, Yulan Feng, Jackie Chi Kit Cheung
Abstract We examine the potential of recurrent neural networks for handling pragmatic inferences involving complex contextual cues for the task of article usage prediction. We train and compare several variants of Long Short-Term Memory (LSTM) networks with an attention mechanism. Our model outperforms a previous state-of-the-art system, achieving up to 96.63{%} accuracy on the WSJ/PTB corpus. In addition, we perform a series of analyses to understand the impact of various model choices. We find that the gain in performance can be attributed to the ability of LSTMs to pick up on contextual cues, both local and further away in distance, and that the model is able to solve cases involving reasoning about coreference and synonymy. We also show how the attention mechanism contributes to the interpretability of the model{'}s effectiveness.
Tasks Grammatical Error Detection, Machine Translation, Word Embeddings
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1247/
PDF https://www.aclweb.org/anthology/C16-1247
PWC https://paperswithcode.com/paper/capturing-pragmatic-knowledge-in-article
Repo
Framework

Bidirectional Recurrent Convolutional Neural Network for Relation Classification

Title Bidirectional Recurrent Convolutional Neural Network for Relation Classification
Authors Rui Cai, Xiaodong Zhang, Houfeng Wang
Abstract
Tasks Relation Classification, Word Embeddings
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1072/
PDF https://www.aclweb.org/anthology/P16-1072
PWC https://paperswithcode.com/paper/bidirectional-recurrent-convolutional-neural
Repo
Framework

The Construction of a Chinese Collocational Knowledge Resource and Its Application for Second Language Acquisition

Title The Construction of a Chinese Collocational Knowledge Resource and Its Application for Second Language Acquisition
Authors Renfen Hu, Jiayong Chen, Kuang-hua Chen
Abstract The appropriate use of collocations is a challenge for second language acquisition. However, high quality and easily accessible Chinese collocation resources are not available for both teachers and students. This paper presents the design and construction of a large scale resource of Chinese collocational knowledge, and a web-based application (OCCA, Online Chinese Collocation Assistant) which offers free and convenient collocation search service to end users. We define and classify collocations based on practical language acquisition needs and utilize a syntax based method to extract nine types of collocations. Totally 37 extraction rules are compiled with word, POS and dependency relation features, 1,750,000 collocations are extracted from a corpus for L2 learning and complementary Wikipedia data, and OCCA is implemented based on these extracted collocations. By comparing OCCA with two traditional collocation dictionaries, we find OCCA has higher entry coverage and collocation quantity, and our method achieves quite low error rate at less than 5{%}. We also discuss how to apply collocational knowledge to grammatical error detection and demonstrate comparable performance to the best results in 2015 NLP-TEA CGED shared task. The preliminary experiment shows that the collocation knowledge is helpful in detecting all the four types of grammatical errors.
Tasks Grammatical Error Detection, Language Acquisition
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1307/
PDF https://www.aclweb.org/anthology/C16-1307
PWC https://paperswithcode.com/paper/the-construction-of-a-chinese-collocational
Repo
Framework

Geolocation Prediction in Twitter Using Location Indicative Words and Textual Features

Title Geolocation Prediction in Twitter Using Location Indicative Words and Textual Features
Authors Lianhua Chi, Kwan Hui Lim, Nebula Alam, Christopher J. Butler
Abstract Knowing the location of a social media user and their posts is important for various purposes, such as the recommendation of location-based items/services, and locality detection of crisis/disasters. This paper describes our submission to the shared task {``}Geolocation Prediction in Twitter{''} of the 2nd Workshop on Noisy User-generated Text. In this shared task, we propose an algorithm to predict the location of Twitter users and tweets using a multinomial Naive Bayes classifier trained on Location Indicative Words and various textual features (such as city/country names, {#}hashtags and @mentions). We compared our approach against various baselines based on Location Indicative Words, city/country names, {#}hashtags and @mentions as individual feature sets, and experimental results show that our approach outperforms these baselines in terms of classification accuracy, mean and median error distance. |
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3930/
PDF https://www.aclweb.org/anthology/W16-3930
PWC https://paperswithcode.com/paper/geolocation-prediction-in-twitter-using
Repo
Framework
comments powered by Disqus