May 4, 2019

2181 words 11 mins read

Paper Group NANR 215

DSIC-ELIRF at SemEval-2016 Task 4: Message Polarity Classification in Twitter using a Support Vector Machine Approach. ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling. An Empirical Evaluation of Noise Contrastive Estimation for the Neural Network Joint Model of Translation. Generating Video Description …

DSIC-ELIRF at SemEval-2016 Task 4: Message Polarity Classification in Twitter using a Support Vector Machine Approach


Title	DSIC-ELIRF at SemEval-2016 Task 4: Message Polarity Classification in Twitter using a Support Vector Machine Approach
Authors	V{'\i}ctor Martinez Morant, LLu{'\i}s-F. Hurtado, Ferran Pla
Abstract
Tasks	Opinion Mining, Sentiment Analysis, Tokenization
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1029/
PDF	https://www.aclweb.org/anthology/S16-1029
PWC	https://paperswithcode.com/paper/dsic-elirf-at-semeval-2016-task-4-message
Repo
Framework

ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling


Title	ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling
Authors	Forough Poursabzi-Sangdeh, Jordan Boyd-Graber, Leah Findlater, Kevin Seppi
Abstract
Tasks	Active Learning, Text Classification, Topic Models
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1110/
PDF	https://www.aclweb.org/anthology/P16-1110
PWC	https://paperswithcode.com/paper/alto-active-learning-with-topic-overviews-for
Repo
Framework

An Empirical Evaluation of Noise Contrastive Estimation for the Neural Network Joint Model of Translation


Title	An Empirical Evaluation of Noise Contrastive Estimation for the Neural Network Joint Model of Translation
Authors	Colin Cherry
Abstract
Tasks	Domain Adaptation, Language Modelling, Machine Translation
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-1006/
PDF	https://www.aclweb.org/anthology/N16-1006
PWC	https://paperswithcode.com/paper/an-empirical-evaluation-of-noise-contrastive
Repo
Framework

Generating Video Description using Sequence-to-sequence Model with Temporal Attention


Title	Generating Video Description using Sequence-to-sequence Model with Temporal Attention
Authors	Natsuda Laokulrat, Sang Phan, Noriki Nishida, Raphael Shu, Yo Ehara, Naoaki Okazaki, Yusuke Miyao, Hideki Nakayama
Abstract	Automatic video description generation has recently been getting attention after rapid advancement in image caption generation. Automatically generating description for a video is more challenging than for an image due to its temporal dynamics of frames. Most of the work relied on Recurrent Neural Network (RNN) and recently attentional mechanisms have also been applied to make the model learn to focus on some frames of the video while generating each word in a describing sentence. In this paper, we focus on a sequence-to-sequence approach with temporal attention mechanism. We analyze and compare the results from different attention model configuration. By applying the temporal attention mechanism to the system, we can achieve a METEOR score of 0.310 on Microsoft Video Description dataset, which outperformed the state-of-the-art system so far.
Tasks	Video Classification, Video Description
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1005/
PDF	https://www.aclweb.org/anthology/C16-1005
PWC	https://paperswithcode.com/paper/generating-video-description-using-sequence
Repo
Framework

MarsaGram: an excursion in the forests of parsing trees


Title	MarsaGram: an excursion in the forests of parsing trees
Authors	Philippe Blache, St{'e}phane Rauzy, Gr{'e}goire Montcheuil
Abstract	The question of how to compare languages and more generally the domain of linguistic typology, relies on the study of different linguistic properties or phenomena. Classically, such a comparison is done semi-manually, for example by extracting information from databases such as the WALS. However, it remains difficult to identify precisely regular parameters, available for different languages, that can be used as a basis towards modeling. We propose in this paper, focusing on the question of syntactic typology, a method for automatically extracting such parameters from treebanks, bringing them into a typology perspective. We present the method and the tools for inferring such information and navigating through the treebanks. The approach has been applied to 10 languages of the Universal Dependencies Treebank. We approach is evaluated by showing how automatic classification correlates with language families.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1370/
PDF	https://www.aclweb.org/anthology/L16-1370
PWC	https://paperswithcode.com/paper/marsagram-an-excursion-in-the-forests-of
Repo
Framework

On-line Multilingual Linguistic Services


Title	On-line Multilingual Linguistic Services
Authors	Eric Wehrli, Yves Scherrer, Luka Nerima
Abstract	In this demo, we present our free on-line multilingual linguistic services which allow to analyze sentences or to extract collocations from a corpus directly on-line, or by uploading a corpus. They are available for 8 European languages (English, French, German, Greek, Italian, Portuguese, Romanian, Spanish) and can also be accessed as web services by programs. While several open systems are available for POS-tagging and dependency parsing or terminology extraction, their integration into an application requires some computational competence. Furthermore, none of the parsers/taggers handles MWEs very satisfactorily, in particular when the two terms of the collocation are distant from each other or in reverse order. Our tools, on the other hand, are specifically designed for users with no particular computational literacy. They do not require from the user any download, installation or adaptation if used on-line, and their integration in an application, using one the scripts described below is quite easy. Furthermore, by default, the parser handles collocations and other MWEs, as well as anaphora resolution (limited to 3rd person personal pronouns). When used in the tagger mode, it can be set to display grammatical functions and collocations.
Tasks	Dependency Parsing
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2019/
PDF	https://www.aclweb.org/anthology/C16-2019
PWC	https://paperswithcode.com/paper/on-line-multilingual-linguistic-services
Repo
Framework

Word Order Sensitive Embedding Features/Conditional Random Field-based Chinese Grammatical Error Detection


Title	Word Order Sensitive Embedding Features/Conditional Random Field-based Chinese Grammatical Error Detection
Authors	Wei-Chieh Chou, Chin-Kui Lin, Yuan-Fu Liao, Yih-Ru Wang
Abstract	This paper discusses how to adapt two new word embedding features to build a more efficient Chinese Grammatical Error Diagnosis (CGED) systems to assist Chinese foreign learners (CFLs) in improving their written essays. The major idea is to apply word order sensitive Word2Vec approaches including (1) structured skip-gram and (2) continuous window (CWindow) models, because they are more suitable for solving syntax-based problems. The proposed new features were evaluated on the Test of Chinese as a Foreign Language (TOCFL) learner database provided by NLP-TEA-3{&}CGED shared task. Experimental results showed that the new features did work better than the traditional word order insensitive Word2Vec approaches. Moreover, according to the official evaluation results, our system achieved the lowest (0.1362) false positive (FA) and the highest precision rates in all three measurements.
Tasks	Grammatical Error Detection, Language Modelling
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4910/
PDF	https://www.aclweb.org/anthology/W16-4910
PWC	https://paperswithcode.com/paper/word-order-sensitive-embedding
Repo
Framework

Can Text Simplification Help Machine Translation?


Title	Can Text Simplification Help Machine Translation?
Authors	Sanja {\v{S}}tajner, Maja Popovic
Abstract
Tasks	Machine Translation, Text Simplification
Published	2016-01-01
URL	https://www.aclweb.org/anthology/W16-3411/
PDF	https://www.aclweb.org/anthology/W16-3411
PWC	https://paperswithcode.com/paper/can-text-simplification-help-machine
Repo
Framework

Phrase-based Machine Translation using Multiple Preordering Candidates


Title	Phrase-based Machine Translation using Multiple Preordering Candidates
Authors	Yusuke Oda, Taku Kudo, Tetsuji Nakagawa, Taro Watanabe
Abstract	In this paper, we propose a new decoding method for phrase-based statistical machine translation which directly uses multiple preordering candidates as a graph structure. Compared with previous phrase-based decoding methods, our method is based on a simple left-to-right dynamic programming in which no decoding-time reordering is performed. As a result, its runtime is very fast and implementing the algorithm becomes easy. Our system does not depend on specific preordering methods as long as they output multiple preordering candidates, and it is trivial to employ existing preordering methods into our system. In our experiments for translating diverse 11 languages into English, the proposed method outperforms conventional phrase-based decoder in terms of translation qualities under comparable or faster decoding time.
Tasks	Machine Translation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1134/
PDF	https://www.aclweb.org/anthology/C16-1134
PWC	https://paperswithcode.com/paper/phrase-based-machine-translation-using
Repo
Framework

Corpus Resources for Dispute Mediation Discourse


Title	Corpus Resources for Dispute Mediation Discourse
Authors	Mathilde Janier, Chris Reed
Abstract	Dispute mediation is a growing activity in the resolution of conflicts, and more and more research emerge to enhance and better understand this (until recently) understudied practice. Corpus analyses are necessary to study discourse in this context; yet, little data is available, mainly because of its confidentiality principle. After proposing hints and avenues to acquire transcripts of mediation sessions, this paper presents the Dispute Mediation Corpus, which gathers annotated excerpts of mediation dialogues. Although developed as part of a project on argumentation, it is freely available and the text data can be used by anyone. This first-ever open corpus of mediation interactions can be of interest to scholars studying discourse, but also conflict resolution, argumentation, linguistics, communication, etc. We advocate for using and extending this resource that may be valuable to a large variety of domains of research, particularly those striving to enhance the study of the rapidly growing activity of dispute mediation.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1161/
PDF	https://www.aclweb.org/anthology/L16-1161
PWC	https://paperswithcode.com/paper/corpus-resources-for-dispute-mediation
Repo
Framework

Chinese Grammatical Error Diagnosis Using Single Word Embedding


Title	Chinese Grammatical Error Diagnosis Using Single Word Embedding
Authors	Jinnan Yang, Bo Peng, Jin Wang, Jixian Zhang, Xuejie Zhang
Abstract	Abstract Automatic grammatical error detection for Chinese has been a big challenge for NLP researchers. Due to the formal and strict grammar rules in Chinese, it is hard for foreign students to master Chinese. A computer-assisted learning tool which can automatically detect and correct Chinese grammatical errors is necessary for those foreign students. Some of the previous works have sought to identify Chinese grammatical errors using template- and learning-based methods. In contrast, this study introduced convolutional neural network (CNN) and long-short term memory (LSTM) for the shared task of Chinese Grammatical Error Diagnosis (CGED). Different from traditional word-based embedding, single word embedding was used as input of CNN and LSTM. The proposed single word embedding can capture both semantic and syntactic information to detect those four type grammatical error. In experimental evaluation, the recall and f1-score of our submitted results Run1 of the TOCFL testing data ranked the fourth place in all submissions in detection-level.
Tasks	Grammatical Error Detection, Language Modelling, Multi-Label Classification
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4920/
PDF	https://www.aclweb.org/anthology/W16-4920
PWC	https://paperswithcode.com/paper/chinese-grammatical-error-diagnosis-using-2
Repo
Framework

Capturing Pragmatic Knowledge in Article Usage Prediction using LSTMs


Title	Capturing Pragmatic Knowledge in Article Usage Prediction using LSTMs
Authors	Jad Kabbara, Yulan Feng, Jackie Chi Kit Cheung
Abstract	We examine the potential of recurrent neural networks for handling pragmatic inferences involving complex contextual cues for the task of article usage prediction. We train and compare several variants of Long Short-Term Memory (LSTM) networks with an attention mechanism. Our model outperforms a previous state-of-the-art system, achieving up to 96.63{%} accuracy on the WSJ/PTB corpus. In addition, we perform a series of analyses to understand the impact of various model choices. We find that the gain in performance can be attributed to the ability of LSTMs to pick up on contextual cues, both local and further away in distance, and that the model is able to solve cases involving reasoning about coreference and synonymy. We also show how the attention mechanism contributes to the interpretability of the model{'}s effectiveness.
Tasks	Grammatical Error Detection, Machine Translation, Word Embeddings
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1247/
PDF	https://www.aclweb.org/anthology/C16-1247
PWC	https://paperswithcode.com/paper/capturing-pragmatic-knowledge-in-article
Repo
Framework

Bidirectional Recurrent Convolutional Neural Network for Relation Classification


Title	Bidirectional Recurrent Convolutional Neural Network for Relation Classification
Authors	Rui Cai, Xiaodong Zhang, Houfeng Wang
Abstract
Tasks	Relation Classification, Word Embeddings
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1072/
PDF	https://www.aclweb.org/anthology/P16-1072
PWC	https://paperswithcode.com/paper/bidirectional-recurrent-convolutional-neural
Repo
Framework

The Construction of a Chinese Collocational Knowledge Resource and Its Application for Second Language Acquisition


Title	The Construction of a Chinese Collocational Knowledge Resource and Its Application for Second Language Acquisition
Authors	Renfen Hu, Jiayong Chen, Kuang-hua Chen
Abstract	The appropriate use of collocations is a challenge for second language acquisition. However, high quality and easily accessible Chinese collocation resources are not available for both teachers and students. This paper presents the design and construction of a large scale resource of Chinese collocational knowledge, and a web-based application (OCCA, Online Chinese Collocation Assistant) which offers free and convenient collocation search service to end users. We define and classify collocations based on practical language acquisition needs and utilize a syntax based method to extract nine types of collocations. Totally 37 extraction rules are compiled with word, POS and dependency relation features, 1,750,000 collocations are extracted from a corpus for L2 learning and complementary Wikipedia data, and OCCA is implemented based on these extracted collocations. By comparing OCCA with two traditional collocation dictionaries, we find OCCA has higher entry coverage and collocation quantity, and our method achieves quite low error rate at less than 5{%}. We also discuss how to apply collocational knowledge to grammatical error detection and demonstrate comparable performance to the best results in 2015 NLP-TEA CGED shared task. The preliminary experiment shows that the collocation knowledge is helpful in detecting all the four types of grammatical errors.
Tasks	Grammatical Error Detection, Language Acquisition
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1307/
PDF	https://www.aclweb.org/anthology/C16-1307
PWC	https://paperswithcode.com/paper/the-construction-of-a-chinese-collocational
Repo
Framework

Geolocation Prediction in Twitter Using Location Indicative Words and Textual Features


Title	Geolocation Prediction in Twitter Using Location Indicative Words and Textual Features
Authors	Lianhua Chi, Kwan Hui Lim, Nebula Alam, Christopher J. Butler
Abstract	Knowing the location of a social media user and their posts is important for various purposes, such as the recommendation of location-based items/services, and locality detection of crisis/disasters. This paper describes our submission to the shared task {``}Geolocation Prediction in Twitter{''} of the 2nd Workshop on Noisy User-generated Text. In this shared task, we propose an algorithm to predict the location of Twitter users and tweets using a multinomial Naive Bayes classifier trained on Location Indicative Words and various textual features (such as city/country names, {#}hashtags and @mentions). We compared our approach against various baselines based on Location Indicative Words, city/country names, {#}hashtags and @mentions as individual feature sets, and experimental results show that our approach outperforms these baselines in terms of classification accuracy, mean and median error distance. \|
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-3930/
PDF	https://www.aclweb.org/anthology/W16-3930
PWC	https://paperswithcode.com/paper/geolocation-prediction-in-twitter-using
Repo
Framework