July 26, 2019

1999 words 10 mins read

Paper Group NANR 22

200K+ Crowdsourced Political Arguments for a New Chilean Constitution. Unsupervised AMR-Dependency Parse Alignment. Assessing the Verifiability of Attributions in News Text. SVNIT @ SemEval 2017 Task-6: Learning a Sense of Humor Using Supervised Approach. LIA at SemEval-2017 Task 4: An Ensemble of Neural Networks for Sentiment Classification. A Qua …

200K+ Crowdsourced Political Arguments for a New Chilean Constitution


Title	200K+ Crowdsourced Political Arguments for a New Chilean Constitution
Authors	Constanza Fierro, Claudio Fuentes, Jorge P{'e}rez, Mauricio Quezada
Abstract	In this paper we present the dataset of 200,000+ political arguments produced in the local phase of the 2016 Chilean constitutional process. We describe the human processing of this data by the government officials, and the manual tagging of arguments performed by members of our research group. Afterwards we focus on classification tasks that mimic the human processes, comparing linear methods with neural network architectures. The experiments show that some of the manual tasks are suitable for automatization. In particular, the best methods achieve a 90{%} top-5 accuracy in a multi-class classification of arguments, and 65{%} macro-averaged F1-score for tagging arguments according to a three-part argumentation model.
Tasks	Argument Mining
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-5101/
PDF	https://www.aclweb.org/anthology/W17-5101
PWC	https://paperswithcode.com/paper/200k-crowdsourced-political-arguments-for-a
Repo
Framework

Unsupervised AMR-Dependency Parse Alignment


Title	Unsupervised AMR-Dependency Parse Alignment
Authors	Wei-Te Chen, Martha Palmer
Abstract	In this paper, we introduce an Abstract Meaning Representation (AMR) to Dependency Parse aligner. Alignment is a preliminary step for AMR parsing, and our aligner improves current AMR parser performance. Our aligner involves several different features, including named entity tags and semantic role labels, and uses Expectation-Maximization training. Results show that our aligner reaches an 87.1{%} F-Score score with the experimental data, and enhances AMR parsing.
Tasks	Amr Parsing
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-1053/
PDF	https://www.aclweb.org/anthology/E17-1053
PWC	https://paperswithcode.com/paper/unsupervised-amr-dependency-parse-alignment
Repo
Framework

Assessing the Verifiability of Attributions in News Text


Title	Assessing the Verifiability of Attributions in News Text
Authors	Edward Newell, Ariane Schang, Drew Margolin, Derek Ruths
Abstract	When reporting the news, journalists rely on the statements of stakeholders, experts, and officials. The attribution of such a statement is verifiable if its fidelity to the source can be confirmed or denied. In this paper, we develop a new NLP task: determining the verifiability of an attribution based on linguistic cues. We operationalize the notion of verifiability as a score between 0 and 1 using human judgments in a comparison-based approach. Using crowdsourcing, we create a dataset of verifiability-scored attributions, and demonstrate a model that achieves an RMSE of 0.057 and Spearman{'}s rank correlation of 0.95 to human-generated scores. We discuss the application of this technique to the analysis of mass media.
Tasks	Question Answering
Published	2017-11-01
URL	https://www.aclweb.org/anthology/I17-1076/
PDF	https://www.aclweb.org/anthology/I17-1076
PWC	https://paperswithcode.com/paper/assessing-the-verifiability-of-attributions
Repo
Framework

SVNIT @ SemEval 2017 Task-6: Learning a Sense of Humor Using Supervised Approach


Title	SVNIT @ SemEval 2017 Task-6: Learning a Sense of Humor Using Supervised Approach
Authors	Rutal Mahajan, Mukesh Zaveri
Abstract	This paper describes the system devel-oped for SemEval 2017 task 6: {#}HashTagWars -Learning a Sense of Hu-mor. Learning to recognize sense of hu-mor is the important task for language understanding applications. Different set of features based on frequency of words, structure of tweets and semantics are used in this system to identify the presence of humor in tweets. Supervised machine learning approaches, Multilayer percep-tron and Na{"\i}ve Bayes are used to classify the tweets in to three level of sense of humor. For given Hashtag, the system finds the funniest tweet and predicts the amount of funniness of all the other tweets. In official submitted runs, we have achieved 0.506 accuracy using mul-tilayer perceptron in subtask-A and 0.938 distance in subtask-B. Using Na{"\i}ve bayes in subtask-B, the system achieved 0.949 distance. Apart from official runs, this system have scored 0.751 accuracy in subtask-A using SVM. But still there is a wide room for improvement in system.
Tasks	Opinion Mining, Sentiment Analysis
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-2069/
PDF	https://www.aclweb.org/anthology/S17-2069
PWC	https://paperswithcode.com/paper/svnit-semeval-2017-task-6-learning-a-sense-of
Repo
Framework

LIA at SemEval-2017 Task 4: An Ensemble of Neural Networks for Sentiment Classification


Title	LIA at SemEval-2017 Task 4: An Ensemble of Neural Networks for Sentiment Classification
Authors	Mickael Rouvier
Abstract	This paper describes the system developed at LIA for the SemEval-2017 evaluation campaign. The goal of Task 4.A was to identify sentiment polarity in tweets. The system is an ensemble of Deep Neural Network (DNN) models: Convolutional Neural Network (CNN) and Recurrent Neural Network Long Short-Term Memory (RNN-LSTM). We initialize the input representation of DNN with different sets of embeddings trained on large datasets. The ensemble of DNNs are combined using a score-level fusion approach. The system ranked 2nd at SemEval-2017 and obtained an average recall of 67.6{%}.
Tasks	Sentence Classification, Sentiment Analysis, Word Embeddings
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-2128/
PDF	https://www.aclweb.org/anthology/S17-2128
PWC	https://paperswithcode.com/paper/lia-at-semeval-2017-task-4-an-ensemble-of
Repo
Framework

A Quantitative Study of Data in the NLP community


Title	A Quantitative Study of Data in the NLP community
Authors	Margot Mieskes
Abstract	We present results on a quantitative analysis of publications in the NLP domain on collecting, publishing and availability of research data. We find that a wide range of publications rely on data crawled from the web, but few give details on how potentially sensitive data was treated. Additionally, we find that while links to repositories of data are given, they often do not work even a short time after publication. We put together several suggestions on how to improve this situation based on publications from the NLP domain, but also other research areas.
Tasks	Speech Recognition
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1603/
PDF	https://www.aclweb.org/anthology/W17-1603
PWC	https://paperswithcode.com/paper/a-quantitative-study-of-data-in-the-nlp
Repo
Framework

Inducing Event Types and Roles in Reverse: Using Function to Discover Theme


Title	Inducing Event Types and Roles in Reverse: Using Function to Discover Theme
Authors	Natalie Ahn
Abstract	With growing interest in automated event extraction, there is an increasing need to overcome the labor costs of hand-written event templates, entity lists, and annotated corpora. In the last few years, more inductive approaches have emerged, seeking to discover unknown event types and roles in raw text. The main recent efforts use probabilistic generative models, as in topic modeling, which are formally concise but do not always yield stable or easily interpretable results. We argue that event schema induction can benefit from greater structure in the process and in linguistic features that distinguish words{'} functions and themes. To maximize our use of limited data, we reverse the typical schema induction steps and introduce new similarity measures, building an intuitive process for inducing the structure of unknown events.
Tasks
Published	2017-08-01
URL	https://www.aclweb.org/anthology/W17-2710/
PDF	https://www.aclweb.org/anthology/W17-2710
PWC	https://paperswithcode.com/paper/inducing-event-types-and-roles-in-reverse
Repo
Framework

Patterns of Argumentation Strategies across Topics


Title	Patterns of Argumentation Strategies across Topics
Authors	Khalid Al-Khatib, Henning Wachsmuth, Matthias Hagen, Benno Stein
Abstract	This paper presents an analysis of argumentation strategies in news editorials within and across topics. Given nearly 29,000 argumentative editorials from the New York Times, we develop two machine learning models, one for determining an editorial{'}s topic, and one for identifying evidence types in the editorial. Based on the distribution and structure of the identified types, we analyze the usage patterns of argumentation strategies among 12 different topics. We detect several common patterns that provide insights into the manifestation of argumentation strategies. Also, our experiments reveal clear correlations between the topics and the detected patterns.
Tasks	Argument Mining
Published	2017-09-01
URL	https://www.aclweb.org/anthology/D17-1141/
PDF	https://www.aclweb.org/anthology/D17-1141
PWC	https://paperswithcode.com/paper/patterns-of-argumentation-strategies-across
Repo
Framework

Visualizing and Understanding Neural Machine Translation


Title	Visualizing and Understanding Neural Machine Translation
Authors	Yanzhuo Ding, Yang Liu, Huanbo Luan, Maosong Sun
Abstract	While neural machine translation (NMT) has made remarkable progress in recent years, it is hard to interpret its internal workings due to the continuous representations and non-linearity of neural networks. In this work, we propose to use layer-wise relevance propagation (LRP) to compute the contribution of each contextual word to arbitrary hidden states in the attention-based encoder-decoder framework. We show that visualization with LRP helps to interpret the internal workings of NMT and analyze translation errors.
Tasks	Machine Translation
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-1106/
PDF	https://www.aclweb.org/anthology/P17-1106
PWC	https://paperswithcode.com/paper/visualizing-and-understanding-neural-machine
Repo
Framework

Identification and Classification of the Most Important Moments in Students’ Collaborative Chats


Title	Identification and Classification of the Most Important Moments in Students’ Collaborative Chats
Authors	Costin Chiru, Remus Decea
Abstract	In this paper, we present an application for the automatic identification of the important moments that might occur during students{'} collaborative chats. The moments are detected based on the input received from the user, who may choose to perform an analysis on the topics that interest him/her. Moreover, the application offers various types of suggestive and intuitive graphics that aid the user in identification of such moments. There are two main aspects that are considered when identifying important moments: the concepts{'} frequency and distribution throughout the conversation and the chat tempo, which is analyzed for identifying intensively debated concepts. By the tempo of the chat we understand the rate at which the ideas are input by the chat participants, expressed by the utterances{'} timestamps.
Tasks
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1024/
PDF	https://doi.org/10.26615/978-954-452-049-6_024
PWC	https://paperswithcode.com/paper/identification-and-classification-of-the-most
Repo
Framework

Explaining and Generalizing Skip-Gram through Exponential Family Principal Component Analysis


Title	Explaining and Generalizing Skip-Gram through Exponential Family Principal Component Analysis
Authors	Ryan Cotterell, Adam Poliak, Benjamin Van Durme, Jason Eisner
Abstract	The popular skip-gram model induces word embeddings by exploiting the signal from word-context coocurrence. We offer a new interpretation of skip-gram based on exponential family PCA-a form of matrix factorization to generalize the skip-gram model to tensor factorization. In turn, this lets us train embeddings through richer higher-order coocurrences, e.g., triples that include positional information (to incorporate syntax) or morphological information (to share parameters across related words). We experiment on 40 languages and show our model improves upon skip-gram.
Tasks	Word Embeddings
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-2028/
PDF	https://www.aclweb.org/anthology/E17-2028
PWC	https://paperswithcode.com/paper/explaining-and-generalizing-skip-gram-through
Repo
Framework

Verb-Particle Constructions in Questions


Title	Verb-Particle Constructions in Questions
Authors	Veronika Vincze
Abstract	In this paper, we investigate the behavior of verb-particle constructions in English questions. We present a small dataset that contains questions and verb-particle construction candidates. We demonstrate that there are significant differences in the distribution of WH-words, verbs and prepositions/particles in sentences that contain VPCs and sentences that contain only verb + prepositional phrase combinations both by statistical means and in machine learning experiments. Hence, VPCs and non-VPCs can be effectively separated from each other by using a rich feature set, containing several novel features.
Tasks	Chunking
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1721/
PDF	https://www.aclweb.org/anthology/W17-1721
PWC	https://paperswithcode.com/paper/verb-particle-constructions-in-questions
Repo
Framework


Title	Guiding Interaction Behaviors for Multi-modal Grounded Language Learning
Authors	Jesse Thomason, Jivko Sinapov, Raymond Mooney
Abstract	Multi-modal grounded language learning connects language predicates to physical properties of objects in the world. Sensing with multiple modalities, such as audio, haptics, and visual colors and shapes while performing interaction behaviors like lifting, dropping, and looking on objects enables a robot to ground non-visual predicates like {`}empty{''} as well as visual predicates like {`}red{''}. Previous work has established that grounding in multi-modal space improves performance on object retrieval from human descriptions. In this work, we gather behavior annotations from humans and demonstrate that these improve language grounding performance by allowing a system to focus on relevant behaviors for words like {`}white{''} or {`}half-full{''} that can be understood by looking or lifting, respectively. We also explore adding modality annotations (whether to focus on audio or haptics when performing a behavior), which improves performance, and sharing information between linguistically related predicates (if {`}green{''} is a color, {`}white{''} is a color), which improves grounding recall but at the cost of precision.
Tasks
Published	2017-08-01
URL	https://www.aclweb.org/anthology/W17-2803/
PDF	https://www.aclweb.org/anthology/W17-2803
PWC	https://paperswithcode.com/paper/guiding-interaction-behaviors-for-multi-modal
Repo
Framework

POS Tagging For Resource Poor Languages Through Feature Projection


Title	POS Tagging For Resource Poor Languages Through Feature Projection
Authors	Pruthwik Mishra, V Mujadia, an, Dipti Misra Sharma
Abstract
Tasks
Published	2017-12-01
URL	https://www.aclweb.org/anthology/W17-7507/
PDF	https://www.aclweb.org/anthology/W17-7507
PWC	https://paperswithcode.com/paper/pos-tagging-for-resource-poor-languages
Repo
Framework

Unsupervised Method for Improving Arabic Speech Recognition Systems


Title	Unsupervised Method for Improving Arabic Speech Recognition Systems
Authors	Mohamed Labidi, Mohsen Maraoui, Mounir Zrigui
Abstract
Tasks	Language Modelling, Speech Recognition
Published	2017-11-01
URL	https://www.aclweb.org/anthology/Y17-1024/
PDF	https://www.aclweb.org/anthology/Y17-1024
PWC	https://paperswithcode.com/paper/unsupervised-method-for-improving-arabic
Repo
Framework