May 5, 2019

1990 words 10 mins read

Paper Group NANR 103

An Empirical Analysis of Formality in Online Communication. Chinese Preposition Selection for Grammatical Error Diagnosis. Delineating Fields Using Mathematical Jargon. CoCoGen - Complexity Contour Generator: Automatic Assessment of Linguistic Complexity Using a Sliding-Window Technique. Aspect Based Sentiment Analysis using Sentiment Flow with Loc …

An Empirical Analysis of Formality in Online Communication


Title	An Empirical Analysis of Formality in Online Communication
Authors	Ellie Pavlick, Joel Tetreault
Abstract	This paper presents an empirical study of linguistic formality. We perform an analysis of humans{'} perceptions of formality in four different genres. These findings are used to develop a statistical model for predicting formality, which is evaluated under different feature settings and genres. We apply our model to an investigation of formality in online discussion forums, and present findings consistent with theories of formality and linguistic coordination.
Tasks	Automatic Writing, Knowledge Base Population, Natural Language Inference
Published	2016-01-01
URL	https://www.aclweb.org/anthology/Q16-1005/
PDF	https://www.aclweb.org/anthology/Q16-1005
PWC	https://paperswithcode.com/paper/an-empirical-analysis-of-formality-in-online
Repo
Framework

Chinese Preposition Selection for Grammatical Error Diagnosis


Title	Chinese Preposition Selection for Grammatical Error Diagnosis
Authors	Hen-Hsen Huang, Yen-Chi Shao, Hsin-Hsi Chen
Abstract	Misuse of Chinese prepositions is one of common word usage errors in grammatical error diagnosis. In this paper, we adopt the Chinese Gigaword corpus and HSK corpus as L1 and L2 corpora, respectively. We explore gated recurrent neural network model (GRU), and an ensemble of GRU model and maximum entropy language model (GRU-ME) to select the best preposition from 43 candidates for each test sentence. The experimental results show the advantage of the GRU models over simple RNN and n-gram models. We further analyze the effectiveness of linguistic information such as word boundary and part-of-speech tag in this task.
Tasks	Language Modelling
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1085/
PDF	https://www.aclweb.org/anthology/C16-1085
PWC	https://paperswithcode.com/paper/chinese-preposition-selection-for-grammatical
Repo
Framework

Delineating Fields Using Mathematical Jargon


Title	Delineating Fields Using Mathematical Jargon
Authors	Jevin West, Jason Portenoy
Abstract
Tasks	Information Retrieval
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-1508/
PDF	https://www.aclweb.org/anthology/W16-1508
PWC	https://paperswithcode.com/paper/delineating-fields-using-mathematical-jargon
Repo
Framework

CoCoGen - Complexity Contour Generator: Automatic Assessment of Linguistic Complexity Using a Sliding-Window Technique


Title	CoCoGen - Complexity Contour Generator: Automatic Assessment of Linguistic Complexity Using a Sliding-Window Technique
Authors	Str{"o}bel Marcus, Elma Kerz, Daniel Wiechmann, Stella Neumann
Abstract	We present a novel approach to the automatic assessment of text complexity based on a sliding-window technique that tracks the distribution of complexity within a text. Such distribution is captured by what we term {``}complexity contours{''} derived from a series of measurements for a given linguistic complexity measure. This approach is implemented in an automatic computational tool, CoCoGen {–} Complexity Contour Generator, which in its current version supports 32 indices of linguistic complexity. The goal of the paper is twofold: (1) to introduce the design of our computational tool based on a sliding-window technique and (2) to showcase this approach in the area of second language (L2) learning, i.e. more specifically, in the area of L2 writing. \|
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4103/
PDF	https://www.aclweb.org/anthology/W16-4103
PWC	https://paperswithcode.com/paper/cocogen-complexity-contour-generator
Repo
Framework

Aspect Based Sentiment Analysis using Sentiment Flow with Local and Non-local Neighbor Information


Title	Aspect Based Sentiment Analysis using Sentiment Flow with Local and Non-local Neighbor Information
Authors	Shubham Pateria
Abstract	Aspect-level analysis of sentiments contained in a review text is important to reveal a detailed picture of consumer opinions. While a plethora of methods have been traditionally employed for this task, majority focus has been on analyzing only aspect-centered local information. However, incorporating context information from non-local aspect neighbors may capture richer structure in review text and enhance prediction. This may especially be helpful to resolve ambiguous predictions. The context around an aspect can be incorporated using semantic relations within text and inter-label dependencies in the output. On the output side, this becomes a structured prediction task. However, non-local label correlations are computationally heavy and intractable to infer for structured prediction models like Conditional Random Fields (CRF). Moreover, some prior intuition is required to incorporate non-local context. Thus, inspired by previous research on multi-stage prediction, we propose a two-level model for aspect-based analysis. The proposed model uses predicted probability estimates from first level to incorporate neighbor information in the second level. The model is evaluated on data taken from SemEval Workshops and Bing Liu{'}s review collection. It shows comparatively better performance against few existing methods. Overall, we get prediction accuracy in a range of 83-88{%} and almost 3-4 point increment against baseline (first level only) scores.
Tasks	Aspect-Based Sentiment Analysis, Sentiment Analysis, Structured Prediction
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1248/
PDF	https://www.aclweb.org/anthology/C16-1248
PWC	https://paperswithcode.com/paper/aspect-based-sentiment-analysis-using
Repo
Framework

Imitation learning for language generation from unaligned data


Title	Imitation learning for language generation from unaligned data
Authors	Gerasimos Lampouras, Andreas Vlachos
Abstract	Natural language generation (NLG) is the task of generating natural language from a meaning representation. Current rule-based approaches require domain-specific and manually constructed linguistic resources, while most machine-learning based approaches rely on aligned training data and/or phrase templates. The latter are needed to restrict the search space for the structured prediction task defined by the unaligned datasets. In this work we propose the use of imitation learning for structured prediction which learns an incremental model that handles the large search space by avoiding explicit enumeration of the outputs. We focus on the Locally Optimal Learning to Search framework which allows us to train against non-decomposable loss functions such as the BLEU or ROUGE scores while not assuming gold standard alignments. We evaluate our approach on three datasets using both automatic measures and human judgements and achieve results comparable to the state-of-the-art approaches developed for each of them.
Tasks	Imitation Learning, Structured Prediction, Text Generation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1105/
PDF	https://www.aclweb.org/anthology/C16-1105
PWC	https://paperswithcode.com/paper/imitation-learning-for-language-generation
Repo
Framework

Inconsistency Detection in Semantic Annotation


Title	Inconsistency Detection in Semantic Annotation
Authors	Nora Hollenstein, Nathan Schneider, Bonnie Webber
Abstract	Inconsistencies are part of any manually annotated corpus. Automatically finding these inconsistencies and correcting them (even manually) can increase the quality of the data. Past research has focused mainly on detecting inconsistency in syntactic annotation. This work explores new approaches to detecting inconsistency in semantic annotation. Two ranking methods are presented in this paper: a discrepancy ranking and an entropy ranking. Those methods are then tested and evaluated on multiple corpora annotated with multiword expressions and supersense labels. The results show considerable improvements in detecting inconsistency candidates over a random baseline. Possible applications of methods for inconsistency detection are improving the annotation procedure as well as the guidelines and correcting errors in completed annotations.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1629/
PDF	https://www.aclweb.org/anthology/L16-1629
PWC	https://paperswithcode.com/paper/inconsistency-detection-in-semantic
Repo
Framework

How can we adapt generation to the user’s cognitive load?


Title	How can we adapt generation to the user’s cognitive load?
Authors	Vera Demberg
Abstract
Tasks	Text Generation
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-6622/
PDF	https://www.aclweb.org/anthology/W16-6622
PWC	https://paperswithcode.com/paper/how-can-we-adapt-generation-to-the-useras
Repo
Framework

The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation


Title	The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation
Authors	Jorge Proen{\c{c}}a, Dirce Celorico, C, Sara eias, Carla Lopes, Fern Perdig{~a}o, o
Abstract	This paper introduces the LetsRead Corpus of European Portuguese read speech from 6 to 10 years old children. The motivation for the creation of this corpus stems from the inexistence of databases with recordings of reading tasks of Portuguese children with different performance levels and including all the common reading aloud disfluencies. It is also essential to develop techniques to fulfill the main objective of the LetsRead project: to automatically evaluate the reading performance of children through the analysis of reading tasks. The collected data amounts to 20 hours of speech from 284 children from private and public Portuguese schools, with each child carrying out two tasks: reading sentences and reading a list of pseudowords, both with varying levels of difficulty throughout the school grades. In this paper, the design of the reading tasks presented to children is described, as well as the collection procedure. Manually annotated data is analyzed according to disfluencies and reading performance. The considered word difficulty parameter is also confirmed to be suitable for the pseudoword reading tasks.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1125/
PDF	https://www.aclweb.org/anthology/L16-1125
PWC	https://paperswithcode.com/paper/the-letsread-corpus-of-portuguese-children
Repo
Framework

Minimizing Regret on Reflexive Banach Spaces and Nash Equilibria in Continuous Zero-Sum Games


Title	Minimizing Regret on Reflexive Banach Spaces and Nash Equilibria in Continuous Zero-Sum Games
Authors	Maximilian Balandat, Walid Krichene, Claire Tomlin, Alexandre Bayen
Abstract	We study a general adversarial online learning problem, in which we are given a decision set X’ in a reflexive Banach space X and a sequence of reward vectors in the dual space of X. At each iteration, we choose an action from X’, based on the observed sequence of previous rewards. Our goal is to minimize regret, defined as the gap between the realized reward and the reward of the best fixed action in hindsight. Using results from infinite dimensional convex analysis, we generalize the method of Dual Averaging (or Follow the Regularized Leader) to our setting and obtain upper bounds on the worst-case regret that generalize many previous results. Under the assumption of uniformly continuous rewards, we obtain explicit regret bounds in a setting where the decision set is the set of probability distributions on a compact metric space S. Importantly, we make no convexity assumptions on either the set S or the reward functions. We also prove a general lower bound on the worst-case regret for any online algorithm. We then apply these results to the problem of learning in repeated two-player zero-sum games on compact metric spaces. In doing so, we first prove that if both players play a Hannan-consistent strategy, then with probability 1 the empirical distributions of play weakly converge to the set of Nash equilibria of the game. We then show that, under mild assumptions, Dual Averaging on the (infinite-dimensional) space of probability distributions indeed achieves Hannan-consistency.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6216-minimizing-regret-on-reflexive-banach-spaces-and-nash-equilibria-in-continuous-zero-sum-games
PDF	http://papers.nips.cc/paper/6216-minimizing-regret-on-reflexive-banach-spaces-and-nash-equilibria-in-continuous-zero-sum-games.pdf
PWC	https://paperswithcode.com/paper/minimizing-regret-on-reflexive-banach-spaces-1
Repo
Framework

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2016


Title	The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2016
Authors	Thanh-Le Ha, Eunah Cho, Jan Niehues, Mohammed Mediani, Matthias Sperber, Alex Allauzen, re, Alex Waibel, er
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2314/
PDF	https://www.aclweb.org/anthology/W16-2314
PWC	https://paperswithcode.com/paper/the-karlsruhe-institute-of-technology-systems-1
Repo
Framework

Investigating Language Universal and Specific Properties in Word Embeddings


Title	Investigating Language Universal and Specific Properties in Word Embeddings
Authors	Peng Qian, Xipeng Qiu, Xuanjing Huang
Abstract
Tasks	Word Embeddings
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1140/
PDF	https://www.aclweb.org/anthology/P16-1140
PWC	https://paperswithcode.com/paper/investigating-language-universal-and-specific
Repo
Framework

A Study of Reuse and Plagiarism in LREC papers


Title	A Study of Reuse and Plagiarism in LREC papers
Authors	Gil Francopoulo, Joseph Mariani, Patrick Paroubek
Abstract	The aim of this experiment is to present an easy way to compare fragments of texts in order to detect (supposed) results of copy {&} paste operations between articles in the domain of Natural Language Processing (NLP). The search space of the comparisons is a corpus labeled as NLP4NLP gathering a large part of the NLP field. The study is centered on LREC papers in both directions, first with an LREC paper borrowing a fragment of text from the collection, and secondly in the reverse direction with fragments of LREC documents borrowed and inserted in the collection.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1298/
PDF	https://www.aclweb.org/anthology/L16-1298
PWC	https://paperswithcode.com/paper/a-study-of-reuse-and-plagiarism-in-lrec
Repo
Framework

Tweester at SemEval-2016 Task 4: Sentiment Analysis in Twitter Using Semantic-Affective Model Adaptation


Title	Tweester at SemEval-2016 Task 4: Sentiment Analysis in Twitter Using Semantic-Affective Model Adaptation
Authors	Elisavet Palogiannidi, Athanasia Kolovou, Fenia Christopoulou, Filippos Kokkinos, Elias Iosif, Mal, Nikolaos rakis, Haris Papageorgiou, Shrikanth Narayanan, Alex Potamianos, ros
Abstract
Tasks	Opinion Mining, Sentiment Analysis, Twitter Sentiment Analysis, Word Embeddings
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1023/
PDF	https://www.aclweb.org/anthology/S16-1023
PWC	https://paperswithcode.com/paper/tweester-at-semeval-2016-task-4-sentiment
Repo
Framework

The Role of Modifier and Head Properties in Predicting the Compositionality of English and German Noun-Noun Compounds: A Vector-Space Perspective


Title	The Role of Modifier and Head Properties in Predicting the Compositionality of English and German Noun-Noun Compounds: A Vector-Space Perspective
Authors	Sabine Schulte im Walde, Anna H{"a}tty, Stefan Bott
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/S16-2020/
PDF	https://www.aclweb.org/anthology/S16-2020
PWC	https://paperswithcode.com/paper/the-role-of-modifier-and-head-properties-in
Repo
Framework