July 26, 2019

1721 words 9 mins read

Paper Group NANR 171

Variation Autoencoder Based Network Representation Learning for Classification. Enabling Transitivity for Lexical Inference on Chinese Verbs Using Probabilistic Soft Logic. Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison. Cross-lingual tagger evaluation without test data. Annotating errors in student texts: First …

Variation Autoencoder Based Network Representation Learning for Classification


Title	Variation Autoencoder Based Network Representation Learning for Classification
Authors	Hang Li, Haozheng Wang, Zhenglu Yang, Masato Odagaki
Abstract
Tasks	Document Classification, Information Retrieval, Recommendation Systems, Representation Learning
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-3010/
PDF	https://www.aclweb.org/anthology/P17-3010
PWC	https://paperswithcode.com/paper/variation-autoencoder-based-network
Repo
Framework

Enabling Transitivity for Lexical Inference on Chinese Verbs Using Probabilistic Soft Logic


Title	Enabling Transitivity for Lexical Inference on Chinese Verbs Using Probabilistic Soft Logic
Authors	Wei-Chung Wang, Lun-Wei Ku
Abstract	To learn more knowledge, enabling transitivity is a vital step for lexical inference. However, most of the lexical inference models with good performance are for nouns or noun phrases, which cannot be directly applied to the inference on events or states. In this paper, we construct the largest Chinese verb lexical inference dataset containing 18,029 verb pairs, where for each pair one of four inference relations are annotated. We further build a probabilistic soft logic (PSL) model to infer verb lexicons using the logic language. With PSL, we easily enable transitivity in two layers, the observed layer and the feature layer, which are included in the knowledge base. We further discuss the effect of transitives within and between these layers. Results show the performance of the proposed PSL model can be improved at least 3.5{%} (relative) when the transitivity is enabled. Furthermore, experiments show that enabling transitivity in the observed layer benefits the most.
Tasks	Natural Language Inference, Open-Domain Question Answering, Question Answering, Text Generation
Published	2017-11-01
URL	https://www.aclweb.org/anthology/I17-1012/
PDF	https://www.aclweb.org/anthology/I17-1012
PWC	https://paperswithcode.com/paper/enabling-transitivity-for-lexical-inference
Repo
Framework

Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison


Title	Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison
Authors	Aless Raganato, ro, Jose Camacho-Collados, Roberto Navigli
Abstract	Word Sense Disambiguation is a long-standing task in Natural Language Processing, lying at the core of human language understanding. However, the evaluation of automatic systems has been problematic, mainly due to the lack of a reliable evaluation framework. In this paper we develop a unified evaluation framework and analyze the performance of various Word Sense Disambiguation systems in a fair setup. The results show that supervised systems clearly outperform knowledge-based models. Among the supervised systems, a linear classifier trained on conventional local features still proves to be a hard baseline to beat. Nonetheless, recent approaches exploiting neural networks on unlabeled corpora achieve promising results, surpassing this hard baseline in most test sets.
Tasks	Word Sense Disambiguation
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-1010/
PDF	https://www.aclweb.org/anthology/E17-1010
PWC	https://paperswithcode.com/paper/word-sense-disambiguation-a-unified
Repo
Framework

Cross-lingual tagger evaluation without test data


Title	Cross-lingual tagger evaluation without test data
Authors	{\v{Z}}eljko Agi{'c}, Barbara Plank, Anders S{\o}gaard
Abstract	We address the challenge of cross-lingual POS tagger evaluation in absence of manually annotated test data. We put forth and evaluate two dictionary-based metrics. On the tasks of accuracy prediction and system ranking, we reveal that these metrics are reliable enough to approximate test set-based evaluation, and at the same time lean enough to support assessment for truly low-resource languages.
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-2040/
PDF	https://www.aclweb.org/anthology/E17-2040
PWC	https://paperswithcode.com/paper/cross-lingual-tagger-evaluation-without-test
Repo
Framework

Annotating errors in student texts: First experiences and experiments


Title	Annotating errors in student texts: First experiences and experiments
Authors	Sara Stymne, Eva Pettersson, Be{'a}ta Megyesi, Anne Palm{'e}r
Abstract
Tasks	Language Acquisition, Spelling Correction
Published	2017-05-01
URL	https://www.aclweb.org/anthology/W17-0306/
PDF	https://www.aclweb.org/anthology/W17-0306
PWC	https://paperswithcode.com/paper/annotating-errors-in-student-texts-first
Repo
Framework

OCR and post-correction of historical Finnish texts


Title	OCR and post-correction of historical Finnish texts
Authors	Senka Drobac, Pekka Kauppinen, Krister Lind{'e}n
Abstract
Tasks	Optical Character Recognition, Spelling Correction
Published	2017-05-01
URL	https://www.aclweb.org/anthology/W17-0209/
PDF	https://www.aclweb.org/anthology/W17-0209
PWC	https://paperswithcode.com/paper/ocr-and-post-correction-of-historical-finnish
Repo
Framework

Using Twitter Language to Predict the Real Estate Market


Title	Using Twitter Language to Predict the Real Estate Market
Authors	Mohammadzaman Zamani, H. Andrew Schwartz
Abstract	We explore whether social media can provide a window into community real estate -foreclosure rates and price changes- beyond that of traditional economic and demographic variables. We find language use in Twitter not only predicts real estate outcomes as well as traditional variables across counties, but that including Twitter language in traditional models leads to a significant improvement (e.g. from Pearson r = :50 to r = :59 for price changes). We overcome the challenge of the relative sparsity and noise in Twitter language variables by showing that training on the residual error of the traditional models leads to more accurate overall assessments. Finally, we discover that it is Twitter language related to business (e.g. {`}company{'}, {`}marketing{'}) and technology (e.g. {`}technology{'}, {`}internet{'}), among others, that yield predictive power over economics.
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-2005/
PDF	https://www.aclweb.org/anthology/E17-2005
PWC	https://paperswithcode.com/paper/using-twitter-language-to-predict-the-real
Repo
Framework

The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts


Title	The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts
Authors	Rachele Sprugnoli, Tommaso Caselli, Sara Tonelli, Giovanni Moretti
Abstract	This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.
Tasks	Sentiment Analysis
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-2042/
PDF	https://www.aclweb.org/anthology/E17-2042
PWC	https://paperswithcode.com/paper/the-content-types-dataset-a-new-resource-to
Repo
Framework

Using Gaze to Predict Text Readability


Title	Using Gaze to Predict Text Readability
Authors	Ana Valeria Gonz{'a}lez-Gardu{~n}o, Anders S{\o}gaard
Abstract	We show that text readability prediction improves significantly from hard parameter sharing with models predicting first pass duration, total fixation duration and regression duration. Specifically, we induce multi-task Multilayer Perceptrons and Logistic Regression models over sentence representations that capture various aggregate statistics, from two different text readability corpora for English, as well as the Dundee eye-tracking corpus. Our approach leads to significant improvements over Single task learning and over previous systems. In addition, our improvements are consistent across train sample sizes, making our approach especially applicable to small datasets.
Tasks	Eye Tracking, Machine Translation, Multi-Task Learning, Text Generation, Text Simplification
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-5050/
PDF	https://www.aclweb.org/anthology/W17-5050
PWC	https://paperswithcode.com/paper/using-gaze-to-predict-text-readability
Repo
Framework

Projection of Argumentative Corpora from Source to Target Languages


Title	Projection of Argumentative Corpora from Source to Target Languages
Authors	Ahmet Aker, Huangpan Zhang
Abstract	Argumentative corpora are costly to create and are available in only few languages with English dominating the area. In this paper we release the first publicly available Mandarin argumentative corpus. The corpus is created by exploiting the idea of comparable corpora from Statistical Machine Translation. We use existing corpora in English and manually map the claims and premises to comparable corpora in Mandarin. We also implement a simple solution to automate this approach with the view of creating argumentative corpora in other less-resourced languages. In this way we introduce a new task of multi-lingual argument mapping that can be evaluated using our English-Mandarin argumentative corpus. The preliminary results of our automatic argument mapper mirror the simplicity of our approach, but provide a baseline for further improvements.
Tasks	Argument Mining, Machine Translation
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-5108/
PDF	https://www.aclweb.org/anthology/W17-5108
PWC	https://paperswithcode.com/paper/projection-of-argumentative-corpora-from
Repo
Framework

TWINE: A real-time system for TWeet analysis via INformation Extraction


Title	TWINE: A real-time system for TWeet analysis via INformation Extraction
Authors	Debora Nozza, Fausto Ristagno, Matteo Palmonari, Elisabetta Fersini, Manch, Pikakshi a, Enza Messina
Abstract	In the recent years, the amount of user generated contents shared on the Web has significantly increased, especially in social media environment, e.g. Twitter, Facebook, Google+. This large quantity of data has generated the need of reactive and sophisticated systems for capturing and understanding the underlying information enclosed in them. In this paper we present TWINE, a real-time system for the big data analysis and exploration of information extracted from Twitter streams. The proposed system based on a Named Entity Recognition and Linking pipeline and a multi-dimensional spatial geo-localization is managed by a scalable and flexible architecture for an interactive visualization of micropost streams insights. The demo is available at \url{http://twine-mind.cloudapp.net/streaming}.
Tasks	Named Entity Recognition
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-3007/
PDF	https://www.aclweb.org/anthology/E17-3007
PWC	https://paperswithcode.com/paper/twine-a-real-time-system-for-tweet-analysis
Repo
Framework

Don’t Stop Me Now! Using Global Dynamic Oracles to Correct Training Biases of Transition-Based Dependency Parsers


Title	Don’t Stop Me Now! Using Global Dynamic Oracles to Correct Training Biases of Transition-Based Dependency Parsers
Authors	Lauriane Aufrant, Guillaume Wisniewski, Fran{\c{c}}ois Yvon
Abstract	This paper formalizes a sound extension of dynamic oracles to global training, in the frame of transition-based dependency parsers. By dispensing with the pre-computation of references, this extension widens the training strategies that can be entertained for such parsers; we show this by revisiting two standard training procedures, early-update and max-violation, to correct some of their search space sampling biases. Experimentally, on the SPMRL treebanks, this improvement increases the similarity between the train and test distributions and yields performance improvements up to 0.7 UAS, without any computation overhead.
Tasks	Active Learning, Dependency Parsing
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-2051/
PDF	https://www.aclweb.org/anthology/E17-2051
PWC	https://paperswithcode.com/paper/dont-stop-me-now-using-global-dynamic-oracles
Repo
Framework

Can We Create a Tool for General Domain Event Analysis?


Title	Can We Create a Tool for General Domain Event Analysis?
Authors	Siim Orasmaa, Heiki-Jaan Kaalep
Abstract
Tasks	Morphological Analysis, Question Answering
Published	2017-05-01
URL	https://www.aclweb.org/anthology/W17-0222/
PDF	https://www.aclweb.org/anthology/W17-0222
PWC	https://paperswithcode.com/paper/can-we-create-a-tool-for-general-domain-event
Repo
Framework

The Lemlat 3.0 Package for Morphological Analysis of Latin


Title	The Lemlat 3.0 Package for Morphological Analysis of Latin
Authors	Marco Passarotti, Marco Budassi, Eleonora Litta, Paolo Ruffolo
Abstract
Tasks	Morphological Analysis
Published	2017-05-01
URL	https://www.aclweb.org/anthology/W17-0506/
PDF	https://www.aclweb.org/anthology/W17-0506
PWC	https://paperswithcode.com/paper/the-lemlat-30-package-for-morphological
Repo
Framework

The arText prototype: An automatic system for writing specialized texts


Title	The arText prototype: An automatic system for writing specialized texts
Authors	Iria da Cunha, M. Amor Montan{'e}, Luis Hysa
Abstract	This article describes an automatic system for writing specialized texts in Spanish. The arText prototype is a free online text editor that includes different types of linguistic information. It is designed for a variety of end users and domains, including specialists and university students working in the fields of medicine and tourism, and laypersons writing to the public administration. ArText provides guidance on how to structure a text, prompts users to include all necessary contents in each section, and detects lexical and discourse problems in the text.
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-3015/
PDF	https://www.aclweb.org/anthology/E17-3015
PWC	https://paperswithcode.com/paper/the-artext-prototype-an-automatic-system-for
Repo
Framework