July 26, 2019

1721 words 9 mins read

Paper Group NANR 171

Paper Group NANR 171

Variation Autoencoder Based Network Representation Learning for Classification. Enabling Transitivity for Lexical Inference on Chinese Verbs Using Probabilistic Soft Logic. Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison. Cross-lingual tagger evaluation without test data. Annotating errors in student texts: First …

Variation Autoencoder Based Network Representation Learning for Classification

Title Variation Autoencoder Based Network Representation Learning for Classification
Authors Hang Li, Haozheng Wang, Zhenglu Yang, Masato Odagaki
Abstract
Tasks Document Classification, Information Retrieval, Recommendation Systems, Representation Learning
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-3010/
PDF https://www.aclweb.org/anthology/P17-3010
PWC https://paperswithcode.com/paper/variation-autoencoder-based-network
Repo
Framework

Enabling Transitivity for Lexical Inference on Chinese Verbs Using Probabilistic Soft Logic

Title Enabling Transitivity for Lexical Inference on Chinese Verbs Using Probabilistic Soft Logic
Authors Wei-Chung Wang, Lun-Wei Ku
Abstract To learn more knowledge, enabling transitivity is a vital step for lexical inference. However, most of the lexical inference models with good performance are for nouns or noun phrases, which cannot be directly applied to the inference on events or states. In this paper, we construct the largest Chinese verb lexical inference dataset containing 18,029 verb pairs, where for each pair one of four inference relations are annotated. We further build a probabilistic soft logic (PSL) model to infer verb lexicons using the logic language. With PSL, we easily enable transitivity in two layers, the observed layer and the feature layer, which are included in the knowledge base. We further discuss the effect of transitives within and between these layers. Results show the performance of the proposed PSL model can be improved at least 3.5{%} (relative) when the transitivity is enabled. Furthermore, experiments show that enabling transitivity in the observed layer benefits the most.
Tasks Natural Language Inference, Open-Domain Question Answering, Question Answering, Text Generation
Published 2017-11-01
URL https://www.aclweb.org/anthology/I17-1012/
PDF https://www.aclweb.org/anthology/I17-1012
PWC https://paperswithcode.com/paper/enabling-transitivity-for-lexical-inference
Repo
Framework

Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison

Title Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison
Authors Aless Raganato, ro, Jose Camacho-Collados, Roberto Navigli
Abstract Word Sense Disambiguation is a long-standing task in Natural Language Processing, lying at the core of human language understanding. However, the evaluation of automatic systems has been problematic, mainly due to the lack of a reliable evaluation framework. In this paper we develop a unified evaluation framework and analyze the performance of various Word Sense Disambiguation systems in a fair setup. The results show that supervised systems clearly outperform knowledge-based models. Among the supervised systems, a linear classifier trained on conventional local features still proves to be a hard baseline to beat. Nonetheless, recent approaches exploiting neural networks on unlabeled corpora achieve promising results, surpassing this hard baseline in most test sets.
Tasks Word Sense Disambiguation
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-1010/
PDF https://www.aclweb.org/anthology/E17-1010
PWC https://paperswithcode.com/paper/word-sense-disambiguation-a-unified
Repo
Framework

Cross-lingual tagger evaluation without test data

Title Cross-lingual tagger evaluation without test data
Authors {\v{Z}}eljko Agi{'c}, Barbara Plank, Anders S{\o}gaard
Abstract We address the challenge of cross-lingual POS tagger evaluation in absence of manually annotated test data. We put forth and evaluate two dictionary-based metrics. On the tasks of accuracy prediction and system ranking, we reveal that these metrics are reliable enough to approximate test set-based evaluation, and at the same time lean enough to support assessment for truly low-resource languages.
Tasks
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-2040/
PDF https://www.aclweb.org/anthology/E17-2040
PWC https://paperswithcode.com/paper/cross-lingual-tagger-evaluation-without-test
Repo
Framework

Annotating errors in student texts: First experiences and experiments

Title Annotating errors in student texts: First experiences and experiments
Authors Sara Stymne, Eva Pettersson, Be{'a}ta Megyesi, Anne Palm{'e}r
Abstract
Tasks Language Acquisition, Spelling Correction
Published 2017-05-01
URL https://www.aclweb.org/anthology/W17-0306/
PDF https://www.aclweb.org/anthology/W17-0306
PWC https://paperswithcode.com/paper/annotating-errors-in-student-texts-first
Repo
Framework

OCR and post-correction of historical Finnish texts

Title OCR and post-correction of historical Finnish texts
Authors Senka Drobac, Pekka Kauppinen, Krister Lind{'e}n
Abstract
Tasks Optical Character Recognition, Spelling Correction
Published 2017-05-01
URL https://www.aclweb.org/anthology/W17-0209/
PDF https://www.aclweb.org/anthology/W17-0209
PWC https://paperswithcode.com/paper/ocr-and-post-correction-of-historical-finnish
Repo
Framework

Using Twitter Language to Predict the Real Estate Market

Title Using Twitter Language to Predict the Real Estate Market
Authors Mohammadzaman Zamani, H. Andrew Schwartz
Abstract We explore whether social media can provide a window into community real estate -foreclosure rates and price changes- beyond that of traditional economic and demographic variables. We find language use in Twitter not only predicts real estate outcomes as well as traditional variables across counties, but that including Twitter language in traditional models leads to a significant improvement (e.g. from Pearson r = :50 to r = :59 for price changes). We overcome the challenge of the relative sparsity and noise in Twitter language variables by showing that training on the residual error of the traditional models leads to more accurate overall assessments. Finally, we discover that it is Twitter language related to business (e.g. {}company{'}, {}marketing{'}) and technology (e.g. {}technology{'}, {}internet{'}), among others, that yield predictive power over economics.
Tasks
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-2005/
PDF https://www.aclweb.org/anthology/E17-2005
PWC https://paperswithcode.com/paper/using-twitter-language-to-predict-the-real
Repo
Framework

The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts

Title The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts
Authors Rachele Sprugnoli, Tommaso Caselli, Sara Tonelli, Giovanni Moretti
Abstract This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.
Tasks Sentiment Analysis
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-2042/
PDF https://www.aclweb.org/anthology/E17-2042
PWC https://paperswithcode.com/paper/the-content-types-dataset-a-new-resource-to
Repo
Framework

Using Gaze to Predict Text Readability

Title Using Gaze to Predict Text Readability
Authors Ana Valeria Gonz{'a}lez-Gardu{~n}o, Anders S{\o}gaard
Abstract We show that text readability prediction improves significantly from hard parameter sharing with models predicting first pass duration, total fixation duration and regression duration. Specifically, we induce multi-task Multilayer Perceptrons and Logistic Regression models over sentence representations that capture various aggregate statistics, from two different text readability corpora for English, as well as the Dundee eye-tracking corpus. Our approach leads to significant improvements over Single task learning and over previous systems. In addition, our improvements are consistent across train sample sizes, making our approach especially applicable to small datasets.
Tasks Eye Tracking, Machine Translation, Multi-Task Learning, Text Generation, Text Simplification
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-5050/
PDF https://www.aclweb.org/anthology/W17-5050
PWC https://paperswithcode.com/paper/using-gaze-to-predict-text-readability
Repo
Framework

Projection of Argumentative Corpora from Source to Target Languages

Title Projection of Argumentative Corpora from Source to Target Languages
Authors Ahmet Aker, Huangpan Zhang
Abstract Argumentative corpora are costly to create and are available in only few languages with English dominating the area. In this paper we release the first publicly available Mandarin argumentative corpus. The corpus is created by exploiting the idea of comparable corpora from Statistical Machine Translation. We use existing corpora in English and manually map the claims and premises to comparable corpora in Mandarin. We also implement a simple solution to automate this approach with the view of creating argumentative corpora in other less-resourced languages. In this way we introduce a new task of multi-lingual argument mapping that can be evaluated using our English-Mandarin argumentative corpus. The preliminary results of our automatic argument mapper mirror the simplicity of our approach, but provide a baseline for further improvements.
Tasks Argument Mining, Machine Translation
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-5108/
PDF https://www.aclweb.org/anthology/W17-5108
PWC https://paperswithcode.com/paper/projection-of-argumentative-corpora-from
Repo
Framework

TWINE: A real-time system for TWeet analysis via INformation Extraction

Title TWINE: A real-time system for TWeet analysis via INformation Extraction
Authors Debora Nozza, Fausto Ristagno, Matteo Palmonari, Elisabetta Fersini, Manch, Pikakshi a, Enza Messina
Abstract In the recent years, the amount of user generated contents shared on the Web has significantly increased, especially in social media environment, e.g. Twitter, Facebook, Google+. This large quantity of data has generated the need of reactive and sophisticated systems for capturing and understanding the underlying information enclosed in them. In this paper we present TWINE, a real-time system for the big data analysis and exploration of information extracted from Twitter streams. The proposed system based on a Named Entity Recognition and Linking pipeline and a multi-dimensional spatial geo-localization is managed by a scalable and flexible architecture for an interactive visualization of micropost streams insights. The demo is available at \url{http://twine-mind.cloudapp.net/streaming}.
Tasks Named Entity Recognition
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-3007/
PDF https://www.aclweb.org/anthology/E17-3007
PWC https://paperswithcode.com/paper/twine-a-real-time-system-for-tweet-analysis
Repo
Framework

Don’t Stop Me Now! Using Global Dynamic Oracles to Correct Training Biases of Transition-Based Dependency Parsers

Title Don’t Stop Me Now! Using Global Dynamic Oracles to Correct Training Biases of Transition-Based Dependency Parsers
Authors Lauriane Aufrant, Guillaume Wisniewski, Fran{\c{c}}ois Yvon
Abstract This paper formalizes a sound extension of dynamic oracles to global training, in the frame of transition-based dependency parsers. By dispensing with the pre-computation of references, this extension widens the training strategies that can be entertained for such parsers; we show this by revisiting two standard training procedures, early-update and max-violation, to correct some of their search space sampling biases. Experimentally, on the SPMRL treebanks, this improvement increases the similarity between the train and test distributions and yields performance improvements up to 0.7 UAS, without any computation overhead.
Tasks Active Learning, Dependency Parsing
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-2051/
PDF https://www.aclweb.org/anthology/E17-2051
PWC https://paperswithcode.com/paper/dont-stop-me-now-using-global-dynamic-oracles
Repo
Framework

Can We Create a Tool for General Domain Event Analysis?

Title Can We Create a Tool for General Domain Event Analysis?
Authors Siim Orasmaa, Heiki-Jaan Kaalep
Abstract
Tasks Morphological Analysis, Question Answering
Published 2017-05-01
URL https://www.aclweb.org/anthology/W17-0222/
PDF https://www.aclweb.org/anthology/W17-0222
PWC https://paperswithcode.com/paper/can-we-create-a-tool-for-general-domain-event
Repo
Framework

The Lemlat 3.0 Package for Morphological Analysis of Latin

Title The Lemlat 3.0 Package for Morphological Analysis of Latin
Authors Marco Passarotti, Marco Budassi, Eleonora Litta, Paolo Ruffolo
Abstract
Tasks Morphological Analysis
Published 2017-05-01
URL https://www.aclweb.org/anthology/W17-0506/
PDF https://www.aclweb.org/anthology/W17-0506
PWC https://paperswithcode.com/paper/the-lemlat-30-package-for-morphological
Repo
Framework

The arText prototype: An automatic system for writing specialized texts

Title The arText prototype: An automatic system for writing specialized texts
Authors Iria da Cunha, M. Amor Montan{'e}, Luis Hysa
Abstract This article describes an automatic system for writing specialized texts in Spanish. The arText prototype is a free online text editor that includes different types of linguistic information. It is designed for a variety of end users and domains, including specialists and university students working in the fields of medicine and tourism, and laypersons writing to the public administration. ArText provides guidance on how to structure a text, prompts users to include all necessary contents in each section, and detects lexical and discourse problems in the text.
Tasks
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-3015/
PDF https://www.aclweb.org/anthology/E17-3015
PWC https://paperswithcode.com/paper/the-artext-prototype-an-automatic-system-for
Repo
Framework
comments powered by Disqus