May 5, 2019

1112 words 6 mins read

Paper Group NANR 75

Paper Group NANR 75

A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers. Fluency detection on communication networks. Multi-view Anomaly Detection via Robust Probabilistic Latent Variable Models. An Entity-Focused Approach to Generating Company Descriptions. Publishing the Trove Newspaper Corpus. Latin Vallex. A Treebank-based Semantic Val …

A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers

Title A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers
Authors Joseph Mariani, Gil Francopoulo, Patrick Paroubek
Abstract
Tasks Information Retrieval
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1509/
PDF https://www.aclweb.org/anthology/W16-1509
PWC https://paperswithcode.com/paper/a-study-of-reuse-and-plagiarism-in-speech-and
Repo
Framework

Fluency detection on communication networks

Title Fluency detection on communication networks
Authors Tom Lippincott, Benjamin Van Durme
Abstract
Tasks Language Identification, Part-Of-Speech Tagging
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1107/
PDF https://www.aclweb.org/anthology/D16-1107
PWC https://paperswithcode.com/paper/fluency-detection-on-communication-networks
Repo
Framework

Multi-view Anomaly Detection via Robust Probabilistic Latent Variable Models

Title Multi-view Anomaly Detection via Robust Probabilistic Latent Variable Models
Authors Tomoharu Iwata, Makoto Yamada
Abstract We propose probabilistic latent variable models for multi-view anomaly detection, which is the task of finding instances that have inconsistent views given multi-view data. With the proposed model, all views of a non-anomalous instance are assumed to be generated from a single latent vector. On the other hand, an anomalous instance is assumed to have multiple latent vectors, and its different views are generated from different latent vectors. By inferring the number of latent vectors used for each instance with Dirichlet process priors, we obtain multi-view anomaly scores. The proposed model can be seen as a robust extension of probabilistic canonical correlation analysis for noisy multi-view data. We present Bayesian inference procedures for the proposed model based on a stochastic EM algorithm. The effectiveness of the proposed model is demonstrated in terms of performance when detecting multi-view anomalies.
Tasks Anomaly Detection, Bayesian Inference, Latent Variable Models
Published 2016-12-01
URL http://papers.nips.cc/paper/6456-multi-view-anomaly-detection-via-robust-probabilistic-latent-variable-models
PDF http://papers.nips.cc/paper/6456-multi-view-anomaly-detection-via-robust-probabilistic-latent-variable-models.pdf
PWC https://paperswithcode.com/paper/multi-view-anomaly-detection-via-robust
Repo
Framework

An Entity-Focused Approach to Generating Company Descriptions

Title An Entity-Focused Approach to Generating Company Descriptions
Authors Gavin Saldanha, Or Biran, Kathleen McKeown, Alfio Gliozzo
Abstract
Tasks Document Summarization, Multi-Document Summarization
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-2040/
PDF https://www.aclweb.org/anthology/P16-2040
PWC https://paperswithcode.com/paper/an-entity-focused-approach-to-generating
Repo
Framework

Publishing the Trove Newspaper Corpus

Title Publishing the Trove Newspaper Corpus
Authors Steve Cassidy
Abstract The Trove Newspaper Corpus is derived from the National Library of Australia{'}s digital archive of newspaper text. The corpus is a snapshot of the NLA collection taken in 2015 to be made available for language research as part of the Alveo Virtual Laboratory and contains 143 million articles dating from 1806 to 2007. This paper describes the work we have done to make this large corpus available as a research collection, facilitating access to individual documents and enabling large scale processing of the newspaper text in a cloud-based environment.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1715/
PDF https://www.aclweb.org/anthology/L16-1715
PWC https://paperswithcode.com/paper/publishing-the-trove-newspaper-corpus
Repo
Framework

Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin

Title Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin
Authors Marco Passarotti, Berta Gonz{'a}lez Saavedra, Christophe Onambele
Abstract Despite a centuries-long tradition in lexicography, Latin lacks state-of-the-art computational lexical resources. This situation is strictly related to the still quite limited amount of linguistically annotated textual data for Latin, which can help the building of new lexical resources by supporting them with empirical evidence. However, projects for creating new language resources for Latin have been launched over the last decade to fill this gap. In this paper, we present Latin Vallex, a valency lexicon for Latin built in mutual connection with the semantic and pragmatic annotation of two Latin treebanks featuring texts of different eras. On the one hand, such a connection between the empirical evidence provided by the treebanks and the lexicon allows to enhance each frame entry in the lexicon with its frequency in real data. On the other hand, each valency-capable word in the treebanks is linked to a frame entry in the lexicon.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1414/
PDF https://www.aclweb.org/anthology/L16-1414
PWC https://paperswithcode.com/paper/latin-vallex-a-treebank-based-semantic
Repo
Framework

Pictogrammar: an AAC device based on a semantic grammar

Title Pictogrammar: an AAC device based on a semantic grammar
Authors Fern Mart{'\i}nez-Santiago, o, Miguel {'A}ngel Garc{'\i}a-Cumbreras, Arturo Montejo-R{'a}ez, Manuel Carlos D{'\i}az-Galiano
Abstract
Tasks Language Acquisition
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0516/
PDF https://www.aclweb.org/anthology/W16-0516
PWC https://paperswithcode.com/paper/pictogrammar-an-aac-device-based-on-a
Repo
Framework

Combined Tree Kernel-based classifiers for Assessing Quality of Scientific Text

Title Combined Tree Kernel-based classifiers for Assessing Quality of Scientific Text
Authors Liliana Mamani Sanchez, Hector-Hugo Franco-Penya
Abstract
Tasks Grammatical Error Correction
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0525/
PDF https://www.aclweb.org/anthology/W16-0525
PWC https://paperswithcode.com/paper/combined-tree-kernel-based-classifiers-for
Repo
Framework

Exploring the Intersection of Short Answer Assessment, Authorship Attribution, and Plagiarism Detection

Title Exploring the Intersection of Short Answer Assessment, Authorship Attribution, and Plagiarism Detection
Authors Bj{"o}rn Rudzewitz
Abstract
Tasks Reading Comprehension
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0527/
PDF https://www.aclweb.org/anthology/W16-0527
PWC https://paperswithcode.com/paper/exploring-the-intersection-of-short-answer
Repo
Framework

Candidate re-ranking for SMT-based grammatical error correction

Title Candidate re-ranking for SMT-based grammatical error correction
Authors Zheng Yuan, Ted Briscoe, Mariano Felice
Abstract
Tasks Grammatical Error Correction, Machine Translation
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0530/
PDF https://www.aclweb.org/anthology/W16-0530
PWC https://paperswithcode.com/paper/candidate-re-ranking-for-smt-based
Repo
Framework

Experiments on bridging across languages and genres

Title Experiments on bridging across languages and genres
Authors Yulia Grishina
Abstract
Tasks Coreference Resolution
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0702/
PDF https://www.aclweb.org/anthology/W16-0702
PWC https://paperswithcode.com/paper/experiments-on-bridging-across-languages-and
Repo
Framework

Exploring the steps of Verb Phrase Ellipsis

Title Exploring the steps of Verb Phrase Ellipsis
Authors Zhengzhong Liu, Edgar Gonz{`a}lez Pellicer, Daniel Gillick
Abstract
Tasks Boundary Detection, Coreference Resolution
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0705/
PDF https://www.aclweb.org/anthology/W16-0705
PWC https://paperswithcode.com/paper/exploring-the-steps-of-verb-phrase-ellipsis
Repo
Framework

How to Handle Split Antecedents in Tamil?

Title How to Handle Split Antecedents in Tamil?
Authors Vijay Sundar Ram, Sobha Lalitha Devi
Abstract
Tasks Coreference Resolution
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0712/
PDF https://www.aclweb.org/anthology/W16-0712
PWC https://paperswithcode.com/paper/how-to-handle-split-antecedents-in-tamil
Repo
Framework

Identifying Sensible Participants in Online Discussions

Title Identifying Sensible Participants in Online Discussions
Authors Siddharth Jain
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/W16-6207/
PDF https://www.aclweb.org/anthology/W16-6207
PWC https://paperswithcode.com/paper/identifying-sensible-participants-in-online
Repo
Framework

Personality Estimation from Japanese Text

Title Personality Estimation from Japanese Text
Authors Koichi Kamijo, Tetsuya Nasukawa, Hideya Kitamura
Abstract We created a model to estimate personality trait from authors{'} text written in Japanese and measured its performance by conducting surveys and analyzing the Twitter data of 1,630 users. We used the Big Five personality traits for personality trait estimation. Our approach is a combination of category- and Word2Vec-based approaches. For the category-based element, we added several unique Japanese categories along with the ones regularly used in the English model, and for the Word2Vec-based element, we used a model called GloVe. We found that some of the newly added categories have a stronger correlation with personality traits than other categories do and that the combination of the category- and Word2Vec-based approaches improves the accuracy of the personality trait estimation compared with the case of using just one of them.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4311/
PDF https://www.aclweb.org/anthology/W16-4311
PWC https://paperswithcode.com/paper/personality-estimation-from-japanese-text
Repo
Framework
comments powered by Disqus