May 5, 2019

1112 words 6 mins read

Paper Group NANR 75

A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers. Fluency detection on communication networks. Multi-view Anomaly Detection via Robust Probabilistic Latent Variable Models. An Entity-Focused Approach to Generating Company Descriptions. Publishing the Trove Newspaper Corpus. Latin Vallex. A Treebank-based Semantic Val …

A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers


Title	A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers
Authors	Joseph Mariani, Gil Francopoulo, Patrick Paroubek
Abstract
Tasks	Information Retrieval
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-1509/
PDF	https://www.aclweb.org/anthology/W16-1509
PWC	https://paperswithcode.com/paper/a-study-of-reuse-and-plagiarism-in-speech-and
Repo
Framework

Fluency detection on communication networks


Title	Fluency detection on communication networks
Authors	Tom Lippincott, Benjamin Van Durme
Abstract
Tasks	Language Identification, Part-Of-Speech Tagging
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1107/
PDF	https://www.aclweb.org/anthology/D16-1107
PWC	https://paperswithcode.com/paper/fluency-detection-on-communication-networks
Repo
Framework

Multi-view Anomaly Detection via Robust Probabilistic Latent Variable Models


Title	Multi-view Anomaly Detection via Robust Probabilistic Latent Variable Models
Authors	Tomoharu Iwata, Makoto Yamada
Abstract	We propose probabilistic latent variable models for multi-view anomaly detection, which is the task of finding instances that have inconsistent views given multi-view data. With the proposed model, all views of a non-anomalous instance are assumed to be generated from a single latent vector. On the other hand, an anomalous instance is assumed to have multiple latent vectors, and its different views are generated from different latent vectors. By inferring the number of latent vectors used for each instance with Dirichlet process priors, we obtain multi-view anomaly scores. The proposed model can be seen as a robust extension of probabilistic canonical correlation analysis for noisy multi-view data. We present Bayesian inference procedures for the proposed model based on a stochastic EM algorithm. The effectiveness of the proposed model is demonstrated in terms of performance when detecting multi-view anomalies.
Tasks	Anomaly Detection, Bayesian Inference, Latent Variable Models
Published	2016-12-01
URL	http://papers.nips.cc/paper/6456-multi-view-anomaly-detection-via-robust-probabilistic-latent-variable-models
PDF	http://papers.nips.cc/paper/6456-multi-view-anomaly-detection-via-robust-probabilistic-latent-variable-models.pdf
PWC	https://paperswithcode.com/paper/multi-view-anomaly-detection-via-robust
Repo
Framework

An Entity-Focused Approach to Generating Company Descriptions


Title	An Entity-Focused Approach to Generating Company Descriptions
Authors	Gavin Saldanha, Or Biran, Kathleen McKeown, Alfio Gliozzo
Abstract
Tasks	Document Summarization, Multi-Document Summarization
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-2040/
PDF	https://www.aclweb.org/anthology/P16-2040
PWC	https://paperswithcode.com/paper/an-entity-focused-approach-to-generating
Repo
Framework

Publishing the Trove Newspaper Corpus


Title	Publishing the Trove Newspaper Corpus
Authors	Steve Cassidy
Abstract	The Trove Newspaper Corpus is derived from the National Library of Australia{'}s digital archive of newspaper text. The corpus is a snapshot of the NLA collection taken in 2015 to be made available for language research as part of the Alveo Virtual Laboratory and contains 143 million articles dating from 1806 to 2007. This paper describes the work we have done to make this large corpus available as a research collection, facilitating access to individual documents and enabling large scale processing of the newspaper text in a cloud-based environment.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1715/
PDF	https://www.aclweb.org/anthology/L16-1715
PWC	https://paperswithcode.com/paper/publishing-the-trove-newspaper-corpus
Repo
Framework

Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin


Title	Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin
Authors	Marco Passarotti, Berta Gonz{'a}lez Saavedra, Christophe Onambele
Abstract	Despite a centuries-long tradition in lexicography, Latin lacks state-of-the-art computational lexical resources. This situation is strictly related to the still quite limited amount of linguistically annotated textual data for Latin, which can help the building of new lexical resources by supporting them with empirical evidence. However, projects for creating new language resources for Latin have been launched over the last decade to fill this gap. In this paper, we present Latin Vallex, a valency lexicon for Latin built in mutual connection with the semantic and pragmatic annotation of two Latin treebanks featuring texts of different eras. On the one hand, such a connection between the empirical evidence provided by the treebanks and the lexicon allows to enhance each frame entry in the lexicon with its frequency in real data. On the other hand, each valency-capable word in the treebanks is linked to a frame entry in the lexicon.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1414/
PDF	https://www.aclweb.org/anthology/L16-1414
PWC	https://paperswithcode.com/paper/latin-vallex-a-treebank-based-semantic
Repo
Framework

Pictogrammar: an AAC device based on a semantic grammar


Title	Pictogrammar: an AAC device based on a semantic grammar
Authors	Fern Mart{'\i}nez-Santiago, o, Miguel {'A}ngel Garc{'\i}a-Cumbreras, Arturo Montejo-R{'a}ez, Manuel Carlos D{'\i}az-Galiano
Abstract
Tasks	Language Acquisition
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0516/
PDF	https://www.aclweb.org/anthology/W16-0516
PWC	https://paperswithcode.com/paper/pictogrammar-an-aac-device-based-on-a
Repo
Framework

Combined Tree Kernel-based classifiers for Assessing Quality of Scientific Text


Title	Combined Tree Kernel-based classifiers for Assessing Quality of Scientific Text
Authors	Liliana Mamani Sanchez, Hector-Hugo Franco-Penya
Abstract
Tasks	Grammatical Error Correction
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0525/
PDF	https://www.aclweb.org/anthology/W16-0525
PWC	https://paperswithcode.com/paper/combined-tree-kernel-based-classifiers-for
Repo
Framework

Exploring the Intersection of Short Answer Assessment, Authorship Attribution, and Plagiarism Detection


Title	Exploring the Intersection of Short Answer Assessment, Authorship Attribution, and Plagiarism Detection
Authors	Bj{"o}rn Rudzewitz
Abstract
Tasks	Reading Comprehension
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0527/
PDF	https://www.aclweb.org/anthology/W16-0527
PWC	https://paperswithcode.com/paper/exploring-the-intersection-of-short-answer
Repo
Framework

Candidate re-ranking for SMT-based grammatical error correction


Title	Candidate re-ranking for SMT-based grammatical error correction
Authors	Zheng Yuan, Ted Briscoe, Mariano Felice
Abstract
Tasks	Grammatical Error Correction, Machine Translation
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0530/
PDF	https://www.aclweb.org/anthology/W16-0530
PWC	https://paperswithcode.com/paper/candidate-re-ranking-for-smt-based
Repo
Framework

Experiments on bridging across languages and genres


Title	Experiments on bridging across languages and genres
Authors	Yulia Grishina
Abstract
Tasks	Coreference Resolution
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0702/
PDF	https://www.aclweb.org/anthology/W16-0702
PWC	https://paperswithcode.com/paper/experiments-on-bridging-across-languages-and
Repo
Framework

Exploring the steps of Verb Phrase Ellipsis


Title	Exploring the steps of Verb Phrase Ellipsis
Authors	Zhengzhong Liu, Edgar Gonz{`a}lez Pellicer, Daniel Gillick
Abstract
Tasks	Boundary Detection, Coreference Resolution
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0705/
PDF	https://www.aclweb.org/anthology/W16-0705
PWC	https://paperswithcode.com/paper/exploring-the-steps-of-verb-phrase-ellipsis
Repo
Framework

How to Handle Split Antecedents in Tamil?


Title	How to Handle Split Antecedents in Tamil?
Authors	Vijay Sundar Ram, Sobha Lalitha Devi
Abstract
Tasks	Coreference Resolution
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0712/
PDF	https://www.aclweb.org/anthology/W16-0712
PWC	https://paperswithcode.com/paper/how-to-handle-split-antecedents-in-tamil
Repo
Framework

Identifying Sensible Participants in Online Discussions


Title	Identifying Sensible Participants in Online Discussions
Authors	Siddharth Jain
Abstract
Tasks
Published	2016-11-01
URL	https://www.aclweb.org/anthology/W16-6207/
PDF	https://www.aclweb.org/anthology/W16-6207
PWC	https://paperswithcode.com/paper/identifying-sensible-participants-in-online
Repo
Framework

Personality Estimation from Japanese Text


Title	Personality Estimation from Japanese Text
Authors	Koichi Kamijo, Tetsuya Nasukawa, Hideya Kitamura
Abstract	We created a model to estimate personality trait from authors{'} text written in Japanese and measured its performance by conducting surveys and analyzing the Twitter data of 1,630 users. We used the Big Five personality traits for personality trait estimation. Our approach is a combination of category- and Word2Vec-based approaches. For the category-based element, we added several unique Japanese categories along with the ones regularly used in the English model, and for the Word2Vec-based element, we used a model called GloVe. We found that some of the newly added categories have a stronger correlation with personality traits than other categories do and that the combination of the category- and Word2Vec-based approaches improves the accuracy of the personality trait estimation compared with the case of using just one of them.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4311/
PDF	https://www.aclweb.org/anthology/W16-4311
PWC	https://paperswithcode.com/paper/personality-estimation-from-japanese-text
Repo
Framework