Paper Group NANR 75
A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers
Title | A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers |
Authors | Joseph Mariani, Gil Francopoulo, Patrick Paroubek |
Abstract | |
Tasks | Information Retrieval |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-1509/ |
https://www.aclweb.org/anthology/W16-1509 | |
PWC | https://paperswithcode.com/paper/a-study-of-reuse-and-plagiarism-in-speech-and |
Repo | |
Framework | |
Fluency detection on communication networks
Title | Fluency detection on communication networks |
Authors | Tom Lippincott, Benjamin Van Durme |
Abstract | |
Tasks | Language Identification, Part-Of-Speech Tagging |
Published | 2016-11-01 |
URL | https://www.aclweb.org/anthology/D16-1107/ |
https://www.aclweb.org/anthology/D16-1107 | |
PWC | https://paperswithcode.com/paper/fluency-detection-on-communication-networks |
Repo | |
Framework | |
Multi-view Anomaly Detection via Robust Probabilistic Latent Variable Models
Title | Multi-view Anomaly Detection via Robust Probabilistic Latent Variable Models |
Authors | Tomoharu Iwata, Makoto Yamada |
Abstract | We propose probabilistic latent variable models for multi-view anomaly detection, which is the task of finding instances that have inconsistent views given multi-view data. With the proposed model, all views of a non-anomalous instance are assumed to be generated from a single latent vector. On the other hand, an anomalous instance is assumed to have multiple latent vectors, and its different views are generated from different latent vectors. By inferring the number of latent vectors used for each instance with Dirichlet process priors, we obtain multi-view anomaly scores. The proposed model can be seen as a robust extension of probabilistic canonical correlation analysis for noisy multi-view data. We present Bayesian inference procedures for the proposed model based on a stochastic EM algorithm. The effectiveness of the proposed model is demonstrated in terms of performance when detecting multi-view anomalies. |
Tasks | Anomaly Detection, Bayesian Inference, Latent Variable Models |
Published | 2016-12-01 |
URL | http://papers.nips.cc/paper/6456-multi-view-anomaly-detection-via-robust-probabilistic-latent-variable-models |
http://papers.nips.cc/paper/6456-multi-view-anomaly-detection-via-robust-probabilistic-latent-variable-models.pdf | |
PWC | https://paperswithcode.com/paper/multi-view-anomaly-detection-via-robust |
Repo | |
Framework | |
An Entity-Focused Approach to Generating Company Descriptions
Title | An Entity-Focused Approach to Generating Company Descriptions |
Authors | Gavin Saldanha, Or Biran, Kathleen McKeown, Alfio Gliozzo |
Abstract | |
Tasks | Document Summarization, Multi-Document Summarization |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/P16-2040/ |
https://www.aclweb.org/anthology/P16-2040 | |
PWC | https://paperswithcode.com/paper/an-entity-focused-approach-to-generating |
Repo | |
Framework | |
Publishing the Trove Newspaper Corpus
Title | Publishing the Trove Newspaper Corpus |
Authors | Steve Cassidy |
Abstract | The Trove Newspaper Corpus is derived from the National Library of Australia's digital archive of newspaper text. The corpus is a snapshot of the NLA collection taken in 2015 to be made available for language research as part of the Alveo Virtual Laboratory and contains 143 million articles dating from 1806 to 2007. This paper describes the work we have done to make this large corpus available as a research collection, facilitating access to individual documents and enabling large scale processing of the newspaper text in a cloud-based environment. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1715/ |
https://www.aclweb.org/anthology/L16-1715 | |
PWC | https://paperswithcode.com/paper/publishing-the-trove-newspaper-corpus |
Repo | |
Framework | |
Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin
Title | Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin |
Authors | Marco Passarotti, Berta González Saavedra, Christophe Onambele |
Abstract | Despite a centuries-long tradition in lexicography, Latin lacks state-of-the-art computational lexical resources. This situation is strictly related to the still quite limited amount of linguistically annotated textual data for Latin, which can help the building of new lexical resources by supporting them with empirical evidence. However, projects for creating new language resources for Latin have been launched over the last decade to fill this gap. In this paper, we present Latin Vallex, a valency lexicon for Latin built in mutual connection with the semantic and pragmatic annotation of two Latin treebanks featuring texts of different eras. On the one hand, such a connection between the empirical evidence provided by the treebanks and the lexicon allows to enhance each frame entry in the lexicon with its frequency in real data. On the other hand, each valency-capable word in the treebanks is linked to a frame entry in the lexicon. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1414/ |
https://www.aclweb.org/anthology/L16-1414 | |
PWC | https://paperswithcode.com/paper/latin-vallex-a-treebank-based-semantic |
Repo | |
Framework | |
Pictogrammar: an AAC device based on a semantic grammar
Title | Pictogrammar: an AAC device based on a semantic grammar |
Authors | Fernando Martínez-Santiago, Miguel Ángel García-Cumbreras, Arturo Montejo-Ráez, Manuel Carlos Díaz-Galiano |
Abstract | |
Tasks | Language Acquisition |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0516/ |
https://www.aclweb.org/anthology/W16-0516 | |
PWC | https://paperswithcode.com/paper/pictogrammar-an-aac-device-based-on-a |
Repo | |
Framework | |
Combined Tree Kernel-based classifiers for Assessing Quality of Scientific Text
Title | Combined Tree Kernel-based classifiers for Assessing Quality of Scientific Text |
Authors | Liliana Mamani Sanchez, Hector-Hugo Franco-Penya |
Abstract | |
Tasks | Grammatical Error Correction |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0525/ |
https://www.aclweb.org/anthology/W16-0525 | |
PWC | https://paperswithcode.com/paper/combined-tree-kernel-based-classifiers-for |
Repo | |
Framework | |
Exploring the Intersection of Short Answer Assessment, Authorship Attribution, and Plagiarism Detection
Title | Exploring the Intersection of Short Answer Assessment, Authorship Attribution, and Plagiarism Detection |
Authors | Björn Rudzewitz |
Abstract | |
Tasks | Reading Comprehension |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0527/ |
https://www.aclweb.org/anthology/W16-0527 | |
PWC | https://paperswithcode.com/paper/exploring-the-intersection-of-short-answer |
Repo | |
Framework | |
Candidate re-ranking for SMT-based grammatical error correction
Title | Candidate re-ranking for SMT-based grammatical error correction |
Authors | Zheng Yuan, Ted Briscoe, Mariano Felice |
Abstract | |
Tasks | Grammatical Error Correction, Machine Translation |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0530/ |
https://www.aclweb.org/anthology/W16-0530 | |
PWC | https://paperswithcode.com/paper/candidate-re-ranking-for-smt-based |
Repo | |
Framework | |
Experiments on bridging across languages and genres
Title | Experiments on bridging across languages and genres |
Authors | Yulia Grishina |
Abstract | |
Tasks | Coreference Resolution |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0702/ |
https://www.aclweb.org/anthology/W16-0702 | |
PWC | https://paperswithcode.com/paper/experiments-on-bridging-across-languages-and |
Repo | |
Framework | |
Exploring the steps of Verb Phrase Ellipsis
Title | Exploring the steps of Verb Phrase Ellipsis |
Authors | Zhengzhong Liu, Edgar Gonzàlez Pellicer, Daniel Gillick |
Abstract | |
Tasks | Boundary Detection, Coreference Resolution |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0705/ |
https://www.aclweb.org/anthology/W16-0705 | |
PWC | https://paperswithcode.com/paper/exploring-the-steps-of-verb-phrase-ellipsis |
Repo | |
Framework | |
How to Handle Split Antecedents in Tamil?
Title | How to Handle Split Antecedents in Tamil? |
Authors | Vijay Sundar Ram, Sobha Lalitha Devi |
Abstract | |
Tasks | Coreference Resolution |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0712/ |
https://www.aclweb.org/anthology/W16-0712 | |
PWC | https://paperswithcode.com/paper/how-to-handle-split-antecedents-in-tamil |
Repo | |
Framework | |
Identifying Sensible Participants in Online Discussions
Title | Identifying Sensible Participants in Online Discussions |
Authors | Siddharth Jain |
Abstract | |
Tasks | |
Published | 2016-11-01 |
URL | https://www.aclweb.org/anthology/W16-6207/ |
https://www.aclweb.org/anthology/W16-6207 | |
PWC | https://paperswithcode.com/paper/identifying-sensible-participants-in-online |
Repo | |
Framework | |
Personality Estimation from Japanese Text
Title | Personality Estimation from Japanese Text |
Authors | Koichi Kamijo, Tetsuya Nasukawa, Hideya Kitamura |
Abstract | We created a model to estimate personality trait from authors' text written in Japanese and measured its performance by conducting surveys and analyzing the Twitter data of 1,630 users. We used the Big Five personality traits for personality trait estimation. Our approach is a combination of category- and Word2Vec-based approaches. For the category-based element, we added several unique Japanese categories along with the ones regularly used in the English model, and for the Word2Vec-based element, we used a model called GloVe. We found that some of the newly added categories have a stronger correlation with personality traits than other categories do and that the combination of the category- and Word2Vec-based approaches improves the accuracy of the personality trait estimation compared with the case of using just one of them. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4311/ |
https://www.aclweb.org/anthology/W16-4311 | |
PWC | https://paperswithcode.com/paper/personality-estimation-from-japanese-text |
Repo | |
Framework | |