Paper Group NANR 119
thecerealkiller at SemEval-2016 Task 4: Deep Learning based System for Classifying Sentiment of Tweets on Two Point Scale. What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems. Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing. Leveraging Entity Linking and Related Language …
thecerealkiller at SemEval-2016 Task 4: Deep Learning based System for Classifying Sentiment of Tweets on Two Point Scale
Title | thecerealkiller at SemEval-2016 Task 4: Deep Learning based System for Classifying Sentiment of Tweets on Two Point Scale |
Authors | Vikrant Yadav |
Abstract | |
Tasks | Sentiment Analysis, Word Embeddings |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/S16-1013/ |
https://www.aclweb.org/anthology/S16-1013 | |
PWC | https://paperswithcode.com/paper/thecerealkiller-at-semeval-2016-task-4-deep |
Repo | |
Framework | |
What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems
Title | What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems |
Authors | Emma Barker, Monica Paramita, Adam Funk, Emina Kurtic, Ahmet Aker, Jonathan Foster, Mark Hepple, Robert Gaizauskas |
Abstract | Automatic summarization of reader comments in on-line news is an extremely challenging task and a capability for which there is a clear need. Work to date has focussed on producing extractive summaries using well-known techniques imported from other areas of language processing. But are extractive summaries of comments what users really want? Do they support users in performing the sorts of tasks they are likely to want to perform with reader comments? In this paper we address these questions by doing three things. First, we offer a specification of one possible summary type for reader comment, based on an analysis of reader comment in terms of issues and viewpoints. Second, we define a task-based evaluation framework for reader comment summarization that allows summarization systems to be assessed in terms of how well they support users in a time-limited task of identifying issues and characterising opinion on issues in comments. Third, we describe a pilot evaluation in which we used the task-based evaluation framework to evaluate a prototype reader comment clustering and summarization system, demonstrating the viability of the evaluation framework and illustrating the sorts of insight such an evaluation affords. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1494/ |
https://www.aclweb.org/anthology/L16-1494 | |
PWC | https://paperswithcode.com/paper/whats-the-issue-here-task-based-evaluation-of |
Repo | |
Framework | |
Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing
Title | Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing |
Authors | Manuel Burghardt, Daniel Granvogl, Christian Wolff |
Abstract | Data acquisition in dialectology is typically a tedious task, as dialect samples of spoken language have to be collected via questionnaires or interviews. In this article, we suggest to use the {}web as a corpus{''} approach for dialectology. We present a case study that demonstrates how authentic language data for the Bavarian dialect (ISO 639-3:bar) can be collected automatically from the social network Facebook. We also show that Facebook can be used effectively as a crowdsourcing platform, where users are willing to translate dialect words collaboratively in order to create a common lexicon of their Bavarian dialect. Key insights from the case study are summarized as { }lessons learned{''}, together with suggestions for future enhancements of the lexicon creation approach. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1321/ |
https://www.aclweb.org/anthology/L16-1321 | |
PWC | https://paperswithcode.com/paper/creating-a-lexicon-of-bavarian-dialect-by |
Repo | |
Framework | |
Leveraging Entity Linking and Related Language Projection to Improve Name Transliteration
Title | Leveraging Entity Linking and Related Language Projection to Improve Name Transliteration |
Authors | Ying Lin, Xiaoman Pan, Aliya Deri, Heng Ji, Kevin Knight |
Abstract | |
Tasks | Entity Disambiguation, Entity Linking, Language Modelling, Machine Translation, Transliteration |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/papers/W16-2701/w16-2701 |
https://www.aclweb.org/anthology/W16-2701v2 | |
PWC | https://paperswithcode.com/paper/leveraging-entity-linking-and-related |
Repo | |
Framework | |
An LFG Account of Discontinuous Nominal Expressions
Title | An LFG Account of Discontinuous Nominal Expressions |
Authors | Liselotte Snijders |
Abstract | |
Tasks | |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0901/ |
https://www.aclweb.org/anthology/W16-0901 | |
PWC | https://paperswithcode.com/paper/an-lfg-account-of-discontinuous-nominal |
Repo | |
Framework | |
A Discourse-Annotated Corpus of Conjoined VPs
Title | A Discourse-Annotated Corpus of Conjoined VPs |
Authors | Bonnie Webber, Rashmi Prasad, Alan Lee, Aravind Joshi |
Abstract | |
Tasks | Common Sense Reasoning, Machine Translation, Question Answering |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-1704/ |
https://www.aclweb.org/anthology/W16-1704 | |
PWC | https://paperswithcode.com/paper/a-discourse-annotated-corpus-of-conjoined-vps |
Repo | |
Framework | |
Morphological Reinflection via Discriminative String Transduction
Title | Morphological Reinflection via Discriminative String Transduction |
Authors | Garrett Nicolai, Bradley Hauer, Adam St Arnaud, Grzegorz Kondrak |
Abstract | |
Tasks | Lemmatization |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-2005/ |
https://www.aclweb.org/anthology/W16-2005 | |
PWC | https://paperswithcode.com/paper/morphological-reinflection-via-discriminative |
Repo | |
Framework | |
Discovery of Treatments from Text Corpora
Title | Discovery of Treatments from Text Corpora |
Authors | Christian Fong, Justin Grimmer |
Abstract | |
Tasks | Topic Models |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/P16-1151/ |
https://www.aclweb.org/anthology/P16-1151 | |
PWC | https://paperswithcode.com/paper/discovery-of-treatments-from-text-corpora |
Repo | |
Framework | |
基於詞語分布均勻度的核心詞彙選擇 (A Study on Dispersion Measures for Core Vocabulary Compilation) [In Chinese]
Title | 基於詞語分布均勻度的核心詞彙選擇 (A Study on Dispersion Measures for Core Vocabulary Compilation) [In Chinese] |
Authors | Ming-Hong Bai, Jian-Cheng Wu, Ying-Ni Chien, Shu-Ling Huang, Ching-Lung Lin |
Abstract | |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/O16-3001/ |
https://www.aclweb.org/anthology/O16-3001 | |
PWC | https://paperswithcode.com/paper/ao14eeaaa-aaaoc-aea12e-a-study-on-dispersion |
Repo | |
Framework | |
Character Sequence Models for Colorful Words
Title | Character Sequence Models for Colorful Words |
Authors | Kazuya Kawakami, Chris Dyer, Bryan Routledge, Noah A. Smith |
Abstract | |
Tasks | Language Modelling |
Published | 2016-11-01 |
URL | https://www.aclweb.org/anthology/D16-1202/ |
https://www.aclweb.org/anthology/D16-1202 | |
PWC | https://paperswithcode.com/paper/character-sequence-models-for-colorful-words |
Repo | |
Framework | |
Name Translation based on Fine-grained Named Entity Recognition in a Single Language
Title | Name Translation based on Fine-grained Named Entity Recognition in a Single Language |
Authors | Kugatsu Sadamitsu, Itsumi Saito, Taichi Katayama, Hisako Asano, Yoshihiro Matsuo |
Abstract | We propose named entity abstraction methods with fine-grained named entity labels for improving statistical machine translation (SMT). The methods are based on a bilingual named entity recognizer that uses a monolingual named entity recognizer with transliteration. Through experiments, we demonstrate that incorporating fine-grained named entities into statistical machine translation improves the accuracy of SMT with more adequate granularity compared with the standard SMT, which is a non-named entity abstraction method. |
Tasks | Machine Translation, Named Entity Recognition, Transliteration |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1097/ |
https://www.aclweb.org/anthology/L16-1097 | |
PWC | https://paperswithcode.com/paper/name-translation-based-on-fine-grained-named |
Repo | |
Framework | |
Improving Patent Translation using Bilingual Term Extraction and Re-tokenization for Chinese–Japanese
Title | Improving Patent Translation using Bilingual Term Extraction and Re-tokenization for Chinese–Japanese |
Authors | Wei Yang, Yves Lepage |
Abstract | Unlike European languages, many Asian languages like Chinese and Japanese do not have typographic boundaries in written system. Word segmentation (tokenization) that break sentences down into individual words (tokens) is normally treated as the first step for machine translation (MT). For Chinese and Japanese, different rules and segmentation tools lead different segmentation results in different level of granularity between Chinese and Japanese. To improve the translation accuracy, we adjust and balance the granularity of segmentation results around terms for Chinese{–}Japanese patent corpus for training translation model. In this paper, we describe a statistical machine translation (SMT) system which is built on re-tokenized Chinese-Japanese patent training corpus using extracted bilingual multi-word terms. |
Tasks | Chinese Word Segmentation, Machine Translation, Tokenization |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4619/ |
https://www.aclweb.org/anthology/W16-4619 | |
PWC | https://paperswithcode.com/paper/improving-patent-translation-using-bilingual |
Repo | |
Framework | |
PKUSUMSUM : A Java Platform for Multilingual Document Summarization
Title | PKUSUMSUM : A Java Platform for Multilingual Document Summarization |
Authors | Jianmin Zhang, Tianming Wang, Xiaojun Wan |
Abstract | PKUSUMSUM is a Java platform for multilingual document summarization, and it sup-ports multiple languages, integrates 10 automatic summarization methods, and tackles three typical summarization tasks. The summarization platform has been released and users can easily use and update it. In this paper, we make a brief description of the char-acteristics, the summarization methods, and the evaluation results of the platform, and al-so compare PKUSUMSUM with other summarization toolkits. |
Tasks | Chinese Word Segmentation, Document Summarization, Information Retrieval |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-2060/ |
https://www.aclweb.org/anthology/C16-2060 | |
PWC | https://paperswithcode.com/paper/pkusumsum-a-java-platform-for-multilingual |
Repo | |
Framework | |
A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation
Title | A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation |
Authors | Robert Herms, Laura Seelig, Stefanie M{"u}nch, Maximilian Eibl |
Abstract | In this Paper we present a corpus named SXUCorpus which contains read and spontaneous speech of the Upper Saxon German dialect. The data has been collected from eight archives of local television stations located in the Free State of Saxony. The recordings include broadcasted topics of news, economy, weather, sport, and documentation from the years 1992 to 1996 and have been manually transcribed and labeled. In the paper, we report the methodology of collecting and processing analog audiovisual material, constructing the corpus and describe the properties of the data. In its current version, the corpus is available to the scientific community and is designed for automatic speech recognition (ASR) evaluation with a development set and a test set. We performed ASR experiments with the open-source framework sphinx-4 including a configuration for Standard German on the dataset. Additionally, we show the influence of acoustic model and language model adaptation by the utilization of the development set. |
Tasks | Language Modelling, Speech Recognition |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1736/ |
https://www.aclweb.org/anthology/L16-1736 | |
PWC | https://paperswithcode.com/paper/a-corpus-of-read-and-spontaneous-upper-saxon |
Repo | |
Framework | |
A Fluctuation Smoothing Approach for Unsupervised Automatic Short Answer Grading
Title | A Fluctuation Smoothing Approach for Unsupervised Automatic Short Answer Grading |
Authors | Shourya Roy, D, S apat, ipan, Y. Narahari |
Abstract | We offer a fluctuation smoothing computational approach for unsupervised automatic short answer grading (ASAG) techniques in the educational ecosystem. A major drawback of the existing techniques is the significant effect that variations in model answers could have on their performances. The proposed fluctuation smoothing approach, based on classical sequential pattern mining, exploits lexical overlap in students{'} answers to any typical question. We empirically demonstrate using multiple datasets that the proposed approach improves the overall performance and significantly reduces (up to 63{%}) variation in performance (standard deviation) of unsupervised ASAG techniques. We bring in additional benchmarks such as (a) paraphrasing of model answers and (b) using answers by k top performing students as model answers, to amplify the benefits of the proposed approach. |
Tasks | Sequential Pattern Mining |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4911/ |
https://www.aclweb.org/anthology/W16-4911 | |
PWC | https://paperswithcode.com/paper/a-fluctuation-smoothing-approach-for |
Repo | |
Framework | |