May 4, 2019

1684 words 8 mins read

Paper Group NANR 211

Paper Group NANR 211

Modelling Valence and Arousal in Facebook posts. Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic. Integrating Morphological Desegmentation into Phrase-based Decoding. ASOBEK at SemEval-2016 Task 1: Sentence Representation with Character N-gram Embeddings for Semantic Textual Similarity. A Dataset for Open …

Modelling Valence and Arousal in Facebook posts

Title Modelling Valence and Arousal in Facebook posts
Authors Daniel Preo{\c{t}}iuc-Pietro, H. Andrew Schwartz, Gregory Park, Johannes Eichstaedt, Margaret Kern, Lyle Ungar, Elisabeth Shulman
Abstract
Tasks Aspect-Based Sentiment Analysis, Emotion Classification, Sentiment Analysis
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0404/
PDF https://www.aclweb.org/anthology/W16-0404
PWC https://paperswithcode.com/paper/modelling-valence-and-arousal-in-facebook
Repo
Framework

Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic

Title Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic
Authors Abdelati Hawwari, Mohammed Attia, Mahmoud Ghoneim, Mona Diab
Abstract Idafa in traditional Arabic grammar is an umbrella construction that covers several phenomena including what is expressed in English as noun-noun compounds and Saxon and Norman genitives. Additionally, Idafa participates in some other constructions, such as quantifiers, quasi-prepositions, and adjectives. Identifying the various types of the Idafa construction (IC) is of importance to Natural Language processing (NLP) applications. Noun-Noun compounds exhibit special behavior in most languages impacting their semantic interpretation. Hence distinguishing them could have an impact on downstream NLP applications. The most comprehensive syntactic representation of the Arabic language is the LDC Arabic Treebank (ATB). In the ATB, ICs are not explicitly labeled and furthermore, there is no distinction between ICs of noun-noun relations and other traditional ICs. Hence, we devise a detailed syntactic and semantic typification process of the IC phenomenon in Arabic. We target the ATB as a platform for this classification. We render the ATB annotated with explicit IC labels but with the further semantic characterization which is useful for syntactic, semantic and cross language processing. Our typification of IC comprises 3 main syntactic IC types: FIC, GIC, and TIC, and they are further divided into 10 syntactic subclasses. The TIC group is further classified into semantic relations. We devise a method for automatic IC labeling and compare its yield against the CATiB treebank. Our evaluation shows that we achieve the same level of accuracy, but with the additional fine-grained classification into the various syntactic and semantic types.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1567/
PDF https://www.aclweb.org/anthology/L16-1567
PWC https://paperswithcode.com/paper/explicit-fine-grained-syntactic-and-semantic
Repo
Framework

Integrating Morphological Desegmentation into Phrase-based Decoding

Title Integrating Morphological Desegmentation into Phrase-based Decoding
Authors Mohammad Salameh, Colin Cherry, Grzegorz Kondrak
Abstract
Tasks Language Modelling
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1140/
PDF https://www.aclweb.org/anthology/N16-1140
PWC https://paperswithcode.com/paper/integrating-morphological-desegmentation-into
Repo
Framework

ASOBEK at SemEval-2016 Task 1: Sentence Representation with Character N-gram Embeddings for Semantic Textual Similarity

Title ASOBEK at SemEval-2016 Task 1: Sentence Representation with Character N-gram Embeddings for Semantic Textual Similarity
Authors Asli Eyecioglu, Bill Keller
Abstract
Tasks Language Modelling, Lemmatization, Natural Language Inference, Paraphrase Identification, Semantic Textual Similarity, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1114/
PDF https://www.aclweb.org/anthology/S16-1114
PWC https://paperswithcode.com/paper/asobek-at-semeval-2016-task-1-sentence
Repo
Framework

A Dataset for Open Event Extraction in English

Title A Dataset for Open Event Extraction in English
Authors Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besan{\c{c}}on
Abstract This article presents a corpus for development and testing of event schema induction systems in English. Schema induction is the task of learning templates with no supervision from unlabeled texts, and to group together entities corresponding to the same role in a template. Most of the previous work on this subject relies on the MUC-4 corpus. We describe the limits of using this corpus (size, non-representativeness, similarity of roles across templates) and propose a new, partially-annotated corpus in English which remedies some of these shortcomings. We make use of Wikinews to select the data inside the category Laws {&} Justice, and query Google search engine to retrieve different documents on the same events. Only Wikinews documents are manually annotated and can be used for evaluation, while the others can be used for unsupervised learning. We detail the methodology used for building the corpus and evaluate some existing systems on this new data.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1307/
PDF https://www.aclweb.org/anthology/L16-1307
PWC https://paperswithcode.com/paper/a-dataset-for-open-event-extraction-in
Repo
Framework

Exploiting Mutual Benefits between Syntax and Semantic Roles using Neural Network

Title Exploiting Mutual Benefits between Syntax and Semantic Roles using Neural Network
Authors Peng Shi, Zhiyang Teng, Yue Zhang
Abstract
Tasks Dependency Parsing, Multi-Task Learning, Semantic Role Labeling
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1098/
PDF https://www.aclweb.org/anthology/D16-1098
PWC https://paperswithcode.com/paper/exploiting-mutual-benefits-between-syntax-and
Repo
Framework

Capturing Argument Relationship for Chinese Semantic Role Labeling

Title Capturing Argument Relationship for Chinese Semantic Role Labeling
Authors Lei Sha, Sujian Li, Baobao Chang, Zhifang Sui, Tingsong Jiang
Abstract
Tasks Semantic Role Labeling
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1212/
PDF https://www.aclweb.org/anthology/D16-1212
PWC https://paperswithcode.com/paper/capturing-argument-relationship-for-chinese
Repo
Framework

使用字典學習法於強健性語音辨識 (The Use of Dictionary Learning Approach for Robustness Speech Recognition) [In Chinese]

Title 使用字典學習法於強健性語音辨識 (The Use of Dictionary Learning Approach for Robustness Speech Recognition) [In Chinese]
Authors Bi-Cheng Yan, Chin-Hong Shih, Shih-Hung Liu, Berlin Chen
Abstract
Tasks Dictionary Learning, Speech Recognition
Published 2016-12-01
URL https://www.aclweb.org/anthology/O16-3003/
PDF https://www.aclweb.org/anthology/O16-3003
PWC https://paperswithcode.com/paper/a12c-aa-a-c314a14aeae3e34-e-the-use-of
Repo
Framework

Measuring Non-cooperation in Dialogue

Title Measuring Non-cooperation in Dialogue
Authors Brian Pl{"u}ss, Paul Piwek
Abstract This paper introduces a novel method for measuring non-cooperation in dialogue. The key idea is that linguistic non-cooperation can be measured in terms of the extent to which dialogue participants deviate from conventions regarding the proper introduction and discharging of conversational obligations (e.g., the obligation to respond to a question). Previous work on non cooperation has focused mainly on non-linguistic task-related non-cooperation or modelled non-cooperation in terms of special rules describing non-cooperative behaviours. In contrast, we start from rules for normal/correct dialogue behaviour - i.e., a dialogue game - which in principle can be derived from a corpus of cooperative dialogues, and provide a quantitative measure for the degree to which participants comply with these rules. We evaluated the model on a corpus of political interviews, with encouraging results. The model predicts accurately the degree of cooperation for one of the two dialogue game roles (interviewer) and also the relative cooperation for both roles (i.e., which interlocutor in the conversation was most cooperative). Being able to measure cooperation has applications in many areas from the analysis - manual, semi and fully automatic - of natural language interactions to human-like virtual personal assistants, tutoring agents, sophisticated dialogue systems, and role-playing virtual humans.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1181/
PDF https://www.aclweb.org/anthology/C16-1181
PWC https://paperswithcode.com/paper/measuring-non-cooperation-in-dialogue
Repo
Framework

MetaMind Neural Machine Translation System for WMT 2016

Title MetaMind Neural Machine Translation System for WMT 2016
Authors James Bradbury, Richard Socher
Abstract
Tasks Data Augmentation, Machine Translation, Tokenization
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2308/
PDF https://www.aclweb.org/anthology/W16-2308
PWC https://paperswithcode.com/paper/metamind-neural-machine-translation-system
Repo
Framework

VERSE: Event and Relation Extraction in the BioNLP 2016 Shared Task

Title VERSE: Event and Relation Extraction in the BioNLP 2016 Shared Task
Authors Jake Lever, Steven JM Jones
Abstract
Tasks Question Answering, Relation Extraction
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-3005/
PDF https://www.aclweb.org/anthology/W16-3005
PWC https://paperswithcode.com/paper/verse-event-and-relation-extraction-in-the
Repo
Framework

Arabic to English Person Name Transliteration using Twitter

Title Arabic to English Person Name Transliteration using Twitter
Authors Hamdy Mubarak, Ahmed Abdelali
Abstract Social media outlets are providing new opportunities for harvesting valuable resources. We present a novel approach for mining data from Twitter for the purpose of building transliteration resources and systems. Such resources are crucial in translation and retrieval tasks. We demonstrate the benefits of the approach on Arabic to English transliteration. The contribution of this approach includes the size of data that can be collected and exploited within the span of a limited time; the approach is very generic and can be adopted to other languages and the ability of the approach to cope with new transliteration phenomena and trends. A statistical transliteration system built using this data improved a comparable system built from Wikipedia wikilinks data.
Tasks Transliteration
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1054/
PDF https://www.aclweb.org/anthology/L16-1054
PWC https://paperswithcode.com/paper/arabic-to-english-person-name-transliteration
Repo
Framework

Ensemble Classification of Grants using LDA-based Features

Title Ensemble Classification of Grants using LDA-based Features
Authors Yannis Korkontzelos, Beverley Thomas, Makoto Miwa, Sophia Ananiadou
Abstract Classifying research grants into useful categories is a vital task for a funding body to give structure to the portfolio for analysis, informing strategic planning and decision-making. Automating this classification process would save time and effort, providing the accuracy of the classifications is maintained. We employ five classification models to classify a set of BBSRC-funded research grants in 21 research topics based on unigrams, technical terms and Latent Dirichlet Allocation models. To boost precision, we investigate methods for combining their predictions into five aggregate classifiers. Evaluation confirmed that ensemble classification models lead to higher precision.It was observed that there is not a single best-performing aggregate method for all research topics. Instead, the best-performing method for a research topic depends on the number of positive training instances available for this topic. Subject matter experts considered the predictions of aggregate models to correct erroneous or incomplete manual assignments.
Tasks Decision Making
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1205/
PDF https://www.aclweb.org/anthology/L16-1205
PWC https://paperswithcode.com/paper/ensemble-classification-of-grants-using-lda
Repo
Framework

A Corpus-based Approach for Spanish-Chinese Language Learning

Title A Corpus-based Approach for Spanish-Chinese Language Learning
Authors Shuyuan Cao, Iria da Cunha, Mikel Iruskieta
Abstract Due to the huge population that speaks Spanish and Chinese, these languages occupy an important position in the language learning studies. Although there are some automatic translation systems that benefit the learning of both languages, there is enough space to create resources in order to help language learners. As a quick and effective resource that can give large amount language information, corpus-based learning is becoming more and more popular. In this paper we enrich a Spanish-Chinese parallel corpus automatically with part of-speech (POS) information and manually with discourse segmentation (following the Rhetorical Structure Theory (RST) (Mann and Thompson, 1988)). Two search tools allow the Spanish-Chinese language learners to carry out different queries based on tokens and lemmas. The parallel corpus and the research tools are available to the academic community. We propose some examples to illustrate how learners can use the corpus to learn Spanish and Chinese.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4913/
PDF https://www.aclweb.org/anthology/W16-4913
PWC https://paperswithcode.com/paper/a-corpus-based-approach-for-spanish-chinese
Repo
Framework

Unsupervised Document Classification with Informed Topic Models

Title Unsupervised Document Classification with Informed Topic Models
Authors Timothy Miller, Dmitriy Dligach, Guergana Savova
Abstract
Tasks Active Learning, Document Classification, Topic Models
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2911/
PDF https://www.aclweb.org/anthology/W16-2911
PWC https://paperswithcode.com/paper/unsupervised-document-classification-with
Repo
Framework
comments powered by Disqus