May 4, 2019

1684 words 8 mins read

Paper Group NANR 211

Modelling Valence and Arousal in Facebook posts. Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic. Integrating Morphological Desegmentation into Phrase-based Decoding. ASOBEK at SemEval-2016 Task 1: Sentence Representation with Character N-gram Embeddings for Semantic Textual Similarity. A Dataset for Open …

Modelling Valence and Arousal in Facebook posts


Title	Modelling Valence and Arousal in Facebook posts
Authors	Daniel Preo{\c{t}}iuc-Pietro, H. Andrew Schwartz, Gregory Park, Johannes Eichstaedt, Margaret Kern, Lyle Ungar, Elisabeth Shulman
Abstract
Tasks	Aspect-Based Sentiment Analysis, Emotion Classification, Sentiment Analysis
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0404/
PDF	https://www.aclweb.org/anthology/W16-0404
PWC	https://paperswithcode.com/paper/modelling-valence-and-arousal-in-facebook
Repo
Framework

Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic


Title	Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic
Authors	Abdelati Hawwari, Mohammed Attia, Mahmoud Ghoneim, Mona Diab
Abstract	Idafa in traditional Arabic grammar is an umbrella construction that covers several phenomena including what is expressed in English as noun-noun compounds and Saxon and Norman genitives. Additionally, Idafa participates in some other constructions, such as quantifiers, quasi-prepositions, and adjectives. Identifying the various types of the Idafa construction (IC) is of importance to Natural Language processing (NLP) applications. Noun-Noun compounds exhibit special behavior in most languages impacting their semantic interpretation. Hence distinguishing them could have an impact on downstream NLP applications. The most comprehensive syntactic representation of the Arabic language is the LDC Arabic Treebank (ATB). In the ATB, ICs are not explicitly labeled and furthermore, there is no distinction between ICs of noun-noun relations and other traditional ICs. Hence, we devise a detailed syntactic and semantic typification process of the IC phenomenon in Arabic. We target the ATB as a platform for this classification. We render the ATB annotated with explicit IC labels but with the further semantic characterization which is useful for syntactic, semantic and cross language processing. Our typification of IC comprises 3 main syntactic IC types: FIC, GIC, and TIC, and they are further divided into 10 syntactic subclasses. The TIC group is further classified into semantic relations. We devise a method for automatic IC labeling and compare its yield against the CATiB treebank. Our evaluation shows that we achieve the same level of accuracy, but with the additional fine-grained classification into the various syntactic and semantic types.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1567/
PDF	https://www.aclweb.org/anthology/L16-1567
PWC	https://paperswithcode.com/paper/explicit-fine-grained-syntactic-and-semantic
Repo
Framework

Integrating Morphological Desegmentation into Phrase-based Decoding


Title	Integrating Morphological Desegmentation into Phrase-based Decoding
Authors	Mohammad Salameh, Colin Cherry, Grzegorz Kondrak
Abstract
Tasks	Language Modelling
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-1140/
PDF	https://www.aclweb.org/anthology/N16-1140
PWC	https://paperswithcode.com/paper/integrating-morphological-desegmentation-into
Repo
Framework

ASOBEK at SemEval-2016 Task 1: Sentence Representation with Character N-gram Embeddings for Semantic Textual Similarity


Title	ASOBEK at SemEval-2016 Task 1: Sentence Representation with Character N-gram Embeddings for Semantic Textual Similarity
Authors	Asli Eyecioglu, Bill Keller
Abstract
Tasks	Language Modelling, Lemmatization, Natural Language Inference, Paraphrase Identification, Semantic Textual Similarity, Word Embeddings
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1114/
PDF	https://www.aclweb.org/anthology/S16-1114
PWC	https://paperswithcode.com/paper/asobek-at-semeval-2016-task-1-sentence
Repo
Framework

A Dataset for Open Event Extraction in English


Title	A Dataset for Open Event Extraction in English
Authors	Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besan{\c{c}}on
Abstract	This article presents a corpus for development and testing of event schema induction systems in English. Schema induction is the task of learning templates with no supervision from unlabeled texts, and to group together entities corresponding to the same role in a template. Most of the previous work on this subject relies on the MUC-4 corpus. We describe the limits of using this corpus (size, non-representativeness, similarity of roles across templates) and propose a new, partially-annotated corpus in English which remedies some of these shortcomings. We make use of Wikinews to select the data inside the category Laws {&} Justice, and query Google search engine to retrieve different documents on the same events. Only Wikinews documents are manually annotated and can be used for evaluation, while the others can be used for unsupervised learning. We detail the methodology used for building the corpus and evaluate some existing systems on this new data.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1307/
PDF	https://www.aclweb.org/anthology/L16-1307
PWC	https://paperswithcode.com/paper/a-dataset-for-open-event-extraction-in
Repo
Framework

Exploiting Mutual Benefits between Syntax and Semantic Roles using Neural Network


Title	Exploiting Mutual Benefits between Syntax and Semantic Roles using Neural Network
Authors	Peng Shi, Zhiyang Teng, Yue Zhang
Abstract
Tasks	Dependency Parsing, Multi-Task Learning, Semantic Role Labeling
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1098/
PDF	https://www.aclweb.org/anthology/D16-1098
PWC	https://paperswithcode.com/paper/exploiting-mutual-benefits-between-syntax-and
Repo
Framework

Capturing Argument Relationship for Chinese Semantic Role Labeling


Title	Capturing Argument Relationship for Chinese Semantic Role Labeling
Authors	Lei Sha, Sujian Li, Baobao Chang, Zhifang Sui, Tingsong Jiang
Abstract
Tasks	Semantic Role Labeling
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1212/
PDF	https://www.aclweb.org/anthology/D16-1212
PWC	https://paperswithcode.com/paper/capturing-argument-relationship-for-chinese
Repo
Framework

使用字典學習法於強健性語音辨識 (The Use of Dictionary Learning Approach for Robustness Speech Recognition) [In Chinese]


Title	使用字典學習法於強健性語音辨識 (The Use of Dictionary Learning Approach for Robustness Speech Recognition) [In Chinese]
Authors	Bi-Cheng Yan, Chin-Hong Shih, Shih-Hung Liu, Berlin Chen
Abstract
Tasks	Dictionary Learning, Speech Recognition
Published	2016-12-01
URL	https://www.aclweb.org/anthology/O16-3003/
PDF	https://www.aclweb.org/anthology/O16-3003
PWC	https://paperswithcode.com/paper/a12c-aa-a-c314a14aeae3e34-e-the-use-of
Repo
Framework

Measuring Non-cooperation in Dialogue


Title	Measuring Non-cooperation in Dialogue
Authors	Brian Pl{"u}ss, Paul Piwek
Abstract	This paper introduces a novel method for measuring non-cooperation in dialogue. The key idea is that linguistic non-cooperation can be measured in terms of the extent to which dialogue participants deviate from conventions regarding the proper introduction and discharging of conversational obligations (e.g., the obligation to respond to a question). Previous work on non cooperation has focused mainly on non-linguistic task-related non-cooperation or modelled non-cooperation in terms of special rules describing non-cooperative behaviours. In contrast, we start from rules for normal/correct dialogue behaviour - i.e., a dialogue game - which in principle can be derived from a corpus of cooperative dialogues, and provide a quantitative measure for the degree to which participants comply with these rules. We evaluated the model on a corpus of political interviews, with encouraging results. The model predicts accurately the degree of cooperation for one of the two dialogue game roles (interviewer) and also the relative cooperation for both roles (i.e., which interlocutor in the conversation was most cooperative). Being able to measure cooperation has applications in many areas from the analysis - manual, semi and fully automatic - of natural language interactions to human-like virtual personal assistants, tutoring agents, sophisticated dialogue systems, and role-playing virtual humans.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1181/
PDF	https://www.aclweb.org/anthology/C16-1181
PWC	https://paperswithcode.com/paper/measuring-non-cooperation-in-dialogue
Repo
Framework

MetaMind Neural Machine Translation System for WMT 2016


Title	MetaMind Neural Machine Translation System for WMT 2016
Authors	James Bradbury, Richard Socher
Abstract
Tasks	Data Augmentation, Machine Translation, Tokenization
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2308/
PDF	https://www.aclweb.org/anthology/W16-2308
PWC	https://paperswithcode.com/paper/metamind-neural-machine-translation-system
Repo
Framework

VERSE: Event and Relation Extraction in the BioNLP 2016 Shared Task


Title	VERSE: Event and Relation Extraction in the BioNLP 2016 Shared Task
Authors	Jake Lever, Steven JM Jones
Abstract
Tasks	Question Answering, Relation Extraction
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-3005/
PDF	https://www.aclweb.org/anthology/W16-3005
PWC	https://paperswithcode.com/paper/verse-event-and-relation-extraction-in-the
Repo
Framework

Arabic to English Person Name Transliteration using Twitter


Title	Arabic to English Person Name Transliteration using Twitter
Authors	Hamdy Mubarak, Ahmed Abdelali
Abstract	Social media outlets are providing new opportunities for harvesting valuable resources. We present a novel approach for mining data from Twitter for the purpose of building transliteration resources and systems. Such resources are crucial in translation and retrieval tasks. We demonstrate the benefits of the approach on Arabic to English transliteration. The contribution of this approach includes the size of data that can be collected and exploited within the span of a limited time; the approach is very generic and can be adopted to other languages and the ability of the approach to cope with new transliteration phenomena and trends. A statistical transliteration system built using this data improved a comparable system built from Wikipedia wikilinks data.
Tasks	Transliteration
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1054/
PDF	https://www.aclweb.org/anthology/L16-1054
PWC	https://paperswithcode.com/paper/arabic-to-english-person-name-transliteration
Repo
Framework

Ensemble Classification of Grants using LDA-based Features


Title	Ensemble Classification of Grants using LDA-based Features
Authors	Yannis Korkontzelos, Beverley Thomas, Makoto Miwa, Sophia Ananiadou
Abstract	Classifying research grants into useful categories is a vital task for a funding body to give structure to the portfolio for analysis, informing strategic planning and decision-making. Automating this classification process would save time and effort, providing the accuracy of the classifications is maintained. We employ five classification models to classify a set of BBSRC-funded research grants in 21 research topics based on unigrams, technical terms and Latent Dirichlet Allocation models. To boost precision, we investigate methods for combining their predictions into five aggregate classifiers. Evaluation confirmed that ensemble classification models lead to higher precision.It was observed that there is not a single best-performing aggregate method for all research topics. Instead, the best-performing method for a research topic depends on the number of positive training instances available for this topic. Subject matter experts considered the predictions of aggregate models to correct erroneous or incomplete manual assignments.
Tasks	Decision Making
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1205/
PDF	https://www.aclweb.org/anthology/L16-1205
PWC	https://paperswithcode.com/paper/ensemble-classification-of-grants-using-lda
Repo
Framework

A Corpus-based Approach for Spanish-Chinese Language Learning


Title	A Corpus-based Approach for Spanish-Chinese Language Learning
Authors	Shuyuan Cao, Iria da Cunha, Mikel Iruskieta
Abstract	Due to the huge population that speaks Spanish and Chinese, these languages occupy an important position in the language learning studies. Although there are some automatic translation systems that benefit the learning of both languages, there is enough space to create resources in order to help language learners. As a quick and effective resource that can give large amount language information, corpus-based learning is becoming more and more popular. In this paper we enrich a Spanish-Chinese parallel corpus automatically with part of-speech (POS) information and manually with discourse segmentation (following the Rhetorical Structure Theory (RST) (Mann and Thompson, 1988)). Two search tools allow the Spanish-Chinese language learners to carry out different queries based on tokens and lemmas. The parallel corpus and the research tools are available to the academic community. We propose some examples to illustrate how learners can use the corpus to learn Spanish and Chinese.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4913/
PDF	https://www.aclweb.org/anthology/W16-4913
PWC	https://paperswithcode.com/paper/a-corpus-based-approach-for-spanish-chinese
Repo
Framework

Unsupervised Document Classification with Informed Topic Models


Title	Unsupervised Document Classification with Informed Topic Models
Authors	Timothy Miller, Dmitriy Dligach, Guergana Savova
Abstract
Tasks	Active Learning, Document Classification, Topic Models
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2911/
PDF	https://www.aclweb.org/anthology/W16-2911
PWC	https://paperswithcode.com/paper/unsupervised-document-classification-with
Repo
Framework