Paper Group NANR 137
Unity in Diversity: A Unified Parsing Strategy for Major Indian Languages. Proceedings of the First ACL Workshop on Ethics in Natural Language Processing. A Statistical Machine Translation Model with Forest-to-Tree Algorithm for Semantic Parsing. Ranking Right-Wing Extremist Social Media Profiles by Similarity to Democratic and Extremist Groups. A …
Unity in Diversity: A Unified Parsing Strategy for Major Indian Languages
Title | Unity in Diversity: A Unified Parsing Strategy for Major Indian Languages |
Authors | T, Juhi on, Dipti Misra Sharma |
Abstract | |
Tasks | Constituency Parsing, Feature Engineering, Machine Translation |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/W17-6529/ |
https://www.aclweb.org/anthology/W17-6529 | |
PWC | https://paperswithcode.com/paper/unity-in-diversity-a-unified-parsing-strategy |
Repo | |
Framework | |
Proceedings of the First ACL Workshop on Ethics in Natural Language Processing
Title | Proceedings of the First ACL Workshop on Ethics in Natural Language Processing |
Authors | |
Abstract | |
Tasks | |
Published | 2017-04-01 |
URL | https://www.aclweb.org/anthology/W17-1600/ |
https://www.aclweb.org/anthology/W17-1600 | |
PWC | https://paperswithcode.com/paper/proceedings-of-the-first-acl-workshop-on |
Repo | |
Framework | |
A Statistical Machine Translation Model with Forest-to-Tree Algorithm for Semantic Parsing
Title | A Statistical Machine Translation Model with Forest-to-Tree Algorithm for Semantic Parsing |
Authors | Zhihua Liao, Yan Xie |
Abstract | In this paper, we propose a novel supervised model for parsing natural language sentences into their formal semantic representations. This model treats sentence-to-lambda-logical expression conversion within the framework of the statistical machine translation with forest-to-tree algorithm. To make this work, we transform the lambda-logical expression structure into a form suitable for the mechanics of statistical machine translation and useful for modeling. We show that our model is able to yield new state-of-the-art results on both standard datasets with simple features. |
Tasks | Machine Translation, Semantic Parsing |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/R17-1059/ |
https://doi.org/10.26615/978-954-452-049-6_059 | |
PWC | https://paperswithcode.com/paper/a-statistical-machine-translation-model-with |
Repo | |
Framework | |
Ranking Right-Wing Extremist Social Media Profiles by Similarity to Democratic and Extremist Groups
Title | Ranking Right-Wing Extremist Social Media Profiles by Similarity to Democratic and Extremist Groups |
Authors | Matthias Hartung, Roman Klinger, Franziska Schmidtke, Lars Vogel |
Abstract | Social media are used by an increasing number of political actors. A small subset of these is interested in pursuing extremist motives such as mobilization, recruiting or radicalization activities. In order to counteract these trends, online providers and state institutions reinforce their monitoring efforts, mostly relying on manual workflows. We propose a machine learning approach to support manual attempts towards identifying right-wing extremist content in German Twitter profiles. Based on a fine-grained conceptualization of right-wing extremism, we frame the task as ranking each individual profile on a continuum spanning different degrees of right-wing extremism, based on a nearest neighbour approach. A quantitative evaluation reveals that our ranking model yields robust performance (up to 0.81 F$_1$ score) when being used for predicting discrete class labels. At the same time, the model provides plausible continuous ranking scores for a small sample of borderline cases at the division of right-wing extremism and New Right political movements. |
Tasks | |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/W17-5204/ |
https://www.aclweb.org/anthology/W17-5204 | |
PWC | https://paperswithcode.com/paper/ranking-right-wing-extremist-social-media |
Repo | |
Framework | |
A Implementa\cc~ao de uma Minigram'atica do Portugu^es Brasileiro sob a Perspectiva da LFG (Implementation of a grammar fragment of Brazilian Portuguese in the Lexical-Functional Grammar formalism)[In Portuguese]
Title | A Implementa\cc~ao de uma Minigram'atica do Portugu^es Brasileiro sob a Perspectiva da LFG (Implementation of a grammar fragment of Brazilian Portuguese in the Lexical-Functional Grammar formalism)[In Portuguese] |
Authors | Daniel Soares, Francisco Nogueira, Leonel Figueiredo Alencar |
Abstract | |
Tasks | |
Published | 2017-10-01 |
URL | https://www.aclweb.org/anthology/W17-6621/ |
https://www.aclweb.org/anthology/W17-6621 | |
PWC | https://paperswithcode.com/paper/a-implementaaao-de-uma-minigramatica-do |
Repo | |
Framework | |
Knowledge Distillation for Bilingual Dictionary Induction
Title | Knowledge Distillation for Bilingual Dictionary Induction |
Authors | Ndap Nakashole, ula, Raphael Flauger |
Abstract | Leveraging zero-shot learning to learn mapping functions between vector spaces of different languages is a promising approach to bilingual dictionary induction. However, methods using this approach have not yet achieved high accuracy on the task. In this paper, we propose a bridging approach, where our main contribution is a knowledge distillation training objective. As teachers, rich resource translation paths are exploited in this role. And as learners, translation paths involving low resource languages learn from the teachers. Our training objective allows seamless addition of teacher translation paths for any given low resource pair. Since our approach relies on the quality of monolingual word embeddings, we also propose to enhance vector representations of both the source and target language with linguistic information. Our experiments on various languages show large performance gains from our distillation training objective, obtaining as high as 17{%} accuracy improvements. |
Tasks | Word Embeddings, Zero-Shot Learning |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/D17-1264/ |
https://www.aclweb.org/anthology/D17-1264 | |
PWC | https://paperswithcode.com/paper/knowledge-distillation-for-bilingual |
Repo | |
Framework | |
Discourse Mode Identification in Essays
Title | Discourse Mode Identification in Essays |
Authors | Wei Song, Dong Wang, Ruiji Fu, Lizhen Liu, Ting Liu, Guoping Hu |
Abstract | Discourse modes play an important role in writing composition and evaluation. This paper presents a study on the manual and automatic identification of narration,exposition, description, argument and emotion expressing sentences in narrative essays. We annotate a corpus to study the characteristics of discourse modes and describe a neural sequence labeling model for identification. Evaluation results show that discourse modes can be identified automatically with an average F1-score of 0.7. We further demonstrate that discourse modes can be used as features that improve automatic essay scoring (AES). The impacts of discourse modes for AES are also discussed. |
Tasks | |
Published | 2017-07-01 |
URL | https://www.aclweb.org/anthology/P17-1011/ |
https://www.aclweb.org/anthology/P17-1011 | |
PWC | https://paperswithcode.com/paper/discourse-mode-identification-in-essays |
Repo | |
Framework | |
Word Etymology as Native Language Interference
Title | Word Etymology as Native Language Interference |
Authors | Vivi Nastase, Carlo Strapparava |
Abstract | We present experiments that show the influence of native language on lexical choice when producing text in another language {–} in this particular case English. We start from the premise that non-native English speakers will choose lexical items that are close to words in their native language. This leads us to an etymology-based representation of documents written by people whose mother tongue is an Indo-European language. Based on this representation we grow a language family tree, that matches closely the Indo-European language tree. |
Tasks | |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/D17-1286/ |
https://www.aclweb.org/anthology/D17-1286 | |
PWC | https://paperswithcode.com/paper/word-etymology-as-native-language |
Repo | |
Framework | |
Learning Fine-grained Relations from Chinese User Generated Categories
Title | Learning Fine-grained Relations from Chinese User Generated Categories |
Authors | Chengyu Wang, Yan Fan, Xiaofeng He, Aoying Zhou |
Abstract | User generated categories (UGCs) are short texts that reflect how people describe and organize entities, expressing rich semantic relations implicitly. While most methods on UGC relation extraction are based on pattern matching in English circumstances, learning relations from Chinese UGCs poses different challenges due to the flexibility of expressions. In this paper, we present a weakly supervised learning framework to harvest relations from Chinese UGCs. We identify is-a relations via word embedding based projection and inference, extract non-taxonomic relations and their category patterns by graph mining. We conduct experiments on Chinese Wikipedia and achieve high accuracy, outperforming state-of-the-art methods. |
Tasks | Relation Extraction |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/D17-1273/ |
https://www.aclweb.org/anthology/D17-1273 | |
PWC | https://paperswithcode.com/paper/learning-fine-grained-relations-from-chinese |
Repo | |
Framework | |
Quantifying the Effects of Text Duplication on Semantic Models
Title | Quantifying the Effects of Text Duplication on Semantic Models |
Authors | Alex Schofield, ra, Laure Thompson, David Mimno |
Abstract | Duplicate documents are a pervasive problem in text datasets and can have a strong effect on unsupervised models. Methods to remove duplicate texts are typically heuristic or very expensive, so it is vital to know when and why they are needed. We measure the sensitivity of two latent semantic methods to the presence of different levels of document repetition. By artificially creating different forms of duplicate text we confirm several hypotheses about how repeated text impacts models. While a small amount of duplication is tolerable, substantial over-representation of subsets of the text may overwhelm meaningful topical patterns. |
Tasks | |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/D17-1290/ |
https://www.aclweb.org/anthology/D17-1290 | |
PWC | https://paperswithcode.com/paper/quantifying-the-effects-of-text-duplication |
Repo | |
Framework | |
Constru\cc~oes de Estrutura Argumental no ^ambito do Constructicon da FrameNet Brasil: proposta de uma modelagem lingu'\istico-computacional (Structural Constructs of Arguments in the Context of the Construction of FrameNet Brasil: a proposal for a computational-linguistic modeling)[In Portuguese]
Title | Constru\cc~oes de Estrutura Argumental no ^ambito do Constructicon da FrameNet Brasil: proposta de uma modelagem lingu'\istico-computacional (Structural Constructs of Arguments in the Context of the Construction of FrameNet Brasil: a proposal for a computational-linguistic modeling)[In Portuguese] |
Authors | V{^a}nia Gomes Almeida, Tiago Timponi Torrent |
Abstract | |
Tasks | |
Published | 2017-10-01 |
URL | https://www.aclweb.org/anthology/W17-6625/ |
https://www.aclweb.org/anthology/W17-6625 | |
PWC | https://paperswithcode.com/paper/construaaes-de-estrutura-argumental-no-ambito |
Repo | |
Framework | |
Demographic Inference on Twitter using Recursive Neural Networks
Title | Demographic Inference on Twitter using Recursive Neural Networks |
Authors | Sunghwan Mac Kim, Qiongkai Xu, Lizhen Qu, Stephen Wan, C{'e}cile Paris |
Abstract | In social media, demographic inference is a critical task in order to gain a better understanding of a cohort and to facilitate interacting with one{'}s audience. Most previous work has made independence assumptions over topological, textual and label information on social networks. In this work, we employ recursive neural networks to break down these independence assumptions to obtain inference about demographic characteristics on Twitter. We show that our model performs better than existing models including the state-of-the-art. |
Tasks | Network Embedding |
Published | 2017-07-01 |
URL | https://www.aclweb.org/anthology/P17-2075/ |
https://www.aclweb.org/anthology/P17-2075 | |
PWC | https://paperswithcode.com/paper/demographic-inference-on-twitter-using |
Repo | |
Framework | |
Investiga\cc~ao Preliminar Sobre a Pros'odia Sem^antica de Verbos de Elocu\cc~ao: o Caso do Verbo Confessar'' (A Preliminar Investigation About the Semantic Prosody of Elocution Verbs: the Case for the Verb
Confess’')[In Portuguese]
Title | Investiga\cc~ao Preliminar Sobre a Pros'odia Sem^antica de Verbos de Elocu\cc~ao: o Caso do Verbo Confessar'' (A Preliminar Investigation About the Semantic Prosody of Elocution Verbs: the Case for the Verb Confess’')[In Portuguese] |
Authors | Barbara C. Ramos |
Abstract | |
Tasks | |
Published | 2017-10-01 |
URL | https://www.aclweb.org/anthology/W17-6626/ |
https://www.aclweb.org/anthology/W17-6626 | |
PWC | https://paperswithcode.com/paper/investigaaao-preliminar-sobre-a-prosa3dia |
Repo | |
Framework | |
What Analogies Reveal about Word Vectors and their Compositionality
Title | What Analogies Reveal about Word Vectors and their Compositionality |
Authors | Gregory Finley, Stephanie Farmer, Serguei Pakhomov |
Abstract | Analogy completion via vector arithmetic has become a common means of demonstrating the compositionality of word embeddings. Previous work have shown that this strategy works more reliably for certain types of analogical word relationships than for others, but these studies have not offered a convincing account for why this is the case. We arrive at such an account through an experiment that targets a wide variety of analogy questions and defines a baseline condition to more accurately measure the efficacy of our system. We find that the most reliably solvable analogy categories involve either 1) the application of a morpheme with clear syntactic effects, 2) male{–}female alternations, or 3) named entities. These broader types do not pattern cleanly along a syntactic{–}semantic divide. We suggest instead that their commonality is distributional, in that the difference between the distributions of two words in any given pair encompasses a relatively small number of word types. Our study offers a needed explanation for why analogy tests succeed and fail where they do and provides nuanced insight into the relationship between word distributions and the theoretical linguistic domains of syntax and semantics. |
Tasks | Semantic Textual Similarity, Word Embeddings |
Published | 2017-08-01 |
URL | https://www.aclweb.org/anthology/S17-1001/ |
https://www.aclweb.org/anthology/S17-1001 | |
PWC | https://paperswithcode.com/paper/what-analogies-reveal-about-word-vectors-and |
Repo | |
Framework | |
Facebook Integration into University Classes: Opportunities and Challenges
Title | Facebook Integration into University Classes: Opportunities and Challenges |
Authors | Romualdo Mabuan, Gregorio Ebron Jr. |
Abstract | |
Tasks | |
Published | 2017-11-01 |
URL | https://www.aclweb.org/anthology/Y17-1036/ |
https://www.aclweb.org/anthology/Y17-1036 | |
PWC | https://paperswithcode.com/paper/facebook-integration-into-university-classes |
Repo | |
Framework | |