May 5, 2019

1540 words 8 mins read

Paper Group NANR 133

Paper Group NANR 133

Flexible Non-Terminals for Dependency Tree-to-Tree Reordering. A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances. Semi-supervised Gender Classification with Joint Textual and Social Modeling. An Empirical Exploration of Moral Foundations Theory in Partisan News Sources. Predicting proficiency levels in lear …

Flexible Non-Terminals for Dependency Tree-to-Tree Reordering

Title Flexible Non-Terminals for Dependency Tree-to-Tree Reordering
Authors John Richardson, Fabien Cromi{`e}res, Toshiaki Nakazawa, Sadao Kurohashi
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1002/
PDF https://www.aclweb.org/anthology/N16-1002
PWC https://paperswithcode.com/paper/flexible-non-terminals-for-dependency-tree-to
Repo
Framework

A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances

Title A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances
Authors Roman Sergienko, Muhammad Shan, Wolfgang Minker
Abstract The paper describes a comparative study of existing and novel text preprocessing and classification techniques for domain detection of user utterances. Two corpora are considered. The first one contains customer calls to a call centre for further call routing; the second one contains answers of call centre employees with different kinds of customer orientation behaviour. Seven different unsupervised and supervised term weighting methods were applied. The collective use of term weighting methods is proposed for classification effectiveness improvement. Four different dimensionality reduction methods were applied: stop-words filtering with stemming, feature selection based on term weights, feature transformation based on term clustering, and a novel feature transformation method based on terms belonging to classes. As classification algorithms we used k-NN and a SVM-based algorithm. The numerical experiments have shown that the simultaneous use of the novel proposed approaches (collectives of term weighting methods and the novel feature transformation method) allows reaching the high classification results with very small number of features.
Tasks Dimensionality Reduction, Feature Selection
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1288/
PDF https://www.aclweb.org/anthology/L16-1288
PWC https://paperswithcode.com/paper/a-comparative-study-of-text-preprocessing
Repo
Framework

Semi-supervised Gender Classification with Joint Textual and Social Modeling

Title Semi-supervised Gender Classification with Joint Textual and Social Modeling
Authors Shoushan Li, Bin Dai, Zhengxian Gong, Guodong Zhou
Abstract In gender classification, labeled data is often limited while unlabeled data is ample. This motivates semi-supervised learning for gender classification to improve the performance by exploring the knowledge in both labeled and unlabeled data. In this paper, we propose a semi-supervised approach to gender classification by leveraging textual features and a specific kind of indirect links among the users which we call {}same-interest{''} links. Specifically, we propose a factor graph, namely Textual and Social Factor Graph (TSFG), to model both the textual and the {}same-interest{''} link information. Empirical studies demonstrate the effectiveness of the proposed approach to semi-supervised gender classification.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1197/
PDF https://www.aclweb.org/anthology/C16-1197
PWC https://paperswithcode.com/paper/semi-supervised-gender-classification-with
Repo
Framework

An Empirical Exploration of Moral Foundations Theory in Partisan News Sources

Title An Empirical Exploration of Moral Foundations Theory in Partisan News Sources
Authors Dean Fulgoni, Jordan Carpenter, Lyle Ungar, Daniel Preo{\c{t}}iuc-Pietro
Abstract News sources frame issues in different ways in order to appeal or control the perception of their readers. We present a large scale study of news articles from partisan sources in the US across a variety of different issues. We first highlight that differences between sides exist by predicting the political leaning of articles of unseen political bias. Framing can be driven by different types of morality that each group values. We emphasize differences in framing of different news building on the moral foundations theory quantified using hand crafted lexicons. Our results show that partisan sources frame political issues differently both in terms of words usage and through the moral foundations they relate to.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1591/
PDF https://www.aclweb.org/anthology/L16-1591
PWC https://paperswithcode.com/paper/an-empirical-exploration-of-moral-foundations
Repo
Framework

Predicting proficiency levels in learner writings by transferring a linguistic complexity model from expert-written coursebooks

Title Predicting proficiency levels in learner writings by transferring a linguistic complexity model from expert-written coursebooks
Authors Ildik{'o} Pil{'a}n, Elena Volodina, Torsten Zesch
Abstract The lack of a sufficient amount of data tailored for a task is a well-recognized problem for many statistical NLP methods. In this paper, we explore whether data sparsity can be successfully tackled when classifying language proficiency levels in the domain of learner-written output texts. We aim at overcoming data sparsity by incorporating knowledge in the trained model from another domain consisting of input texts written by teaching professionals for learners. We compare different domain adaptation techniques and find that a weighted combination of the two types of data performs best, which can even rival systems based on considerably larger amounts of in-domain data. Moreover, we show that normalizing errors in learners{'} texts can substantially improve classification when level-annotated in-domain data is not available.
Tasks Domain Adaptation, Language Acquisition, Transfer Learning
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1198/
PDF https://www.aclweb.org/anthology/C16-1198
PWC https://paperswithcode.com/paper/predicting-proficiency-levels-in-learner
Repo
Framework

An Analysis of Prerequisite Skills for Reading Comprehension

Title An Analysis of Prerequisite Skills for Reading Comprehension
Authors Saku Sugawara, Akiko Aizawa
Abstract
Tasks Reading Comprehension
Published 2016-11-01
URL https://www.aclweb.org/anthology/W16-6001/
PDF https://www.aclweb.org/anthology/W16-6001
PWC https://paperswithcode.com/paper/an-analysis-of-prerequisite-skills-for
Repo
Framework

YSDA Participation in the WMT’16 Quality Estimation Shared Task

Title YSDA Participation in the WMT’16 Quality Estimation Shared Task
Authors Anna Kozlova, Mariya Shmatova, Anton Frolov
Abstract
Tasks Machine Translation
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2385/
PDF https://www.aclweb.org/anthology/W16-2385
PWC https://paperswithcode.com/paper/ysda-participation-in-the-wmt16-quality
Repo
Framework

Deep neural networks for learning graph representations

Title Deep neural networks for learning graph representations
Authors Shaosheng Cao, Wei Lu, Qiongkai Xu
Abstract In this paper, we propose a novel model for learning graph representations, which generates a low-dimensional vector representation for each vertex by capturing the graph structural information. Different from other previous research efforts, we adopt a random surfing model to capture graph structural information directly, instead of using the sampling-based method for generating linear sequences proposed by Perozzi et al. (2014). The advantages of our approach will be illustrated from both theorical and empirical perspectives. We also give a new perspective for the matrix factorization method proposed by Levy and Goldberg (2014), in which the pointwise mutual information (PMI) matrix is considered as an analytical solution to the objective function of the skip-gram model with negative sampling proposed by Mikolov et al. (2013). Unlike their approach which involves the use of the SVD for finding the low-dimensitonal projections from the PMI matrix, however, the stacked denoising autoencoder is introduced in our model to extract complex features and model non-linearities. To demonstrate the effectiveness of our model, we conduct experiments on clustering and visualization tasks, employing the learned vertex representations as features. Empirical results on datasets of varying sizes show that our model outperforms other stat-of-the-art models in such tasks.
Tasks Denoising, Graph Clustering
Published 2016-02-21
URL https://www.aaai.org/ocs/index.php/AAAI/AAAI16/paper/view/12423
PDF https://pdfs.semanticscholar.org/1a37/f07606d60df365d74752857e8ce909f700b3.pdf
PWC https://paperswithcode.com/paper/deep-neural-networks-for-learning-graph
Repo
Framework

Synchronous Context-Free Grammars and Optimal Parsing Strategies

Title Synchronous Context-Free Grammars and Optimal Parsing Strategies
Authors Daniel Gildea, Giorgio Satta
Abstract
Tasks Machine Translation
Published 2016-06-01
URL https://www.aclweb.org/anthology/J16-2002/
PDF https://www.aclweb.org/anthology/J16-2002
PWC https://paperswithcode.com/paper/synchronous-context-free-grammars-and-optimal
Repo
Framework

A Pipeline Japanese Entity Linking System with Embedding Features

Title A Pipeline Japanese Entity Linking System with Embedding Features
Authors Shuangshuang Zhou
Abstract
Tasks Coreference Resolution, Entity Linking, Information Retrieval, Knowledge Base Population, Named Entity Recognition, Question Answering
Published 2016-10-01
URL https://www.aclweb.org/anthology/Y16-2025/
PDF https://www.aclweb.org/anthology/Y16-2025
PWC https://paperswithcode.com/paper/a-pipeline-japanese-entity-linking-system
Repo
Framework

Modeling language evolution with codes that utilize context and phonetic features

Title Modeling language evolution with codes that utilize context and phonetic features
Authors Javad Nouri, Roman Yangarber
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/K16-1014/
PDF https://www.aclweb.org/anthology/K16-1014
PWC https://paperswithcode.com/paper/modeling-language-evolution-with-codes-that
Repo
Framework

Content Selection through Paraphrase Detection: Capturing different Semantic Realisations of the Same Idea

Title Content Selection through Paraphrase Detection: Capturing different Semantic Realisations of the Same Idea
Authors Elena Lloret, Claire Gardent
Abstract
Tasks Paraphrase Identification, Semantic Textual Similarity, Text Generation, Word Embeddings
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3505/
PDF https://www.aclweb.org/anthology/W16-3505
PWC https://paperswithcode.com/paper/content-selection-through-paraphrase
Repo
Framework

Event Detection and Co-reference with Minimal Supervision

Title Event Detection and Co-reference with Minimal Supervision
Authors Haoruo Peng, Yangqiu Song, Dan Roth
Abstract
Tasks Semantic Role Labeling, Semantic Textual Similarity
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1038/
PDF https://www.aclweb.org/anthology/D16-1038
PWC https://paperswithcode.com/paper/event-detection-and-co-reference-with-minimal
Repo
Framework

Why Do They Leave: Modeling Participation in Online Depression Forums

Title Why Do They Leave: Modeling Participation in Online Depression Forums
Authors Farig Sadeque, Ted Pedersen, Thamar Solorio, Prasha Shrestha, Nicolas Rey-Villamizar, Steven Bethard
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/W16-6203/
PDF https://www.aclweb.org/anthology/W16-6203
PWC https://paperswithcode.com/paper/why-do-they-leave-modeling-participation-in
Repo
Framework

Selecting Sentences versus Selecting Tree Constituents for Automatic Question Ranking

Title Selecting Sentences versus Selecting Tree Constituents for Automatic Question Ranking
Authors Alberto Barr{'o}n-Cede{~n}o, Giovanni Da San Martino, Salvatore Romeo, Aless Moschitti, ro
Abstract Community question answering (cQA) websites are focused on users who query questions onto an online forum, expecting for other users to provide them answers or suggestions. Unlike other social media, the length of the posted queries has no limits and queries tend to be multi-sentence elaborations combining context, actual questions, and irrelevant information. We approach the problem of question ranking: given a user{'}s new question, to retrieve those previously-posted questions which could be equivalent, or highly relevant. This could prevent the posting of nearly-duplicate questions and provide the user with instantaneous answers. For the first time in cQA, we address the selection of relevant text {—}both at sentence- and at constituent-level{—} for parse tree-based representations. Our supervised models for text selection boost the performance of a tree kernel-based machine learning model, allowing it to overtake the current state of the art on a recently released cQA evaluation framework.
Tasks Community Question Answering, Machine Translation, Question Answering
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1237/
PDF https://www.aclweb.org/anthology/C16-1237
PWC https://paperswithcode.com/paper/selecting-sentences-versus-selecting-tree
Repo
Framework
comments powered by Disqus