May 5, 2019

Paper Group NANR 133

Flexible Non-Terminals for Dependency Tree-to-Tree Reordering


Title	Flexible Non-Terminals for Dependency Tree-to-Tree Reordering
Authors	John Richardson, Fabien Cromières, Toshiaki Nakazawa, Sadao Kurohashi
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-1002/
PDF	https://www.aclweb.org/anthology/N16-1002
PWC	https://paperswithcode.com/paper/flexible-non-terminals-for-dependency-tree-to
Repo
Framework

A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances


Title	A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances
Authors	Roman Sergienko, Muhammad Shan, Wolfgang Minker
Abstract	The paper describes a comparative study of existing and novel text preprocessing and classification techniques for domain detection of user utterances. Two corpora are considered. The first one contains customer calls to a call centre for further call routing; the second one contains answers of call centre employees with different kinds of customer orientation behaviour. Seven different unsupervised and supervised term weighting methods were applied. The collective use of term weighting methods is proposed to improve classification effectiveness. Four different dimensionality reduction methods were applied: stop-word filtering with stemming, feature selection based on term weights, feature transformation based on term clustering, and a novel feature transformation method based on terms belonging to classes. As classification algorithms we used k-NN and an SVM-based algorithm. The numerical experiments have shown that the simultaneous use of the novel proposed approaches (collectives of term weighting methods and the novel feature transformation method) achieves high classification results with a very small number of features.
Tasks	Dimensionality Reduction, Feature Selection
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1288/
PDF	https://www.aclweb.org/anthology/L16-1288
PWC	https://paperswithcode.com/paper/a-comparative-study-of-text-preprocessing
Repo
Framework

Semi-supervised Gender Classification with Joint Textual and Social Modeling


Title	Semi-supervised Gender Classification with Joint Textual and Social Modeling
Authors	Shoushan Li, Bin Dai, Zhengxian Gong, Guodong Zhou
Abstract	In gender classification, labeled data is often limited while unlabeled data is ample. This motivates semi-supervised learning for gender classification to improve the performance by exploring the knowledge in both labeled and unlabeled data. In this paper, we propose a semi-supervised approach to gender classification by leveraging textual features and a specific kind of indirect links among the users which we call "same-interest" links. Specifically, we propose a factor graph, namely Textual and Social Factor Graph (TSFG), to model both the textual and the "same-interest" link information. Empirical studies demonstrate the effectiveness of the proposed approach to semi-supervised gender classification.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1197/
PDF	https://www.aclweb.org/anthology/C16-1197
PWC	https://paperswithcode.com/paper/semi-supervised-gender-classification-with
Repo
Framework

An Empirical Exploration of Moral Foundations Theory in Partisan News Sources


Title	An Empirical Exploration of Moral Foundations Theory in Partisan News Sources
Authors	Dean Fulgoni, Jordan Carpenter, Lyle Ungar, Daniel Preoţiuc-Pietro
Abstract	News sources frame issues in different ways in order to appeal to or control the perception of their readers. We present a large-scale study of news articles from partisan sources in the US across a variety of different issues. We first highlight that differences between sides exist by predicting the political leaning of articles of unseen political bias. Framing can be driven by different types of morality that each group values. We emphasize differences in framing of different news building on the moral foundations theory quantified using hand-crafted lexicons. Our results show that partisan sources frame political issues differently both in terms of word usage and through the moral foundations they relate to.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1591/
PDF	https://www.aclweb.org/anthology/L16-1591
PWC	https://paperswithcode.com/paper/an-empirical-exploration-of-moral-foundations
Repo
Framework

Predicting proficiency levels in learner writings by transferring a linguistic complexity model from expert-written coursebooks


Title	Predicting proficiency levels in learner writings by transferring a linguistic complexity model from expert-written coursebooks
Authors	Ildikó Pilán, Elena Volodina, Torsten Zesch
Abstract	The lack of a sufficient amount of data tailored for a task is a well-recognized problem for many statistical NLP methods. In this paper, we explore whether data sparsity can be successfully tackled when classifying language proficiency levels in the domain of learner-written output texts. We aim at overcoming data sparsity by incorporating knowledge in the trained model from another domain consisting of input texts written by teaching professionals for learners. We compare different domain adaptation techniques and find that a weighted combination of the two types of data performs best, which can even rival systems based on considerably larger amounts of in-domain data. Moreover, we show that normalizing errors in learners' texts can substantially improve classification when level-annotated in-domain data is not available.
Tasks	Domain Adaptation, Language Acquisition, Transfer Learning
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1198/
PDF	https://www.aclweb.org/anthology/C16-1198
PWC	https://paperswithcode.com/paper/predicting-proficiency-levels-in-learner
Repo
Framework

An Analysis of Prerequisite Skills for Reading Comprehension


Title	An Analysis of Prerequisite Skills for Reading Comprehension
Authors	Saku Sugawara, Akiko Aizawa
Abstract
Tasks	Reading Comprehension
Published	2016-11-01
URL	https://www.aclweb.org/anthology/W16-6001/
PDF	https://www.aclweb.org/anthology/W16-6001
PWC	https://paperswithcode.com/paper/an-analysis-of-prerequisite-skills-for
Repo
Framework

YSDA Participation in the WMT’16 Quality Estimation Shared Task


Title	YSDA Participation in the WMT’16 Quality Estimation Shared Task
Authors	Anna Kozlova, Mariya Shmatova, Anton Frolov
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2385/
PDF	https://www.aclweb.org/anthology/W16-2385
PWC	https://paperswithcode.com/paper/ysda-participation-in-the-wmt16-quality
Repo
Framework

Deep neural networks for learning graph representations


Title	Deep neural networks for learning graph representations
Authors	Shaosheng Cao, Wei Lu, Qiongkai Xu
Abstract	In this paper, we propose a novel model for learning graph representations, which generates a low-dimensional vector representation for each vertex by capturing the graph structural information. Different from other previous research efforts, we adopt a random surfing model to capture graph structural information directly, instead of using the sampling-based method for generating linear sequences proposed by Perozzi et al. (2014). The advantages of our approach will be illustrated from both theoretical and empirical perspectives. We also give a new perspective for the matrix factorization method proposed by Levy and Goldberg (2014), in which the pointwise mutual information (PMI) matrix is considered as an analytical solution to the objective function of the skip-gram model with negative sampling proposed by Mikolov et al. (2013). Unlike their approach which involves the use of the SVD for finding the low-dimensional projections from the PMI matrix, however, the stacked denoising autoencoder is introduced in our model to extract complex features and model non-linearities. To demonstrate the effectiveness of our model, we conduct experiments on clustering and visualization tasks, employing the learned vertex representations as features. Empirical results on datasets of varying sizes show that our model outperforms other state-of-the-art models in such tasks.
Tasks	Denoising, Graph Clustering
Published	2016-02-21
URL	https://www.aaai.org/ocs/index.php/AAAI/AAAI16/paper/view/12423
PDF	https://pdfs.semanticscholar.org/1a37/f07606d60df365d74752857e8ce909f700b3.pdf
PWC	https://paperswithcode.com/paper/deep-neural-networks-for-learning-graph
Repo
Framework
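
The PMI matrix mentioned in the abstract above can be illustrated with a small sketch. This is not the authors' code: the function name `ppmi_matrix`, the toy co-occurrence counts, and the clipping of negative values to zero (positive PMI) are all assumptions made for illustration. It uses the standard definition PMI(i, j) = log(c_ij * N / (r_i * c_j)), where N is the total count and r_i, c_j are row and column sums.

```python
import math


def ppmi_matrix(counts):
    """Positive PMI for a co-occurrence count matrix (list of lists).

    Hypothetical helper: zero counts map to 0.0, and negative PMI
    values are clipped to 0.0 (an assumption of this sketch).
    """
    total = sum(sum(row) for row in counts)
    row_sums = [sum(row) for row in counts]
    col_sums = [sum(col) for col in zip(*counts)]
    result = []
    for i, row in enumerate(counts):
        out_row = []
        for j, c in enumerate(row):
            if c == 0:
                out_row.append(0.0)  # PMI undefined for zero counts
            else:
                pmi = math.log(c * total / (row_sums[i] * col_sums[j]))
                out_row.append(max(pmi, 0.0))
        result.append(out_row)
    return result


# Toy symmetric vertex co-occurrence counts (made up for illustration).
toy_counts = [
    [0, 2, 1],
    [2, 0, 0],
    [1, 0, 0],
]
ppmi = ppmi_matrix(toy_counts)
```

A matrix like `ppmi` is what would then be passed to a dimensionality reduction step, such as the SVD or the stacked denoising autoencoder that the abstract contrasts.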

Synchronous Context-Free Grammars and Optimal Parsing Strategies


Title	Synchronous Context-Free Grammars and Optimal Parsing Strategies
Authors	Daniel Gildea, Giorgio Satta
Abstract
Tasks	Machine Translation
Published	2016-06-01
URL	https://www.aclweb.org/anthology/J16-2002/
PDF	https://www.aclweb.org/anthology/J16-2002
PWC	https://paperswithcode.com/paper/synchronous-context-free-grammars-and-optimal
Repo
Framework

A Pipeline Japanese Entity Linking System with Embedding Features


Title	A Pipeline Japanese Entity Linking System with Embedding Features
Authors	Shuangshuang Zhou
Abstract
Tasks	Coreference Resolution, Entity Linking, Information Retrieval, Knowledge Base Population, Named Entity Recognition, Question Answering
Published	2016-10-01
URL	https://www.aclweb.org/anthology/Y16-2025/
PDF	https://www.aclweb.org/anthology/Y16-2025
PWC	https://paperswithcode.com/paper/a-pipeline-japanese-entity-linking-system
Repo
Framework

Modeling language evolution with codes that utilize context and phonetic features


Title	Modeling language evolution with codes that utilize context and phonetic features
Authors	Javad Nouri, Roman Yangarber
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/K16-1014/
PDF	https://www.aclweb.org/anthology/K16-1014
PWC	https://paperswithcode.com/paper/modeling-language-evolution-with-codes-that
Repo
Framework

Content Selection through Paraphrase Detection: Capturing different Semantic Realisations of the Same Idea


Title	Content Selection through Paraphrase Detection: Capturing different Semantic Realisations of the Same Idea
Authors	Elena Lloret, Claire Gardent
Abstract
Tasks	Paraphrase Identification, Semantic Textual Similarity, Text Generation, Word Embeddings
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-3505/
PDF	https://www.aclweb.org/anthology/W16-3505
PWC	https://paperswithcode.com/paper/content-selection-through-paraphrase
Repo
Framework

Event Detection and Co-reference with Minimal Supervision


Title	Event Detection and Co-reference with Minimal Supervision
Authors	Haoruo Peng, Yangqiu Song, Dan Roth
Abstract
Tasks	Semantic Role Labeling, Semantic Textual Similarity
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1038/
PDF	https://www.aclweb.org/anthology/D16-1038
PWC	https://paperswithcode.com/paper/event-detection-and-co-reference-with-minimal
Repo
Framework

Why Do They Leave: Modeling Participation in Online Depression Forums


Title	Why Do They Leave: Modeling Participation in Online Depression Forums
Authors	Farig Sadeque, Ted Pedersen, Thamar Solorio, Prasha Shrestha, Nicolas Rey-Villamizar, Steven Bethard
Abstract
Tasks
Published	2016-11-01
URL	https://www.aclweb.org/anthology/W16-6203/
PDF	https://www.aclweb.org/anthology/W16-6203
PWC	https://paperswithcode.com/paper/why-do-they-leave-modeling-participation-in
Repo
Framework

Selecting Sentences versus Selecting Tree Constituents for Automatic Question Ranking


Title	Selecting Sentences versus Selecting Tree Constituents for Automatic Question Ranking
Authors	Alberto Barrón-Cedeño, Giovanni Da San Martino, Salvatore Romeo, Alessandro Moschitti
Abstract	Community question answering (cQA) websites are focused on users who post questions to an online forum, expecting other users to provide them with answers or suggestions. Unlike other social media, the length of the posted queries has no limits and queries tend to be multi-sentence elaborations combining context, actual questions, and irrelevant information. We approach the problem of question ranking: given a user's new question, to retrieve those previously-posted questions which could be equivalent, or highly relevant. This could prevent the posting of nearly-duplicate questions and provide the user with instantaneous answers. For the first time in cQA, we address the selection of relevant text (both at sentence- and at constituent-level) for parse tree-based representations. Our supervised models for text selection boost the performance of a tree kernel-based machine learning model, allowing it to overtake the current state of the art on a recently released cQA evaluation framework.
Tasks	Community Question Answering, Machine Translation, Question Answering
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1237/
PDF	https://www.aclweb.org/anthology/C16-1237
PWC	https://paperswithcode.com/paper/selecting-sentences-versus-selecting-tree
Repo
Framework