July 26, 2019

2144 words 11 mins read

Paper Group NANR 78

Parsing Graphs with Regular Graph Grammars. Similarity-Aware Spectral Sparsification by Edge Filtering. Towards Universal Dependencies for Learner Chinese. Text Sentiment Analysis based on Fusion of Structural Information and Serialization Information. Multiword expressions and lexicalism: the view from LFG. IITP at SemEval-2017 Task 8 : A Supervis …

Parsing Graphs with Regular Graph Grammars


Title	Parsing Graphs with Regular Graph Grammars
Authors	Sorcha Gilroy, Adam Lopez, Sebastian Maneth
Abstract	Recently, several datasets have become available which represent natural language phenomena as graphs. Hyperedge Replacement Languages (HRL) have been the focus of much attention as a formalism to represent the graphs in these datasets. Chiang et al. (2013) prove that HRL graphs can be parsed in polynomial time with respect to the size of the input graph. We believe that HRL are more expressive than is necessary to represent semantic graphs and we propose the use of Regular Graph Languages (RGL; Courcelle 1991), which is a subfamily of HRL, as a possible alternative. We provide a top-down parsing algorithm for RGL that runs in time linear in the size of the input graph.
Tasks	Machine Translation
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-1024/
PDF	https://www.aclweb.org/anthology/S17-1024
PWC	https://paperswithcode.com/paper/parsing-graphs-with-regular-graph-grammars
Repo
Framework

Similarity-Aware Spectral Sparsification by Edge Filtering


Title	Similarity-Aware Spectral Sparsification by Edge Filtering
Authors	Zhuo Feng
Abstract	In recent years, spectral graph sparsification techniques that can compute ultra-sparse graph proxies have been extensively studied for accelerating various numerical and graph-related applications. Prior nearly-linear-time spectral sparsification methods first extract low-stretch spanning tree from the original graph to form the backbone of the sparsifier, and then recover small portions of spectrally-critical off-tree edges to the spanning tree to significantly improve the approximation quality. However, it is not clear how many off-tree edges should be recovered for achieving a desired spectral similarity level within the sparsifier. Motivated by recent graph signal processing techniques, this paper proposes a similarity-aware spectral graph sparsification framework that leverages efficient spectral off-tree edge embedding and filtering schemes to construct spectral sparsifiers with guaranteed spectral similarity (relative condition number) level. An iterative graph densification scheme is introduced to facilitate efficient and effective filtering of off-tree edges for highly ill-conditioned problems. The proposed method has been validated using various kinds of graphs obtained from public domain sparse matrix collections relevant to VLSI CAD, finite element analysis, as well as social and data networks frequently studied in many machine learning and data mining applications.
Tasks
Published	2017-11-14
URL	https://arxiv.org/abs/1711.05135
PDF	https://arxiv.org/abs/1711.05135
PWC	https://paperswithcode.com/paper/similarity-aware-spectral-sparsification-by
Repo
Framework

Towards Universal Dependencies for Learner Chinese


Title	Towards Universal Dependencies for Learner Chinese
Authors	John Lee, Herman Leung, Keying Li
Abstract
Tasks	Grammatical Error Correction
Published	2017-05-01
URL	https://www.aclweb.org/anthology/W17-0408/
PDF	https://www.aclweb.org/anthology/W17-0408
PWC	https://paperswithcode.com/paper/towards-universal-dependencies-for-learner
Repo
Framework

Text Sentiment Analysis based on Fusion of Structural Information and Serialization Information


Title	Text Sentiment Analysis based on Fusion of Structural Information and Serialization Information
Authors	Ling Gan, Houyu Gong
Abstract	Tree-structured Long Short-Term Memory (Tree-LSTM) has been proved to be an effective method in the sentiment analysis task. It extracts structural information on text, and uses Long Short-Term Memory (LSTM) cell to prevent gradient vanish. However, though combining the LSTM cell, it is still a kind of model that extracts the structural information and almost not extracts serialization information. In this paper, we propose three new models in order to combine those two kinds of information: the structural information generated by the Constituency Tree-LSTM and the serialization information generated by Long-Short Term Memory neural network. Our experiments show that combining those two kinds of information can give contributes to the performance of the sentiment analysis task compared with the single Constituency Tree-LSTM model and the LSTM model.
Tasks	Opinion Mining, Sentiment Analysis
Published	2017-11-01
URL	https://www.aclweb.org/anthology/I17-1034/
PDF	https://www.aclweb.org/anthology/I17-1034
PWC	https://paperswithcode.com/paper/text-sentiment-analysis-based-on-fusion-of
Repo
Framework

Multiword expressions and lexicalism: the view from LFG


Title	Multiword expressions and lexicalism: the view from LFG
Authors	Jamie Y. Findlay
Abstract	Multiword expressions (MWEs) pose a problem for lexicalist theories like Lexical Functional Grammar (LFG), since they are prima facie counterexamples to a strong form of the lexical integrity principle, which entails that a lexical item can only be realised as a single, syntactically atomic word. In this paper, I demonstrate some of the problems facing any strongly lexicalist account of MWEs, and argue that the lexical integrity principle must be weakened. I conclude by sketching a formalism which integrates a Tree Adjoining Grammar into the LFG architecture, taking advantage of this relaxation.
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1709/
PDF	https://www.aclweb.org/anthology/W17-1709
PWC	https://paperswithcode.com/paper/multiword-expressions-and-lexicalism-the-view
Repo
Framework

IITP at SemEval-2017 Task 8 : A Supervised Approach for Rumour Evaluation


Title	IITP at SemEval-2017 Task 8 : A Supervised Approach for Rumour Evaluation
Authors	Vikram Singh, Sunny Narayan, Md Shad Akhtar, Asif Ekbal, Pushpak Bhattacharyya
Abstract	This paper describes our system participation in the SemEval-2017 Task 8 {`}RumourEval: Determining rumour veracity and support for rumours{'}. The objective of this task was to predict the stance and veracity of the underlying rumour. We propose a supervised classification approach employing several lexical, content and twitter specific features for learning. Evaluation shows promising results for both the problems. \|
Tasks	Decision Making, Rumour Detection
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-2087/
PDF	https://www.aclweb.org/anthology/S17-2087
PWC	https://paperswithcode.com/paper/iitp-at-semeval-2017-task-8-a-supervised
Repo
Framework


Title	Deciphering Related Languages
Authors	Nima Pourdamghani, Kevin Knight
Abstract	We present a method for translating texts between close language pairs. The method does not require parallel data, and it does not require the languages to be written in the same script. We show results for six language pairs: Afrikaans/Dutch, Bosnian/Serbian, Danish/Swedish, Macedonian/Bulgarian, Malaysian/Indonesian, and Polish/Belorussian. We report BLEU scores showing our method to outperform others that do not use parallel data.
Tasks	Language Modelling, Machine Translation
Published	2017-09-01
URL	https://www.aclweb.org/anthology/D17-1266/
PDF	https://www.aclweb.org/anthology/D17-1266
PWC	https://paperswithcode.com/paper/deciphering-related-languages
Repo
Framework

Evaluative Language Beyond Bags of Words: Linguistic Insights and Computational Applications


Title	Evaluative Language Beyond Bags of Words: Linguistic Insights and Computational Applications
Authors	Farah Benamara, Maite Taboada, Yannick Mathieu
Abstract	The study of evaluation, affect, and subjectivity is a multidisciplinary enterprise, including sociology, psychology, economics, linguistics, and computer science. A number of excellent computational linguistics and linguistic surveys of the field exist. Most surveys, however, do not bring the two disciplines together to show how methods from linguistics can benefit computational sentiment analysis systems. In this survey, we show how incorporating linguistic insights, discourse information, and other contextual phenomena, in combination with the statistical exploitation of data, can result in an improvement over approaches that take advantage of only one of these perspectives. We first provide a comprehensive introduction to evaluative language from both a linguistic and computational perspective. We then argue that the standard computational definition of the concept of evaluative language neglects the dynamic nature of evaluation, in which the interpretation of a given evaluation depends on linguistic and extra-linguistic contextual factors. We thus propose a dynamic definition that incorporates update functions. The update functions allow for different contextual aspects to be incorporated into the calculation of sentiment for evaluative words or expressions, and can be applied at all levels of discourse. We explore each level and highlight which linguistic aspects contribute to accurate extraction of sentiment. We end the review by outlining what we believe the future directions of sentiment analysis are, and the role that discourse and contextual information need to play.
Tasks	Sentiment Analysis
Published	2017-04-01
URL	https://www.aclweb.org/anthology/J17-1006/
PDF	https://www.aclweb.org/anthology/J17-1006
PWC	https://paperswithcode.com/paper/evaluative-language-beyond-bags-of-words
Repo
Framework

Building and using language resources and infrastructure to develop e-learning programs for a minority language


Title	Building and using language resources and infrastructure to develop e-learning programs for a minority language
Authors	Heli Uibo, Jack Rueter, Sulev Iva
Abstract
Tasks	Language Acquisition, Speech Synthesis
Published	2017-05-01
URL	https://www.aclweb.org/anthology/W17-0307/
PDF	https://www.aclweb.org/anthology/W17-0307
PWC	https://paperswithcode.com/paper/building-and-using-language-resources-and
Repo
Framework


Title	A Network Framework for Noisy Label Aggregation in Social Media
Authors	Xueying Zhan, Yaowei Wang, Yanghui Rao, Haoran Xie, Qing Li, Fu Lee Wang, Tak-Lam Wong
Abstract	This paper focuses on the task of noisy label aggregation in social media, where users with different social or culture backgrounds may annotate invalid or malicious tags for documents. To aggregate noisy labels at a small cost, a network framework is proposed by calculating the matching degree of a document{'}s topics and the annotators{'} meta-data. Unlike using the back-propagation algorithm, a probabilistic inference approach is adopted to estimate network parameters. Finally, a new simulation method is designed for validating the effectiveness of the proposed framework in aggregating noisy labels.
Tasks	Image Classification, Learning-To-Rank, Topic Models
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-2077/
PDF	https://www.aclweb.org/anthology/P17-2077
PWC	https://paperswithcode.com/paper/a-network-framework-for-noisy-label
Repo
Framework

Cross-lingual Name Tagging and Linking for 282 Languages


Title	Cross-lingual Name Tagging and Linking for 282 Languages
Authors	Xiaoman Pan, Boliang Zhang, Jonathan May, Joel Nothman, Kevin Knight, Heng Ji
Abstract	The ambitious goal of this work is to develop a cross-lingual name tagging and linking framework for 282 languages that exist in Wikipedia. Given a document in any of these languages, our framework is able to identify name mentions, assign a coarse-grained or fine-grained type to each mention, and link it to an English Knowledge Base (KB) if it is linkable. We achieve this goal by performing a series of new KB mining methods: generating {``}silver-standard{''} annotations by transferring annotations from English to other languages through cross-lingual links and KB properties, refining annotations through self-training and topic selection, deriving language-specific morphology features from anchor links, and mining word translation pairs from cross-lingual links. Both name tagging and linking results for 282 languages are promising on Wikipedia data and on-Wikipedia data. \|
Tasks
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-1178/
PDF	https://www.aclweb.org/anthology/P17-1178
PWC	https://paperswithcode.com/paper/cross-lingual-name-tagging-and-linking-for
Repo
Framework

Improved Recognition and Normalisation of Polish Temporal Expressions


Title	Improved Recognition and Normalisation of Polish Temporal Expressions
Authors	Jan Koco{'n}, Micha{\l} Marci{'n}czuk
Abstract	In this article we present the result of the recent research in the recognition and normalisation of Polish temporal expressions. The temporal information extracted from the text plays major role in many information extraction systems, like question answering, event recognition or discourse analysis. We proposed a new method for the temporal expressions normalisation, called Cascade of Partial Rules. Here we describe results achieved by updated version of Liner2 machine learning system.
Tasks	Question Answering
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1051/
PDF	https://doi.org/10.26615/978-954-452-049-6_051
PWC	https://paperswithcode.com/paper/improved-recognition-and-normalisation-of
Repo
Framework

Word Sense Disambiguation with Recurrent Neural Networks


Title	Word Sense Disambiguation with Recurrent Neural Networks
Authors	Alex Popov, er
Abstract	This paper presents a neural network architecture for word sense disambiguation (WSD). The architecture employs recurrent neural layers and more specifically LSTM cells, in order to capture information about word order and to easily incorporate distributed word representations (embeddings) as features, without having to use a fixed window of text. The paper demonstrates that the architecture is able to compete with the most successful supervised systems for WSD and that there is an abundance of possible improvements to take it to the current state of the art. In addition, it explores briefly the potential of combining different types of embeddings as input features; it also discusses possible ways for generating {``}artificial corpora{''} from knowledge bases {–} for the purpose of producing training data and in relation to possible applications of embedding lemmas and word senses in the same space. \|
Tasks	Word Sense Disambiguation
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-2004/
PDF	https://doi.org/10.26615/issn.1314-9156.2017_004
PWC	https://paperswithcode.com/paper/word-sense-disambiguation-with-recurrent
Repo
Framework

Natural Language Processing Technologies for Document Profiling


Title	Natural Language Processing Technologies for Document Profiling
Authors	Antonio Guill{'e}n, Yoan Guti{'e}rrez, Rafael Mu{~n}oz
Abstract	Nowadays, search for documents on the Internet is becoming increasingly difficult. The reason is the amount of content published by users (articles, comments, blogs, reviews). How to facilitate that the users can find their required documents? What would be necessary to provide useful document meta-data for supporting search engines? In this article, we present a study of some Natural Language Processing (NLP) technologies that can be useful for facilitating the proper identification of documents according to the user needs. For this purpose, it is designed a document profile that will be able to represent semantic meta-data extracted from documents by using NLP technologies. The research is basically focused on the study of different NLP technologies in order to support the creation our novel document profile proposal from semantic perspectives.
Tasks
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1039/
PDF	https://doi.org/10.26615/978-954-452-049-6_039
PWC	https://paperswithcode.com/paper/natural-language-processing-technologies-for
Repo
Framework

Pyramid-based Summary Evaluation Using Abstract Meaning Representation


Title	Pyramid-based Summary Evaluation Using Abstract Meaning Representation
Authors	Josef Steinberger, Peter Krejzl, Tom{'a}{\v{s}} Brychc{'\i}n
Abstract	We propose a novel metric for evaluating summary content coverage. The evaluation framework follows the Pyramid approach to measure how many summarization content units, considered important by human annotators, are contained in an automatic summary. Our approach automatizes the evaluation process, which does not need any manual intervention on the evaluated summary side. Our approach compares abstract meaning representations of each content unit mention and each summary sentence. We found that the proposed metric complements well the widely-used ROUGE metrics.
Tasks	Graph Similarity
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1090/
PDF	https://doi.org/10.26615/978-954-452-049-6_090
PWC	https://paperswithcode.com/paper/pyramid-based-summary-evaluation-using
Repo
Framework