May 4, 2019

1726 words 9 mins read

Paper Group NANR 213

Paper Group NANR 213

Selecting Domain-Specific Concepts for Question Generation With Lightly-Supervised Methods. Are Cohesive Features Relevant for Text Readability Evaluation?. Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus. Measuring the Information Content of Financial News. Explicit Argument Identification for Discourse Parsing In Hindi: A H …

Selecting Domain-Specific Concepts for Question Generation With Lightly-Supervised Methods

Title Selecting Domain-Specific Concepts for Question Generation With Lightly-Supervised Methods
Authors Yiping Jin, Phu Le
Abstract
Tasks Open Information Extraction, Question Generation, Reading Comprehension, Text Generation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-6623/
PDF https://www.aclweb.org/anthology/W16-6623
PWC https://paperswithcode.com/paper/selecting-domain-specific-concepts-for
Repo
Framework

Are Cohesive Features Relevant for Text Readability Evaluation?

Title Are Cohesive Features Relevant for Text Readability Evaluation?
Authors Amalia Todirascu, Thomas Fran{\c{c}}ois, Delphine Bernhard, N{'u}ria Gala, Anne-Laure Ligozat
Abstract This paper investigates the effectiveness of 65 cohesion-based variables that are commonly used in the literature as predictive features to assess text readability. We evaluate the efficiency of these variables across narrative and informative texts intended for an audience of L2 French learners. In our experiments, we use a French corpus that has been both manually and automatically annotated as regards to co-reference and anaphoric chains. The efficiency of the 65 variables for readability is analyzed through a correlational analysis and some modelling experiments.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1094/
PDF https://www.aclweb.org/anthology/C16-1094
PWC https://paperswithcode.com/paper/are-cohesive-features-relevant-for-text
Repo
Framework

Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus

Title Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus
Authors Andy Luecking, Alex Mehler, er, D{'e}sir{'e}e Walther, Marcel Mauri, Dennis Kurf{"u}rst
Abstract The Frankfurt Image GestURE corpus (FIGURE) is introduced. The corpus data is collected in an experimental setting where 50 naive participants spontaneously produced gestures in response to five to six terms from a total of 27 stimulus terms. The stimulus terms have been compiled mainly from image schemata from psycholinguistics, since such schemata provide a panoply of abstract contents derived from natural language use. The gestures have been annotated for kinetic features. FIGURE aims at finding (sets of) stable kinetic feature configurations associated with the stimulus terms. Given such configurations, they can be used for designing HCI gestures that go beyond pre-defined gesture vocabularies or touchpad gestures. It is found, for instance, that movement trajectories are far more informative than handshapes, speaking against purely handshape-based HCI vocabularies. Furthermore, the mean temporal duration of hand and arm movements associated vary with the stimulus terms, indicating a dynamic dimension not covered by vocabulary-based approaches. Descriptive results are presented and related to findings from gesture studies and natural language dialogue.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1227/
PDF https://www.aclweb.org/anthology/L16-1227
PWC https://paperswithcode.com/paper/finding-recurrent-features-of-image-schema
Repo
Framework

Measuring the Information Content of Financial News

Title Measuring the Information Content of Financial News
Authors Ching-Yun Chang, Yue Zhang, Zhiyang Teng, Zahn Bozanic, Bin Ke
Abstract Measuring the information content of news text is useful for decision makers in their investments since news information can influence the intrinsic values of companies. We propose a model to automatically measure the information content given news text, trained using news and corresponding cumulative abnormal returns of listed companies. Existing methods in finance literature exploit sentiment signal features, which are limited by not considering factors such as events. We address this issue by leveraging deep neural models to extract rich semantic features from news text. In particular, a novel tree-structured LSTM is used to find target-specific representations of news text given syntax structures. Empirical results show that the neural models can outperform sentiment-based models, demonstrating the effectiveness of recent NLP technology advances for computational finance.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1303/
PDF https://www.aclweb.org/anthology/C16-1303
PWC https://paperswithcode.com/paper/measuring-the-information-content-of
Repo
Framework

Explicit Argument Identification for Discourse Parsing In Hindi: A Hybrid Pipeline

Title Explicit Argument Identification for Discourse Parsing In Hindi: A Hybrid Pipeline
Authors Rohit Jain, Dipti Sharma
Abstract
Tasks Question Answering, Text Summarization
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-2010/
PDF https://www.aclweb.org/anthology/N16-2010
PWC https://paperswithcode.com/paper/explicit-argument-identification-for
Repo
Framework

Kathaa : NLP Systems as Edge-Labeled Directed Acyclic MultiGraphs

Title Kathaa : NLP Systems as Edge-Labeled Directed Acyclic MultiGraphs
Authors Sharada Mohanty, Nehal J Wani, Manish Srivastava, Dipti Sharma
Abstract We present Kathaa, an Open Source web-based Visual Programming Framework for Natural Language Processing (NLP) Systems. Kathaa supports the design, execution and analysis of complex NLP systems by visually connecting NLP components from an easily extensible Module Library. It models NLP systems an edge-labeled Directed Acyclic MultiGraph, and lets the user use publicly co-created modules in their own NLP applications irrespective of their technical proficiency in Natural Language Processing. Kathaa exposes an intuitive web based Interface for the users to interact with and modify complex NLP Systems; and a precise Module definition API to allow easy integration of new state of the art NLP components. Kathaa enables researchers to publish their services in a standardized format to enable the masses to use their services out of the box. The vision of this work is to pave the way for a system like Kathaa, to be the Lego blocks of NLP Research and Applications. As a practical use case we use Kathaa to visually implement the Sampark Hindi-Panjabi Machine Translation Pipeline and the Sampark Hindi-Urdu Machine Translation Pipeline, to demonstrate the fact that Kathaa can handle really complex NLP systems while still being intuitive for the end user.
Tasks Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-5201/
PDF https://www.aclweb.org/anthology/W16-5201
PWC https://paperswithcode.com/paper/kathaa-nlp-systems-as-edge-labeled-directed
Repo
Framework

Learning Supervised PageRank with Gradient-Based and Gradient-Free Optimization Methods

Title Learning Supervised PageRank with Gradient-Based and Gradient-Free Optimization Methods
Authors Lev Bogolubsky, Pavel Dvurechenskii, Alexander Gasnikov, Gleb Gusev, Yurii Nesterov, Andrei M. Raigorodskii, Aleksey Tikhonov, Maksim Zhukovskii
Abstract In this paper, we consider a non-convex loss-minimization problem of learning Supervised PageRank models, which can account for features of nodes and edges. We propose gradient-based and random gradient-free methods to solve this problem. Our algorithms are based on the concept of an inexact oracle and unlike the state-of-the-art gradient-based method we manage to provide theoretically the convergence rate guarantees for both of them. Finally, we compare the performance of the proposed optimization methods with the state of the art applied to a ranking task.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6565-learning-supervised-pagerank-with-gradient-based-and-gradient-free-optimization-methods
PDF http://papers.nips.cc/paper/6565-learning-supervised-pagerank-with-gradient-based-and-gradient-free-optimization-methods.pdf
PWC https://paperswithcode.com/paper/learning-supervised-pagerank-with-gradient
Repo
Framework

Automatic Construction of Large Readability Corpora

Title Automatic Construction of Large Readability Corpora
Authors Jorge Alberto Wagner Filho, Rodrigo Wilkens, Aline Villavicencio
Abstract This work presents a framework for the automatic construction of large Web corpora classified by readability level. We compare different Machine Learning classifiers for the task of readability assessment focusing on Portuguese and English texts, analysing the impact of variables like the feature inventory used in the resulting corpus. In a comparison between shallow and deeper features, the former already produce F-measures of over 0.75 for Portuguese texts, but the use of additional features results in even better results, in most cases. For English, shallow features also perform well as do classic readability formulas. Comparing different classifiers for the task, logistic regression obtained, in general, the best results, but with considerable differences between the results for two and those for three-classes, especially regarding the intermediary class. Given the large scale of the resulting corpus, for evaluation we adopt the agreement between different classifiers as an indication of readability assessment certainty. As a result of this work, a large corpus for Brazilian Portuguese was built, including 1.7 million documents and about 1.6 billion tokens, already parsed and annotated with 134 different textual attributes, along with the agreement among the various classifiers.
Tasks Text Classification, Text Simplification
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4119/
PDF https://www.aclweb.org/anthology/W16-4119
PWC https://paperswithcode.com/paper/automatic-construction-of-large-readability
Repo
Framework

Towards assessing depth of argumentation

Title Towards assessing depth of argumentation
Authors Manfred Stede
Abstract For analyzing argumentative text, we propose to study the {`}depth{'} of argumentation as one important component, which we distinguish from argument quality. In a pilot study with German newspaper commentary texts, we asked students to rate the degree of argumentativeness, and then looked for correlations with features of the annotated argumentation structure and the rhetorical structure (in terms of RST). The results indicate that the human judgements correlate with our operationalization of depth and with certain structural features of RST trees. |
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1312/
PDF https://www.aclweb.org/anthology/C16-1312
PWC https://paperswithcode.com/paper/towards-assessing-depth-of-argumentation
Repo
Framework

Machine Comprehension using Rich Semantic Representations

Title Machine Comprehension using Rich Semantic Representations
Authors Mrinmaya Sachan, Eric Xing
Abstract
Tasks Natural Language Inference, Reading Comprehension
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-2079/
PDF https://www.aclweb.org/anthology/P16-2079
PWC https://paperswithcode.com/paper/machine-comprehension-using-rich-semantic
Repo
Framework

Aligning Users Across Social Networks Using Network Embedding

Title Aligning Users Across Social Networks Using Network Embedding
Authors Li Liu, William K. Cheung, Xin Li, Lejian Liao
Abstract Li Liu,1 William K. Cheung,2 Xin Li,1⇤ and Lejian Liao1
Tasks Network Embedding
Published 2016-06-09
URL https://dl.acm.org/citation.cfm?id=3060869
PDF https://www.ijcai.org/Proceedings/16/Papers/254.pdf
PWC https://paperswithcode.com/paper/aligning-users-across-social-networks-using
Repo
Framework

Corpus Fusion for Emotion Classification

Title Corpus Fusion for Emotion Classification
Authors Suyang Zhu, Shoushan Li, Ying Chen, Guodong Zhou
Abstract Machine learning-based methods have obtained great progress on emotion classification. However, in most previous studies, the models are learned based on a single corpus which often suffers from insufficient labeled data. In this paper, we propose a corpus fusion approach to address emotion classification across two corpora which use different emotion taxonomies. The objective of this approach is to utilize the annotated data from one corpus to help the emotion classification on another corpus. An Integer Linear Programming (ILP) optimization is proposed to refine the classification results. Empirical studies show the effectiveness of the proposed approach to corpus fusion for emotion classification.
Tasks Emotion Classification
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1310/
PDF https://www.aclweb.org/anthology/C16-1310
PWC https://paperswithcode.com/paper/corpus-fusion-for-emotion-classification
Repo
Framework

Results of the WNUT16 Named Entity Recognition Shared Task

Title Results of the WNUT16 Named Entity Recognition Shared Task
Authors Benjamin Strauss, Bethany Toma, Alan Ritter, Marie-Catherine de Marneffe, Wei Xu
Abstract This paper presents the results of the Twitter Named Entity Recognition shared task associated with W-NUT 2016: a named entity tagging task with 10 teams participating. We outline the shared task, annotation process and dataset statistics, and provide a high-level overview of the participating systems for each shared task.
Tasks Named Entity Recognition
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3919/
PDF https://www.aclweb.org/anthology/W16-3919
PWC https://paperswithcode.com/paper/results-of-the-wnut16-named-entity
Repo
Framework

Proceedings of the 5th Workshop on Vision and Language

Title Proceedings of the 5th Workshop on Vision and Language
Authors
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-3200/
PDF https://www.aclweb.org/anthology/W16-3200
PWC https://paperswithcode.com/paper/proceedings-of-the-5th-workshop-on-vision-and
Repo
Framework

Proceedings of the 13th International Conference on Natural Language Processing

Title Proceedings of the 13th International Conference on Natural Language Processing
Authors
Abstract
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-6300/
PDF https://www.aclweb.org/anthology/W16-6300
PWC https://paperswithcode.com/paper/proceedings-of-the-13th-international-2
Repo
Framework
comments powered by Disqus