July 26, 2019

2192 words 11 mins read

Paper Group NANR 41

Paper Group NANR 41

Hashtag Processing for Enhanced Clustering of Tweets. ECNU at SemEval-2017 Task 5: An Ensemble of Regression Algorithms with Effective Features for Fine-Grained Sentiment Analysis in Financial Domain. Multiscale Semi-Markov Dynamics for Intracortical Brain-Computer Interfaces. TextFlow: A Text Similarity Measure based on Continuous Sequences. A Com …

Hashtag Processing for Enhanced Clustering of Tweets

Title Hashtag Processing for Enhanced Clustering of Tweets
Authors Dagmar Gromann, Thierry Declerck
Abstract Rich data provided by tweets have beenanalyzed, clustered, and explored in a variety of studies. Typically those studies focus on named entity recognition, entity linking, and entity disambiguation or clustering. Tweets and hashtags are generally analyzed on sentential or word level but not on a compositional level of concatenated words. We propose an approach for a closer analysis of compounds in hashtags, and in the long run also of other types of text sequences in tweets, in order to enhance the clustering of such text documents. Hashtags have been used before as primary topic indicators to cluster tweets, however, their segmentation and its effect on clustering results have not been investigated to the best of our knowledge. Our results with a standard dataset from the Text REtrieval Conference (TREC) show that segmented and harmonized hashtags positively impact effective clustering.
Tasks Entity Disambiguation, Entity Linking, Named Entity Recognition
Published 2017-09-01
URL https://www.aclweb.org/anthology/R17-1038/
PDF https://doi.org/10.26615/978-954-452-049-6_038
PWC https://paperswithcode.com/paper/hashtag-processing-for-enhanced-clustering-of
Repo
Framework

ECNU at SemEval-2017 Task 5: An Ensemble of Regression Algorithms with Effective Features for Fine-Grained Sentiment Analysis in Financial Domain

Title ECNU at SemEval-2017 Task 5: An Ensemble of Regression Algorithms with Effective Features for Fine-Grained Sentiment Analysis in Financial Domain
Authors Mengxiao Jiang, Man Lan, Yuanbin Wu
Abstract This paper describes our systems submitted to the Fine-Grained Sentiment Analysis on Financial Microblogs and News task (i.e., Task 5) in SemEval-2017. This task includes two subtasks in microblogs and news headline domain respectively. To settle this problem, we extract four types of effective features, including linguistic features, sentiment lexicon features, domain-specific features and word embedding features. Then we employ these features to construct models by using ensemble regression algorithms. Our submissions rank 1st and rank 5th in subtask 1 and subtask 2 respectively.
Tasks Feature Engineering, Sentiment Analysis
Published 2017-08-01
URL https://www.aclweb.org/anthology/S17-2152/
PDF https://www.aclweb.org/anthology/S17-2152
PWC https://paperswithcode.com/paper/ecnu-at-semeval-2017-task-5-an-ensemble-of
Repo
Framework

Multiscale Semi-Markov Dynamics for Intracortical Brain-Computer Interfaces

Title Multiscale Semi-Markov Dynamics for Intracortical Brain-Computer Interfaces
Authors Daniel Milstein, Jason Pacheco, Leigh Hochberg, John D. Simeral, Beata Jarosiewicz, Erik Sudderth
Abstract Intracortical brain-computer interfaces (iBCIs) have allowed people with tetraplegia to control a computer cursor by imagining the movement of their paralyzed arm or hand. State-of-the-art decoders deployed in human iBCIs are derived from a Kalman filter that assumes Markov dynamics on the angle of intended movement, and a unimodal dependence on intended angle for each channel of neural activity. Due to errors made in the decoding of noisy neural data, as a user attempts to move the cursor to a goal, the angle between cursor and goal positions may change rapidly. We propose a dynamic Bayesian network that includes the on-screen goal position as part of its latent state, and thus allows the person’s intended angle of movement to be aggregated over a much longer history of neural activity. This multiscale model explicitly captures the relationship between instantaneous angles of motion and long-term goals, and incorporates semi-Markov dynamics for motion trajectories. We also introduce a multimodal likelihood model for recordings of neural populations which can be rapidly calibrated for clinical applications. In offline experiments with recorded neural data, we demonstrate significantly improved prediction of motion directions compared to the Kalman filter. We derive an efficient online inference algorithm, enabling a clinical trial participant with tetraplegia to control a computer cursor with neural activity in real time. The observed kinematics of cursor movement are objectively straighter and smoother than prior iBCI decoding models without loss of responsiveness.
Tasks
Published 2017-12-01
URL http://papers.nips.cc/paper/6688-multiscale-semi-markov-dynamics-for-intracortical-brain-computer-interfaces
PDF http://papers.nips.cc/paper/6688-multiscale-semi-markov-dynamics-for-intracortical-brain-computer-interfaces.pdf
PWC https://paperswithcode.com/paper/multiscale-semi-markov-dynamics-for
Repo
Framework

TextFlow: A Text Similarity Measure based on Continuous Sequences

Title TextFlow: A Text Similarity Measure based on Continuous Sequences
Authors Yassine Mrabet, Halil Kilicoglu, Dina Demner-Fushman
Abstract Text similarity measures are used in multiple tasks such as plagiarism detection, information ranking and recognition of paraphrases and textual entailment. While recent advances in deep learning highlighted the relevance of sequential models in natural language generation, existing similarity measures do not fully exploit the sequential nature of language. Examples of such similarity measures include n-grams and skip-grams overlap which rely on distinct slices of the input texts. In this paper we present a novel text similarity measure inspired from a common representation in DNA sequence alignment algorithms. The new measure, called TextFlow, represents input text pairs as continuous curves and uses both the actual position of the words and sequence matching to compute the similarity value. Our experiments on 8 different datasets show very encouraging results in paraphrase detection, textual entailment recognition and ranking relevance.
Tasks Natural Language Inference, Text Generation
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-1071/
PDF https://www.aclweb.org/anthology/P17-1071
PWC https://paperswithcode.com/paper/textflow-a-text-similarity-measure-based-on
Repo
Framework

A Computational Analysis of the Language of Drug Addiction

Title A Computational Analysis of the Language of Drug Addiction
Authors Carlo Strapparava, Rada Mihalcea
Abstract We present a computational analysis of the language of drug users when talking about their drug experiences. We introduce a new dataset of over 4,000 descriptions of experiences reported by users of four main drug types, and show that we can predict with an F1-score of up to 88{%} the drug behind a certain experience. We also perform an analysis of the dominant psycholinguistic processes and dominant emotions associated with each drug type, which sheds light on the characteristics of drug users.
Tasks
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-2022/
PDF https://www.aclweb.org/anthology/E17-2022
PWC https://paperswithcode.com/paper/a-computational-analysis-of-the-language-of
Repo
Framework

Deriving Word Prosody from Orthography in Hindi

Title Deriving Word Prosody from Orthography in Hindi
Authors Somnath Roy
Abstract
Tasks
Published 2017-12-01
URL https://www.aclweb.org/anthology/W17-7502/
PDF https://www.aclweb.org/anthology/W17-7502
PWC https://paperswithcode.com/paper/deriving-word-prosody-from-orthography-in
Repo
Framework

Proceedings of the Workshop on New Frontiers in Summarization

Title Proceedings of the Workshop on New Frontiers in Summarization
Authors
Abstract
Tasks
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-4500/
PDF https://www.aclweb.org/anthology/W17-4500
PWC https://paperswithcode.com/paper/proceedings-of-the-workshop-on-new-frontiers
Repo
Framework

Obtaining referential word meanings from visual and distributional information: Experiments on object naming

Title Obtaining referential word meanings from visual and distributional information: Experiments on object naming
Authors Sina Zarrie{\ss}, David Schlangen
Abstract We investigate object naming, which is an important sub-task of referring expression generation on real-world images. As opposed to mutually exclusive labels used in object recognition, object names are more flexible, subject to communicative preferences and semantically related to each other. Therefore, we investigate models of referential word meaning that link visual to lexical information which we assume to be given through distributional word embeddings. We present a model that learns individual predictors for object names that link visual and distributional aspects of word meaning during training. We show that this is particularly beneficial for zero-shot learning, as compared to projecting visual objects directly into the distributional space. In a standard object naming task, we find that different ways of combining lexical and visual information achieve very similar performance, though experiments on model combination suggest that they capture complementary aspects of referential meaning.
Tasks Object Classification, Object Recognition, Word Embeddings, Zero-Shot Learning
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-1023/
PDF https://www.aclweb.org/anthology/P17-1023
PWC https://paperswithcode.com/paper/obtaining-referential-word-meanings-from
Repo
Framework

The AMU-UEdin Submission to the WMT 2017 Shared Task on Automatic Post-Editing

Title The AMU-UEdin Submission to the WMT 2017 Shared Task on Automatic Post-Editing
Authors Marcin Junczys-Dowmunt, Marcin Junczys-Dowmunt
Abstract
Tasks Automatic Post-Editing, Machine Translation
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-4774/
PDF https://www.aclweb.org/anthology/W17-4774
PWC https://paperswithcode.com/paper/the-amu-uedin-submission-to-the-wmt-2017
Repo
Framework

Continuous fluency tracking and the challenges of varying text complexity

Title Continuous fluency tracking and the challenges of varying text complexity
Authors Beata Beigman Klebanov, Anastassia Loukina, John Sabatini, Tenaha O{'}Reilly
Abstract This paper is a preliminary report on using text complexity measurement in the service of a new educational application. We describe a reading intervention where a child takes turns reading a book aloud with a virtual reading partner. Our ultimate goal is to provide meaningful feedback to the parent or the teacher by continuously tracking the child{'}s improvement in reading fluency. We show that this would not be a simple endeavor, due to an intricate relationship between text complexity from the point of view of comprehension and reading rate.
Tasks Reading Comprehension
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-5003/
PDF https://www.aclweb.org/anthology/W17-5003
PWC https://paperswithcode.com/paper/continuous-fluency-tracking-and-the
Repo
Framework

Multi-objective radiomics model for predicting distant failure in lung SBRT

Title Multi-objective radiomics model for predicting distant failure in lung SBRT
Authors Zhou Z1, Folkert M, Iyengar P, Westover K, Zhang Y, Choy H, Timmerman R, Jiang S, Wang J.
Abstract Stereotactic body radiation therapy (SBRT) has demonstrated high local control rates in early stage non-small cell lung cancer patients who are not ideal surgical candidates. However, distant failure after SBRT is still common. For patients at high risk of early distant failure after SBRT treatment, additional systemic therapy may reduce the risk of distant relapse and improve overall survival. Therefore, a strategy that can correctly stratify patients at high risk of failure is needed. The field of radiomics holds great potential in predicting treatment outcomes by using high-throughput extraction of quantitative imaging features. The construction of predictive models in radiomics is typically based on a single objective such as overall accuracy or the area under the curve (AUC). However, because of imbalanced positive and negative events in the training datasets, a single objective may not be ideal to guide model construction. To overcome these limitations, we propose a multi-objective radiomics model that simultaneously considers sensitivity and specificity as objective functions. To design a more accurate and reliable model, an iterative multi-objective immune algorithm (IMIA) was proposed to optimize these objective functions. The multi-objective radiomics model is more sensitive than the single-objective model, while maintaining the same levels of specificity and AUC. The IMIA performs better than the traditional immune-inspired multi-objective algorithm.
Tasks
Published 2017-06-07
URL https://www.ncbi.nlm.nih.gov/pubmed/28480871
PDF https://iopscience.iop.org/article/10.1088/1361-6560/aa6ae5/pdf
PWC https://paperswithcode.com/paper/multi-objective-radiomics-model-for
Repo
Framework

Finding Individual Word Sense Changes and their Delay in Appearance

Title Finding Individual Word Sense Changes and their Delay in Appearance
Authors Nina Tahmasebi, Thomas Risse
Abstract We present a method for detecting word sense changes by utilizing automatically induced word senses. Our method works on the level of individual senses and allows a word to have e.g. one stable sense and then add a novel sense that later experiences change. Senses are grouped based on polysemy to find linguistic concepts and we can find broadening and narrowing as well as novel (polysemous and homonymic) senses. We evaluate on a testset, present recall and estimates of the time between expected and found change.
Tasks Word Embeddings, Word Sense Induction
Published 2017-09-01
URL https://www.aclweb.org/anthology/R17-1095/
PDF https://doi.org/10.26615/978-954-452-049-6_095
PWC https://paperswithcode.com/paper/finding-individual-word-sense-changes-and
Repo
Framework

Learning bilingual word embeddings with (almost) no bilingual data

Title Learning bilingual word embeddings with (almost) no bilingual data
Authors Mikel Artetxe, Gorka Labaka, Eneko Agirre
Abstract Most methods to learn bilingual word embeddings rely on large parallel corpora, which is difficult to obtain for most language pairs. This has motivated an active research line to relax this requirement, with methods that use document-aligned corpora or bilingual dictionaries of a few thousand words instead. In this work, we further reduce the need of bilingual resources using a very simple self-learning approach that can be combined with any dictionary-based mapping technique. Our method exploits the structural similarity of embedding spaces, and works with as little bilingual evidence as a 25 word dictionary or even an automatically generated list of numerals, obtaining results comparable to those of systems that use richer resources.
Tasks Document Classification, Entity Linking, Machine Translation, Multilingual Word Embeddings, Part-Of-Speech Tagging, Transfer Learning, Word Embeddings
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-1042/
PDF https://www.aclweb.org/anthology/P17-1042
PWC https://paperswithcode.com/paper/learning-bilingual-word-embeddings-with
Repo
Framework

A Corpus of Natural Language for Visual Reasoning

Title A Corpus of Natural Language for Visual Reasoning
Authors Alane Suhr, Mike Lewis, James Yeh, Yoav Artzi
Abstract We present a new visual reasoning language dataset, containing 92,244 pairs of examples of natural statements grounded in synthetic images with 3,962 unique sentences. We describe a method of crowdsourcing linguistically-diverse data, and present an analysis of our data. The data demonstrates a broad set of linguistic phenomena, requiring visual and set-theoretic reasoning. We experiment with various models, and show the data presents a strong challenge for future research.
Tasks Question Answering, Visual Question Answering, Visual Reasoning
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-2034/
PDF https://www.aclweb.org/anthology/P17-2034
PWC https://paperswithcode.com/paper/a-corpus-of-natural-language-for-visual
Repo
Framework

A Learning Error Analysis for Structured Prediction with Approximate Inference

Title A Learning Error Analysis for Structured Prediction with Approximate Inference
Authors Yuanbin Wu, Man Lan, Shiliang Sun, Qi Zhang, Xuanjing Huang
Abstract In this work, we try to understand the differences between exact and approximate inference algorithms in structured prediction. We compare the estimation and approximation error of both underestimate and overestimate models. The result shows that, from the perspective of learning errors, performances of approximate inference could be as good as exact inference. The error analyses also suggest a new margin for existing learning algorithms. Empirical evaluations on text classification, sequential labelling and dependency parsing witness the success of approximate inference and the benefit of the proposed margin.
Tasks Dependency Parsing, Structured Prediction, Text Classification
Published 2017-12-01
URL http://papers.nips.cc/paper/7193-a-learning-error-analysis-for-structured-prediction-with-approximate-inference
PDF http://papers.nips.cc/paper/7193-a-learning-error-analysis-for-structured-prediction-with-approximate-inference.pdf
PWC https://paperswithcode.com/paper/a-learning-error-analysis-for-structured
Repo
Framework
comments powered by Disqus