July 26, 2019

2192 words 11 mins read

Paper Group NANR 41

Hashtag Processing for Enhanced Clustering of Tweets. ECNU at SemEval-2017 Task 5: An Ensemble of Regression Algorithms with Effective Features for Fine-Grained Sentiment Analysis in Financial Domain. Multiscale Semi-Markov Dynamics for Intracortical Brain-Computer Interfaces. TextFlow: A Text Similarity Measure based on Continuous Sequences. A Com …

Hashtag Processing for Enhanced Clustering of Tweets


Title	Hashtag Processing for Enhanced Clustering of Tweets
Authors	Dagmar Gromann, Thierry Declerck
Abstract	Rich data provided by tweets have beenanalyzed, clustered, and explored in a variety of studies. Typically those studies focus on named entity recognition, entity linking, and entity disambiguation or clustering. Tweets and hashtags are generally analyzed on sentential or word level but not on a compositional level of concatenated words. We propose an approach for a closer analysis of compounds in hashtags, and in the long run also of other types of text sequences in tweets, in order to enhance the clustering of such text documents. Hashtags have been used before as primary topic indicators to cluster tweets, however, their segmentation and its effect on clustering results have not been investigated to the best of our knowledge. Our results with a standard dataset from the Text REtrieval Conference (TREC) show that segmented and harmonized hashtags positively impact effective clustering.
Tasks	Entity Disambiguation, Entity Linking, Named Entity Recognition
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1038/
PDF	https://doi.org/10.26615/978-954-452-049-6_038
PWC	https://paperswithcode.com/paper/hashtag-processing-for-enhanced-clustering-of
Repo
Framework

ECNU at SemEval-2017 Task 5: An Ensemble of Regression Algorithms with Effective Features for Fine-Grained Sentiment Analysis in Financial Domain


Title	ECNU at SemEval-2017 Task 5: An Ensemble of Regression Algorithms with Effective Features for Fine-Grained Sentiment Analysis in Financial Domain
Authors	Mengxiao Jiang, Man Lan, Yuanbin Wu
Abstract	This paper describes our systems submitted to the Fine-Grained Sentiment Analysis on Financial Microblogs and News task (i.e., Task 5) in SemEval-2017. This task includes two subtasks in microblogs and news headline domain respectively. To settle this problem, we extract four types of effective features, including linguistic features, sentiment lexicon features, domain-specific features and word embedding features. Then we employ these features to construct models by using ensemble regression algorithms. Our submissions rank 1st and rank 5th in subtask 1 and subtask 2 respectively.
Tasks	Feature Engineering, Sentiment Analysis
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-2152/
PDF	https://www.aclweb.org/anthology/S17-2152
PWC	https://paperswithcode.com/paper/ecnu-at-semeval-2017-task-5-an-ensemble-of
Repo
Framework

Multiscale Semi-Markov Dynamics for Intracortical Brain-Computer Interfaces


Title	Multiscale Semi-Markov Dynamics for Intracortical Brain-Computer Interfaces
Authors	Daniel Milstein, Jason Pacheco, Leigh Hochberg, John D. Simeral, Beata Jarosiewicz, Erik Sudderth
Abstract	Intracortical brain-computer interfaces (iBCIs) have allowed people with tetraplegia to control a computer cursor by imagining the movement of their paralyzed arm or hand. State-of-the-art decoders deployed in human iBCIs are derived from a Kalman filter that assumes Markov dynamics on the angle of intended movement, and a unimodal dependence on intended angle for each channel of neural activity. Due to errors made in the decoding of noisy neural data, as a user attempts to move the cursor to a goal, the angle between cursor and goal positions may change rapidly. We propose a dynamic Bayesian network that includes the on-screen goal position as part of its latent state, and thus allows the person’s intended angle of movement to be aggregated over a much longer history of neural activity. This multiscale model explicitly captures the relationship between instantaneous angles of motion and long-term goals, and incorporates semi-Markov dynamics for motion trajectories. We also introduce a multimodal likelihood model for recordings of neural populations which can be rapidly calibrated for clinical applications. In offline experiments with recorded neural data, we demonstrate significantly improved prediction of motion directions compared to the Kalman filter. We derive an efficient online inference algorithm, enabling a clinical trial participant with tetraplegia to control a computer cursor with neural activity in real time. The observed kinematics of cursor movement are objectively straighter and smoother than prior iBCI decoding models without loss of responsiveness.
Tasks
Published	2017-12-01
URL	http://papers.nips.cc/paper/6688-multiscale-semi-markov-dynamics-for-intracortical-brain-computer-interfaces
PDF	http://papers.nips.cc/paper/6688-multiscale-semi-markov-dynamics-for-intracortical-brain-computer-interfaces.pdf
PWC	https://paperswithcode.com/paper/multiscale-semi-markov-dynamics-for
Repo
Framework

TextFlow: A Text Similarity Measure based on Continuous Sequences


Title	TextFlow: A Text Similarity Measure based on Continuous Sequences
Authors	Yassine Mrabet, Halil Kilicoglu, Dina Demner-Fushman
Abstract	Text similarity measures are used in multiple tasks such as plagiarism detection, information ranking and recognition of paraphrases and textual entailment. While recent advances in deep learning highlighted the relevance of sequential models in natural language generation, existing similarity measures do not fully exploit the sequential nature of language. Examples of such similarity measures include n-grams and skip-grams overlap which rely on distinct slices of the input texts. In this paper we present a novel text similarity measure inspired from a common representation in DNA sequence alignment algorithms. The new measure, called TextFlow, represents input text pairs as continuous curves and uses both the actual position of the words and sequence matching to compute the similarity value. Our experiments on 8 different datasets show very encouraging results in paraphrase detection, textual entailment recognition and ranking relevance.
Tasks	Natural Language Inference, Text Generation
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-1071/
PDF	https://www.aclweb.org/anthology/P17-1071
PWC	https://paperswithcode.com/paper/textflow-a-text-similarity-measure-based-on
Repo
Framework

A Computational Analysis of the Language of Drug Addiction


Title	A Computational Analysis of the Language of Drug Addiction
Authors	Carlo Strapparava, Rada Mihalcea
Abstract	We present a computational analysis of the language of drug users when talking about their drug experiences. We introduce a new dataset of over 4,000 descriptions of experiences reported by users of four main drug types, and show that we can predict with an F1-score of up to 88{%} the drug behind a certain experience. We also perform an analysis of the dominant psycholinguistic processes and dominant emotions associated with each drug type, which sheds light on the characteristics of drug users.
Tasks
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-2022/
PDF	https://www.aclweb.org/anthology/E17-2022
PWC	https://paperswithcode.com/paper/a-computational-analysis-of-the-language-of
Repo
Framework

Deriving Word Prosody from Orthography in Hindi


Title	Deriving Word Prosody from Orthography in Hindi
Authors	Somnath Roy
Abstract
Tasks
Published	2017-12-01
URL	https://www.aclweb.org/anthology/W17-7502/
PDF	https://www.aclweb.org/anthology/W17-7502
PWC	https://paperswithcode.com/paper/deriving-word-prosody-from-orthography-in
Repo
Framework

Proceedings of the Workshop on New Frontiers in Summarization


Title	Proceedings of the Workshop on New Frontiers in Summarization
Authors
Abstract
Tasks
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-4500/
PDF	https://www.aclweb.org/anthology/W17-4500
PWC	https://paperswithcode.com/paper/proceedings-of-the-workshop-on-new-frontiers
Repo
Framework

Obtaining referential word meanings from visual and distributional information: Experiments on object naming


Title	Obtaining referential word meanings from visual and distributional information: Experiments on object naming
Authors	Sina Zarrie{\ss}, David Schlangen
Abstract	We investigate object naming, which is an important sub-task of referring expression generation on real-world images. As opposed to mutually exclusive labels used in object recognition, object names are more flexible, subject to communicative preferences and semantically related to each other. Therefore, we investigate models of referential word meaning that link visual to lexical information which we assume to be given through distributional word embeddings. We present a model that learns individual predictors for object names that link visual and distributional aspects of word meaning during training. We show that this is particularly beneficial for zero-shot learning, as compared to projecting visual objects directly into the distributional space. In a standard object naming task, we find that different ways of combining lexical and visual information achieve very similar performance, though experiments on model combination suggest that they capture complementary aspects of referential meaning.
Tasks	Object Classification, Object Recognition, Word Embeddings, Zero-Shot Learning
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-1023/
PDF	https://www.aclweb.org/anthology/P17-1023
PWC	https://paperswithcode.com/paper/obtaining-referential-word-meanings-from
Repo
Framework

The AMU-UEdin Submission to the WMT 2017 Shared Task on Automatic Post-Editing


Title	The AMU-UEdin Submission to the WMT 2017 Shared Task on Automatic Post-Editing
Authors	Marcin Junczys-Dowmunt, Marcin Junczys-Dowmunt
Abstract
Tasks	Automatic Post-Editing, Machine Translation
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-4774/
PDF	https://www.aclweb.org/anthology/W17-4774
PWC	https://paperswithcode.com/paper/the-amu-uedin-submission-to-the-wmt-2017
Repo
Framework

Continuous fluency tracking and the challenges of varying text complexity


Title	Continuous fluency tracking and the challenges of varying text complexity
Authors	Beata Beigman Klebanov, Anastassia Loukina, John Sabatini, Tenaha O{'}Reilly
Abstract	This paper is a preliminary report on using text complexity measurement in the service of a new educational application. We describe a reading intervention where a child takes turns reading a book aloud with a virtual reading partner. Our ultimate goal is to provide meaningful feedback to the parent or the teacher by continuously tracking the child{'}s improvement in reading fluency. We show that this would not be a simple endeavor, due to an intricate relationship between text complexity from the point of view of comprehension and reading rate.
Tasks	Reading Comprehension
Published	2017-09-01
URL	https://www.aclweb.org/anthology/W17-5003/
PDF	https://www.aclweb.org/anthology/W17-5003
PWC	https://paperswithcode.com/paper/continuous-fluency-tracking-and-the
Repo
Framework

Multi-objective radiomics model for predicting distant failure in lung SBRT


Title	Multi-objective radiomics model for predicting distant failure in lung SBRT
Authors	Zhou Z1, Folkert M, Iyengar P, Westover K, Zhang Y, Choy H, Timmerman R, Jiang S, Wang J.
Abstract	Stereotactic body radiation therapy (SBRT) has demonstrated high local control rates in early stage non-small cell lung cancer patients who are not ideal surgical candidates. However, distant failure after SBRT is still common. For patients at high risk of early distant failure after SBRT treatment, additional systemic therapy may reduce the risk of distant relapse and improve overall survival. Therefore, a strategy that can correctly stratify patients at high risk of failure is needed. The field of radiomics holds great potential in predicting treatment outcomes by using high-throughput extraction of quantitative imaging features. The construction of predictive models in radiomics is typically based on a single objective such as overall accuracy or the area under the curve (AUC). However, because of imbalanced positive and negative events in the training datasets, a single objective may not be ideal to guide model construction. To overcome these limitations, we propose a multi-objective radiomics model that simultaneously considers sensitivity and specificity as objective functions. To design a more accurate and reliable model, an iterative multi-objective immune algorithm (IMIA) was proposed to optimize these objective functions. The multi-objective radiomics model is more sensitive than the single-objective model, while maintaining the same levels of specificity and AUC. The IMIA performs better than the traditional immune-inspired multi-objective algorithm.
Tasks
Published	2017-06-07
URL	https://www.ncbi.nlm.nih.gov/pubmed/28480871
PDF	https://iopscience.iop.org/article/10.1088/1361-6560/aa6ae5/pdf
PWC	https://paperswithcode.com/paper/multi-objective-radiomics-model-for
Repo
Framework

Finding Individual Word Sense Changes and their Delay in Appearance


Title	Finding Individual Word Sense Changes and their Delay in Appearance
Authors	Nina Tahmasebi, Thomas Risse
Abstract	We present a method for detecting word sense changes by utilizing automatically induced word senses. Our method works on the level of individual senses and allows a word to have e.g. one stable sense and then add a novel sense that later experiences change. Senses are grouped based on polysemy to find linguistic concepts and we can find broadening and narrowing as well as novel (polysemous and homonymic) senses. We evaluate on a testset, present recall and estimates of the time between expected and found change.
Tasks	Word Embeddings, Word Sense Induction
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1095/
PDF	https://doi.org/10.26615/978-954-452-049-6_095
PWC	https://paperswithcode.com/paper/finding-individual-word-sense-changes-and
Repo
Framework

Learning bilingual word embeddings with (almost) no bilingual data


Title	Learning bilingual word embeddings with (almost) no bilingual data
Authors	Mikel Artetxe, Gorka Labaka, Eneko Agirre
Abstract	Most methods to learn bilingual word embeddings rely on large parallel corpora, which is difficult to obtain for most language pairs. This has motivated an active research line to relax this requirement, with methods that use document-aligned corpora or bilingual dictionaries of a few thousand words instead. In this work, we further reduce the need of bilingual resources using a very simple self-learning approach that can be combined with any dictionary-based mapping technique. Our method exploits the structural similarity of embedding spaces, and works with as little bilingual evidence as a 25 word dictionary or even an automatically generated list of numerals, obtaining results comparable to those of systems that use richer resources.
Tasks	Document Classification, Entity Linking, Machine Translation, Multilingual Word Embeddings, Part-Of-Speech Tagging, Transfer Learning, Word Embeddings
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-1042/
PDF	https://www.aclweb.org/anthology/P17-1042
PWC	https://paperswithcode.com/paper/learning-bilingual-word-embeddings-with
Repo
Framework

A Corpus of Natural Language for Visual Reasoning


Title	A Corpus of Natural Language for Visual Reasoning
Authors	Alane Suhr, Mike Lewis, James Yeh, Yoav Artzi
Abstract	We present a new visual reasoning language dataset, containing 92,244 pairs of examples of natural statements grounded in synthetic images with 3,962 unique sentences. We describe a method of crowdsourcing linguistically-diverse data, and present an analysis of our data. The data demonstrates a broad set of linguistic phenomena, requiring visual and set-theoretic reasoning. We experiment with various models, and show the data presents a strong challenge for future research.
Tasks	Question Answering, Visual Question Answering, Visual Reasoning
Published	2017-07-01
URL	https://www.aclweb.org/anthology/P17-2034/
PDF	https://www.aclweb.org/anthology/P17-2034
PWC	https://paperswithcode.com/paper/a-corpus-of-natural-language-for-visual
Repo
Framework

A Learning Error Analysis for Structured Prediction with Approximate Inference


Title	A Learning Error Analysis for Structured Prediction with Approximate Inference
Authors	Yuanbin Wu, Man Lan, Shiliang Sun, Qi Zhang, Xuanjing Huang
Abstract	In this work, we try to understand the differences between exact and approximate inference algorithms in structured prediction. We compare the estimation and approximation error of both underestimate and overestimate models. The result shows that, from the perspective of learning errors, performances of approximate inference could be as good as exact inference. The error analyses also suggest a new margin for existing learning algorithms. Empirical evaluations on text classification, sequential labelling and dependency parsing witness the success of approximate inference and the benefit of the proposed margin.
Tasks	Dependency Parsing, Structured Prediction, Text Classification
Published	2017-12-01
URL	http://papers.nips.cc/paper/7193-a-learning-error-analysis-for-structured-prediction-with-approximate-inference
PDF	http://papers.nips.cc/paper/7193-a-learning-error-analysis-for-structured-prediction-with-approximate-inference.pdf
PWC	https://paperswithcode.com/paper/a-learning-error-analysis-for-structured
Repo
Framework