July 26, 2019

Paper Group NANR 23

KnowYourNyms? A Game of Semantic Relationships

Title KnowYourNyms? A Game of Semantic Relationships
Authors Ross Mechanic, Dean Fulgoni, Hannah Cutler, Sneha Rajana, Zheyuan Liu, Bradley Jackson, Anne Cocos, Chris Callison-Burch, Marianna Apidianaki
Abstract Semantic relation knowledge is crucial for natural language understanding. We introduce “KnowYourNyms?”, a web-based game for learning semantic relations. While providing users with an engaging experience, the application collects large amounts of data that can be used to improve semantic relation classifiers. The data also broadly informs us of how people perceive the relationships between words, providing useful insights for research in psychology and linguistics.
Tasks Text Classification
Published 2017-09-01
URL https://www.aclweb.org/anthology/D17-2007/
PDF https://www.aclweb.org/anthology/D17-2007
PWC https://paperswithcode.com/paper/knowyournyms-a-game-of-semantic-relationships
Repo
Framework

Stacked Sentence-Document Classifier Approach for Improving Native Language Identification

Title Stacked Sentence-Document Classifier Approach for Improving Native Language Identification
Authors Andrea Cimino, Felice Dell'Orletta
Abstract In this paper, we describe the approach of the ItaliaNLP Lab team to native language identification and discuss the results we submitted as participants to the essay track of NLI Shared Task 2017. We introduce for the first time a 2-stacked sentence-document architecture for native language identification that is able to exploit both local sentence information and a wide set of general-purpose features qualifying the lexical and grammatical structure of the whole document. When evaluated on the official test set, our sentence-document stacked architecture obtained the best result among all the participants of the essay track with an F1 score of 0.8818.
Tasks Document Classification, Language Identification, Native Language Identification, Sentence Classification, Text Classification
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-5049/
PDF https://www.aclweb.org/anthology/W17-5049
PWC https://paperswithcode.com/paper/stacked-sentence-document-classifier-approach
Repo
Framework

CVBed: Structuring CVs using Word Embeddings

Title CVBed: Structuring CVs using Word Embeddings
Authors Shweta Garg, Sudhanshu S Singh, Abhijit Mishra, Kuntal Dey
Abstract Automatic analysis of curriculum vitae (CVs) of applicants is of tremendous importance in recruitment scenarios. The semi-structuredness of CVs, however, makes CV processing a challenging task. We propose a solution towards transforming CVs to follow a unified structure, thereby, paving ways for smoother CV analysis. The problem of restructuring is posed as a section relabeling problem, where each section of a given CV gets reassigned to a predefined label. Our relabeling method relies on semantic relatedness computed between section header, content and labels, based on phrase-embeddings learned from a large pool of CVs. We follow different heuristics to measure semantic relatedness. Our best heuristic achieves an F-score of 93.17% on a test dataset with gold-standard labels obtained using manual annotation.
Tasks Word Embeddings
Published 2017-11-01
URL https://www.aclweb.org/anthology/I17-2059/
PDF https://www.aclweb.org/anthology/I17-2059
PWC https://paperswithcode.com/paper/cvbed-structuring-cvs-usingword-embeddings
Repo
Framework
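
The relabeling idea in the CVBed abstract (assign each CV section to the predefined label it is most semantically related to) can be illustrated with a minimal sketch. This is not the authors' method: the three-dimensional vectors, the label set, and the function names are hypothetical stand-ins for phrase embeddings learned from a CV corpus, and plain cosine similarity stands in for whichever relatedness heuristics the paper actually uses.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def relabel_section(section_vec, label_vecs):
    """Return the label whose embedding is most similar to the section embedding."""
    return max(label_vecs, key=lambda label: cosine_similarity(section_vec, label_vecs[label]))


# Hypothetical toy embeddings; a real system would learn these from a pool of CVs.
LABEL_VECS = {
    "education": [0.9, 0.1, 0.0],
    "experience": [0.1, 0.9, 0.1],
    "skills": [0.0, 0.2, 0.9],
}

# Hypothetical embedding of a section header such as "Academic Background".
header_vec = [0.8, 0.2, 0.1]

print(relabel_section(header_vec, LABEL_VECS))
```

In this toy setup the header vector points mostly along the first dimension, so it is reassigned to the "education" label.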

BuzzSaw at SemEval-2017 Task 7: Global vs. Local Context for Interpreting and Locating Homographic English Puns with Sense Embeddings

Title BuzzSaw at SemEval-2017 Task 7: Global vs. Local Context for Interpreting and Locating Homographic English Puns with Sense Embeddings
Authors Dieke Oele, Kilian Evang
Abstract This paper describes our system participating in the SemEval-2017 Task 7, for the subtasks of homographic pun location and homographic pun interpretation. For pun interpretation, we use a knowledge-based Word Sense Disambiguation (WSD) method based on sense embeddings. Pun-based jokes can be divided into two parts, each containing information about the two distinct senses of the pun. To exploit this structure we split the context that is input to the WSD system into two local contexts and find the best sense for each of them. We use the output of pun interpretation for pun location. As we expect the two meanings of a pun to be very dissimilar, we compute sense embedding cosine distances for each sense-pair and select the word that has the highest distance. We describe experiments on different methods of splitting the context and compare our method to several baselines. We find evidence supporting our hypotheses and obtain competitive results for pun interpretation.
Tasks Word Sense Disambiguation
Published 2017-08-01
URL https://www.aclweb.org/anthology/S17-2076/
PDF https://www.aclweb.org/anthology/S17-2076
PWC https://paperswithcode.com/paper/buzzsaw-at-semeval-2017-task-7-global-vs
Repo
Framework
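
The pun location step described in the BuzzSaw abstract (compute a cosine distance for each word's sense pair and pick the word with the highest distance) can be sketched as follows. This is an illustrative sketch only, not the authors' system: the two-dimensional sense vectors, the example words, and the function names are hypothetical, and the sense pairs are assumed to have already been chosen by an earlier disambiguation step.

```python
import math


def cosine_distance(u, v):
    """Cosine distance (1 - cosine similarity); assumes both vectors are nonzero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm


def locate_pun(sense_pairs):
    """Return the word whose two selected senses are farthest apart."""
    return max(sense_pairs, key=lambda word: cosine_distance(*sense_pairs[word]))


# Hypothetical toy data: for each candidate word, the embeddings of the sense
# chosen in the first local context and the sense chosen in the second.
SENSE_PAIRS = {
    "bank": ([1.0, 0.0], [0.0, 1.0]),   # two unrelated senses
    "river": ([1.0, 0.0], [0.9, 0.1]),  # nearly the same sense twice
    "money": ([0.0, 1.0], [0.1, 0.9]),  # nearly the same sense twice
}

print(locate_pun(SENSE_PAIRS))
```

Here "bank" is selected because its two sense vectors are orthogonal, while the other words receive almost identical senses in both contexts.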

Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language

Title Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language
Authors
Abstract
Tasks
Published 2017-05-01
URL https://www.aclweb.org/anthology/W17-0500/
PDF https://www.aclweb.org/anthology/W17-0500
PWC https://paperswithcode.com/paper/proceedings-of-the-nodalida-2017-workshop-on-1
Repo
Framework

Language Generation from DB Query

Title Language Generation from DB Query
Authors Kristina Kocijan, Božo Bekavac, Krešimir Šojat
Abstract
Tasks Machine Translation, Text Generation
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-3804/
PDF https://www.aclweb.org/anthology/W17-3804
PWC https://paperswithcode.com/paper/language-generation-from-db-query
Repo
Framework

TOTEMSS: Topic-based, Temporal Sentiment Summarisation for Twitter

Title TOTEMSS: Topic-based, Temporal Sentiment Summarisation for Twitter
Authors Bo Wang, Maria Liakata, Adam Tsakalidis, Spiros Georgakopoulos Kolaitis, Symeon Papadopoulos, Lazaros Apostolidis, Arkaitz Zubiaga, Rob Procter, Yiannis Kompatsiaris
Abstract We present a system for time sensitive, topic based summarisation of the sentiment around target entities and topics in collections of tweets. We describe the main elements of the system and illustrate its functionality with two examples of sentiment analysis of topics related to the 2017 UK general election.
Tasks Opinion Mining, Sentiment Analysis
Published 2017-11-01
URL https://www.aclweb.org/anthology/I17-3006/
PDF https://www.aclweb.org/anthology/I17-3006
PWC https://paperswithcode.com/paper/totemss-topic-based-temporal-sentiment
Repo
Framework

Detecting Good Arguments in a Non-Topic-Specific Way: An Oxymoron?

Title Detecting Good Arguments in a Non-Topic-Specific Way: An Oxymoron?
Authors Beata Beigman Klebanov, Binod Gyawali, Yi Song
Abstract Automatic identification of good arguments on a controversial topic has applications in civics and education, to name a few. While in the civics context it might be acceptable to create separate models for each topic, in the context of scoring of students' writing there is a preference for a single model that applies to all responses. Given that good arguments for one topic are likely to be irrelevant for another, is a single model for detecting good arguments a contradiction in terms? We investigate the extent to which it is possible to close the performance gap between topic-specific and across-topics models for identification of good arguments.
Tasks
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-2038/
PDF https://www.aclweb.org/anthology/P17-2038
PWC https://paperswithcode.com/paper/detecting-good-arguments-in-a-non-topic
Repo
Framework

Using Stanford Part-of-Speech Tagger for the Morphologically-rich Filipino Language

Title Using Stanford Part-of-Speech Tagger for the Morphologically-rich Filipino Language
Authors Matthew Phillip Go, Nicco Nocon
Abstract
Tasks Word Sense Disambiguation
Published 2017-11-01
URL https://www.aclweb.org/anthology/Y17-1014/
PDF https://www.aclweb.org/anthology/Y17-1014
PWC https://paperswithcode.com/paper/using-stanford-part-of-speech-tagger-for-the
Repo
Framework

Forecasting Consumer Spending from Purchase Intentions Expressed on Social Media

Title Forecasting Consumer Spending from Purchase Intentions Expressed on Social Media
Authors Viktor Pekar, Jane Binner
Abstract Consumer spending is an important macroeconomic indicator that is used by policy-makers to judge the health of an economy. In this paper we present a novel method for predicting future consumer spending from social media data. In contrast to previous work that largely relied on sentiment analysis, the proposed method models consumer spending from purchase intentions found on social media. Our experiments with time series analysis models and machine-learning regression models reveal utility of this data for making short-term forecasts of consumer spending: for three- and seven-day horizons, prediction variables derived from social media help to improve forecast accuracy by 11% to 18% for all the three models, in comparison to models that used only autoregressive predictors.
Tasks Sentiment Analysis, Time Series, Time Series Analysis
Published 2017-09-01
URL https://www.aclweb.org/anthology/W17-5212/
PDF https://www.aclweb.org/anthology/W17-5212
PWC https://paperswithcode.com/paper/forecasting-consumer-spending-from-purchase
Repo
Framework

Event extraction from Twitter using Non-Parametric Bayesian Mixture Model with Word Embeddings

Title Event extraction from Twitter using Non-Parametric Bayesian Mixture Model with Word Embeddings
Authors Deyu Zhou, Xuan Zhang, Yulan He
Abstract To extract structured representations of newsworthy events from Twitter, unsupervised models typically assume that tweets involving the same named entities and expressed using similar words are likely to belong to the same event. Hence, they group tweets into clusters based on the co-occurrence patterns of named entities and topical keywords. However, there are two main limitations. First, they require the number of events to be known beforehand, which is not realistic in practical applications. Second, they don't recognise that the same named entity might be referred to by multiple mentions and tweets using different mentions would be wrongly assigned to different events. To overcome these limitations, we propose a non-parametric Bayesian mixture model with word embeddings for event extraction, in which the number of events can be inferred automatically and the issue of lexical variations for the same named entity can be dealt with properly. Our model has been evaluated on three datasets with sizes ranging between 2,499 and over 60 million tweets. Experimental results show that our model outperforms the baseline approach on all datasets by 5-8% in F-measure.
Tasks Word Embeddings
Published 2017-04-01
URL https://www.aclweb.org/anthology/E17-1076/
PDF https://www.aclweb.org/anthology/E17-1076
PWC https://paperswithcode.com/paper/event-extraction-from-twitter-using-non
Repo
Framework

Indexicals and Compositionality: Inside-Out or Outside-In?

Title Indexicals and Compositionality: Inside-Out or Outside-In?
Authors Johan Bos
Abstract
Tasks Semantic Parsing
Published 2017-01-01
URL https://www.aclweb.org/anthology/W17-6905/
PDF https://www.aclweb.org/anthology/W17-6905
PWC https://paperswithcode.com/paper/indexicals-and-compositionality-inside-out-or
Repo
Framework

Polish evaluation dataset for compositional distributional semantics models

Title Polish evaluation dataset for compositional distributional semantics models
Authors Alina Wróblewska, Katarzyna Krasnowska-Kieraś
Abstract The paper presents a procedure of building an evaluation dataset for the validation of compositional distributional semantics models estimated for languages other than English. The procedure generally builds on steps designed to assemble the SICK corpus, which contains pairs of English sentences annotated for semantic relatedness and entailment, because we aim at building a comparable dataset. However, the implementation of particular building steps significantly differs from the original SICK design assumptions, which is caused by both lack of necessary extraneous resources for an investigated language and the need for language-specific transformation rules. The designed procedure is verified on Polish, a fusional language with a relatively free word order, and contributes to building a Polish evaluation dataset. The resource consists of 10K sentence pairs which are human-annotated for semantic relatedness and entailment. The dataset may be used for the evaluation of compositional distributional semantics models of Polish.
Tasks Semantic Composition
Published 2017-07-01
URL https://www.aclweb.org/anthology/P17-1073/
PDF https://www.aclweb.org/anthology/P17-1073
PWC https://paperswithcode.com/paper/polish-evaluation-dataset-for-compositional
Repo
Framework

Partial Hard Thresholding: Towards A Principled Analysis of Support Recovery

Title Partial Hard Thresholding: Towards A Principled Analysis of Support Recovery
Authors Jie Shen, Ping Li
Abstract In machine learning and compressed sensing, it is of central importance to understand when a tractable algorithm recovers the support of a sparse signal from its compressed measurements. In this paper, we present a principled analysis on the support recovery performance for a family of hard thresholding algorithms. To this end, we appeal to the partial hard thresholding (PHT) operator proposed recently by Jain et al. [IEEE Trans. Information Theory, 2017]. We show that under proper conditions, PHT recovers an arbitrary s-sparse signal within O(sκ log κ) iterations, where κ is an appropriate condition number. Specifying the PHT operator, we obtain the best known result for hard thresholding pursuit and orthogonal matching pursuit with replacement. Experiments on the simulated data complement our theoretical findings and also illustrate the effectiveness of PHT compared to other popular recovery methods.
Tasks
Published 2017-12-01
URL http://papers.nips.cc/paper/6905-partial-hard-thresholding-towards-a-principled-analysis-of-support-recovery
PDF http://papers.nips.cc/paper/6905-partial-hard-thresholding-towards-a-principled-analysis-of-support-recovery.pdf
PWC https://paperswithcode.com/paper/partial-hard-thresholding-towards-a
Repo
Framework
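
For readers unfamiliar with the family of algorithms the abstract refers to, the basic hard thresholding step keeps the s largest-magnitude entries of a vector and sets the rest to zero; the indices that survive form the estimated support. The sketch below shows only that generic step, under the assumption that plain Python lists stand in for signal vectors. It is not the PHT operator of Jain et al., which the abstract cites but does not define, and the function names are illustrative.

```python
def hard_threshold(x, s):
    """Keep the s largest-magnitude entries of x and zero out all others."""
    if s <= 0:
        return [0.0] * len(x)
    # Indices sorted by decreasing magnitude; ties keep their original order.
    by_magnitude = sorted(range(len(x)), key=lambda i: abs(x[i]), reverse=True)
    keep = set(by_magnitude[:s])
    return [x[i] if i in keep else 0.0 for i in range(len(x))]


def support(x):
    """Indices of the nonzero entries of x."""
    return [i for i, value in enumerate(x) if value != 0.0]


signal = [0.5, -3.0, 1.0, 2.0]
sparse = hard_threshold(signal, 2)
print(sparse)
print(support(sparse))
```

With s = 2, the entries -3.0 and 2.0 are retained, so the recovered support is the index set {1, 3}.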

Adaptive stimulus selection for optimizing neural population responses

Title Adaptive stimulus selection for optimizing neural population responses
Authors Benjamin Cowley, Ryan Williamson, Katerina Clemens, Matthew Smith, Byron M. Yu
Abstract Adaptive stimulus selection methods in neuroscience have primarily focused on maximizing the firing rate of a single recorded neuron. When recording from a population of neurons, it is usually not possible to find a single stimulus that maximizes the firing rates of all neurons. This motivates optimizing an objective function that takes into account the responses of all recorded neurons together. We propose “Adept,” an adaptive stimulus selection method that can optimize population objective functions. In simulations, we first confirmed that population objective functions elicited more diverse stimulus responses than single-neuron objective functions. Then, we tested Adept in a closed-loop electrophysiological experiment in which population activity was recorded from macaque V4, a cortical area known for mid-level visual processing. To predict neural responses, we used the outputs of a deep convolutional neural network model as feature embeddings. Images chosen by Adept elicited mean neural responses that were 20% larger than those for randomly-chosen natural images, and also evoked a larger diversity of neural responses. Such adaptive stimulus selection methods can facilitate experiments that involve neurons far from the sensory periphery, for which it is often unclear which stimuli to present.
Tasks
Published 2017-12-01
URL http://papers.nips.cc/paper/6738-adaptive-stimulus-selection-for-optimizing-neural-population-responses
PDF http://papers.nips.cc/paper/6738-adaptive-stimulus-selection-for-optimizing-neural-population-responses.pdf
PWC https://paperswithcode.com/paper/adaptive-stimulus-selection-for-optimizing
Repo
Framework