Paper Group NANR 138
Correcting General Purpose ASR Errors using Posteriors. Multi-source morphosyntactic tagging for spoken Rusyn. Did you ever read about Frogs drinking Coffee? Investigating the Compositionality of Multi-Emoji Expressions. PLN-PUCRS at EmoInt-2017: Psycholinguistic features for emotion intensity prediction in tweets. Stanford’s Graph-based Neural Dep …
Correcting General Purpose ASR Errors using Posteriors
Title | Correcting General Purpose ASR Errors using Posteriors |
Authors | Sunil Kumar Kopparapu, C. Anantaram |
Abstract | |
Tasks | |
Published | 2017-12-01 |
URL | https://www.aclweb.org/anthology/W17-7535/ |
https://www.aclweb.org/anthology/W17-7535 | |
PWC | https://paperswithcode.com/paper/correcting-general-purpose-asr-errors-using |
Repo | |
Framework | |
Multi-source morphosyntactic tagging for spoken Rusyn
Title | Multi-source morphosyntactic tagging for spoken Rusyn |
Authors | Yves Scherrer, Achim Rabus |
Abstract | This paper deals with the development of morphosyntactic taggers for spoken varieties of the Slavic minority language Rusyn. As neither annotated corpora nor parallel corpora are electronically available for Rusyn, we propose to combine existing resources from the etymologically close Slavic languages Russian, Ukrainian, Slovak, and Polish and adapt them to Rusyn. Using MarMoT as tagging toolkit, we show that a tagger trained on a balanced set of the four source languages outperforms single language taggers by about 9{%}, and that additional automatically induced morphosyntactic lexicons lead to further improvements. The best observed accuracies for Rusyn are 82.4{%} for part-of-speech tagging and 75.5{%} for full morphological tagging. |
Tasks | Morphological Tagging, Part-Of-Speech Tagging |
Published | 2017-04-01 |
URL | https://www.aclweb.org/anthology/W17-1210/ |
https://www.aclweb.org/anthology/W17-1210 | |
PWC | https://paperswithcode.com/paper/multi-source-morphosyntactic-tagging-for |
Repo | |
Framework | |
Did you ever read about Frogs drinking Coffee? Investigating the Compositionality of Multi-Emoji Expressions
Title | Did you ever read about Frogs drinking Coffee? Investigating the Compositionality of Multi-Emoji Expressions |
Authors | Rebeca Padilla L{'o}pez, Fabienne Cap |
Abstract | In this work, we present a first attempt to investigate multi-emoji expressions and whether they behave similarly to multiword expressions in terms of non-compositionality. We focus on the combination of the frog and the hot beverage emoji, but also show some preliminary results for other non-compositional emoji combinations. We use off-the-shelf sentiment analysers as well as manual classifications to approach the compositionality of these emoji combinations. |
Tasks | Sentiment Analysis |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/W17-5215/ |
https://www.aclweb.org/anthology/W17-5215 | |
PWC | https://paperswithcode.com/paper/did-you-ever-read-about-frogs-drinking-coffee |
Repo | |
Framework | |
PLN-PUCRS at EmoInt-2017: Psycholinguistic features for emotion intensity prediction in tweets
Title | PLN-PUCRS at EmoInt-2017: Psycholinguistic features for emotion intensity prediction in tweets |
Authors | Henrique Santos, Renata Vieira |
Abstract | Linguistic Inquiry and Word Count (LIWC) is a rich dictionary that map words into several psychological categories such as Affective, Social, Cognitive, Perceptual and Biological processes. In this work, we have used LIWC psycholinguistic categories to train regression models and predict emotion intensity in tweets for the EmoInt-2017 task. Results show that LIWC features may boost emotion intensity prediction on the basis of a low dimension set. |
Tasks | Sentiment Analysis |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/W17-5225/ |
https://www.aclweb.org/anthology/W17-5225 | |
PWC | https://paperswithcode.com/paper/pln-pucrs-at-emoint-2017-psycholinguistic |
Repo | |
Framework | |
Stanford’s Graph-based Neural Dependency Parser at the CoNLL 2017 Shared Task
Title | Stanford’s Graph-based Neural Dependency Parser at the CoNLL 2017 Shared Task |
Authors | Timothy Dozat, Peng Qi, Christopher D. Manning |
Abstract | This paper describes the neural dependency parser submitted by Stanford to the CoNLL 2017 Shared Task on parsing Universal Dependencies. Our system uses relatively simple LSTM networks to produce part of speech tags and labeled dependency parses from segmented and tokenized sequences of words. In order to address the rare word problem that abounds in languages with complex morphology, we include a character-based word representation that uses an LSTM to produce embeddings from sequences of characters. Our system was ranked first according to all five relevant metrics for the system: UPOS tagging (93.09{%}), XPOS tagging (82.27{%}), unlabeled attachment score (81.30{%}), labeled attachment score (76.30{%}), and content word labeled attachment score (72.57{%}). |
Tasks | Accuracy Metrics, Dependency Parsing |
Published | 2017-08-01 |
URL | https://www.aclweb.org/anthology/K17-3002/ |
https://www.aclweb.org/anthology/K17-3002 | |
PWC | https://paperswithcode.com/paper/stanfords-graph-based-neural-dependency |
Repo | |
Framework | |
IITP at EmoInt-2017: Measuring Intensity of Emotions using Sentence Embeddings and Optimized Features
Title | IITP at EmoInt-2017: Measuring Intensity of Emotions using Sentence Embeddings and Optimized Features |
Authors | Md Shad Akhtar, Palaash Sawant, Asif Ekbal, Jyoti Pawar, Pushpak Bhattacharyya |
Abstract | This paper describes the system that we submitted as part of our participation in the shared task on Emotion Intensity (EmoInt-2017). We propose a Long short term memory (LSTM) based architecture cascaded with Support Vector Regressor (SVR) for intensity prediction. We also employ Particle Swarm Optimization (PSO) based feature selection algorithm for obtaining an optimized feature set for training and evaluation. System evaluation shows interesting results on the four emotion datasets i.e. anger, fear, joy and sadness. In comparison to the other participating teams our system was ranked 5th in the competition. |
Tasks | Emotion Recognition, Feature Selection, Sentence Embeddings, Word Embeddings |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/W17-5229/ |
https://www.aclweb.org/anthology/W17-5229 | |
PWC | https://paperswithcode.com/paper/iitp-at-emoint-2017-measuring-intensity-of |
Repo | |
Framework | |
Deep Learning for Biomedical Information Retrieval: Learning Textual Relevance from Click Logs
Title | Deep Learning for Biomedical Information Retrieval: Learning Textual Relevance from Click Logs |
Authors | Sunil Mohan, Nicolas Fiorini, Sun Kim, Zhiyong Lu |
Abstract | We describe a Deep Learning approach to modeling the relevance of a document{'}s text to a query, applied to biomedical literature. Instead of mapping each document and query to a common semantic space, we compute a variable-length difference vector between the query and document which is then passed through a deep convolution stage followed by a deep regression network to produce the estimated probability of the document{'}s relevance to the query. Despite the small amount of training data, this approach produces a more robust predictor than computing similarities between semantic vector representations of the query and document, and also results in significant improvements over traditional IR text factors. In the future, we plan to explore its application in improving PubMed search. |
Tasks | Information Retrieval, Text Matching |
Published | 2017-08-01 |
URL | https://www.aclweb.org/anthology/W17-2328/ |
https://www.aclweb.org/anthology/W17-2328 | |
PWC | https://paperswithcode.com/paper/deep-learning-for-biomedical-information |
Repo | |
Framework | |
Traversal-Free Word Vector Evaluation in Analogy Space
Title | Traversal-Free Word Vector Evaluation in Analogy Space |
Authors | Xiaoyin Che, Nico Ring, Willi Raschkowski, Haojin Yang, Christoph Meinel |
Abstract | In this paper, we propose an alternative evaluating metric for word analogy questions (A to B is as C to D) in word vector evaluation. Different from the traditional method which predicts the fourth word by the given three, we measure the similarity directly on the {``}relations{''} of two pairs of given words, just as shifting the relation vectors into a new analogy space. Cosine and Euclidean distances are then calculated as measurements. Observation and experiments shows the proposed analogy space evaluation could offer a more comprehensive evaluating result on word vectors with word analogy questions. Meanwhile, computational complexity are remarkably reduced by avoiding traversing the vocabulary. | |
Tasks | |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/W17-5302/ |
https://www.aclweb.org/anthology/W17-5302 | |
PWC | https://paperswithcode.com/paper/traversal-free-word-vector-evaluation-in |
Repo | |
Framework | |
Iconic Locations in Swedish Sign Language: Mapping Form to Meaning with Lexical Databases
Title | Iconic Locations in Swedish Sign Language: Mapping Form to Meaning with Lexical Databases |
Authors | Carl B{"o}rstell, Robert {"O}stling |
Abstract | |
Tasks | |
Published | 2017-05-01 |
URL | https://www.aclweb.org/anthology/W17-0226/ |
https://www.aclweb.org/anthology/W17-0226 | |
PWC | https://paperswithcode.com/paper/iconic-locations-in-swedish-sign-language |
Repo | |
Framework | |
Recognizing Textual Entailment in Twitter Using Word Embeddings
Title | Recognizing Textual Entailment in Twitter Using Word Embeddings |
Authors | Octavia-Maria {\c{S}}ulea |
Abstract | In this paper, we investigate the application of machine learning techniques and word embeddings to the task of Recognizing Textual Entailment (RTE) in Social Media. We look at a manually labeled dataset consisting of user generated short texts posted on Twitter (tweets) and related to four recent media events (the Charlie Hebdo shooting, the Ottawa shooting, the Sydney Siege, and the German Wings crash) and test to what extent neural techniques and embeddings are able to distinguish between tweets that entail or contradict each other or that claim unrelated things. We obtain comparable results to the state of the art in a train-test setting, but we show that, due to the noisy aspect of the data, results plummet in an evaluation strategy crafted to better simulate a real-life train-test scenario. |
Tasks | Information Retrieval, Machine Translation, Natural Language Inference, Question Answering, Text Classification, Word Embeddings |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/W17-5306/ |
https://www.aclweb.org/anthology/W17-5306 | |
PWC | https://paperswithcode.com/paper/recognizing-textual-entailment-in-twitter |
Repo | |
Framework | |
Towards Full Text Shallow Discourse Relation Annotation: Experiments with Cross-Paragraph Implicit Relations in the PDTB
Title | Towards Full Text Shallow Discourse Relation Annotation: Experiments with Cross-Paragraph Implicit Relations in the PDTB |
Authors | Rashmi Prasad, Katherine Forbes Riley, Alan Lee |
Abstract | Full text discourse parsing relies on texts comprehensively annotated with discourse relations. To this end, we address a significant gap in the inter-sentential discourse relations annotated in the Penn Discourse Treebank (PDTB), namely the class of cross-paragraph implicit relations, which account for 30{%} of inter-sentential relations in the corpus. We present our annotation study to explore the incidence rate of adjacent vs. non-adjacent implicit relations in cross-paragraph contexts, and the relative degree of difficulty in annotating them. Our experiments show a high incidence of non-adjacent relations that are difficult to annotate reliably, suggesting the practicality of backing off from their annotation to reduce noise for corpus-based studies. Our resulting guidelines follow the PDTB adjacency constraint for implicits while employing an underspecified representation of non-adjacent implicits, and yield 62{%} inter-annotator agreement on this task. |
Tasks | |
Published | 2017-08-01 |
URL | https://www.aclweb.org/anthology/W17-5502/ |
https://www.aclweb.org/anthology/W17-5502 | |
PWC | https://paperswithcode.com/paper/towards-full-text-shallow-discourse-relation |
Repo | |
Framework | |
As bases de dados verbais ADESSE e ViPEr: uma an'alise constrastiva das constru\cc~oes locativas em espanhol e em portugu^es (The verbal databases ADESSE and ViPEr: a contrastive analysis of locative constructs in Spanish and Portuguese)[In Portuguese]
Title | As bases de dados verbais ADESSE e ViPEr: uma an'alise constrastiva das constru\cc~oes locativas em espanhol e em portugu^es (The verbal databases ADESSE and ViPEr: a contrastive analysis of locative constructs in Spanish and Portuguese)[In Portuguese] |
Authors | Roana Rodrigues, Oto Vale, Laura Alonso Alemany |
Abstract | |
Tasks | |
Published | 2017-10-01 |
URL | https://www.aclweb.org/anthology/W17-6631/ |
https://www.aclweb.org/anthology/W17-6631 | |
PWC | https://paperswithcode.com/paper/as-bases-de-dados-verbais-adesse-e-viper-uma |
Repo | |
Framework | |
Multilingual and Cross-Lingual Complex Word Identification
Title | Multilingual and Cross-Lingual Complex Word Identification |
Authors | Seid Muhie Yimam, Sanja {\v{S}}tajner, Martin Riedl, Chris Biemann |
Abstract | Complex Word Identification (CWI) is an important task in lexical simplification and text accessibility. Due to the lack of CWI datasets, previous works largely depend on Simple English Wikipedia and edit histories for obtaining {`}gold standard{'} annotations, which are of doubtable quality, and limited only to English. We collect complex words/phrases (CP) for English, German and Spanish, annotated by both native and non-native speakers, and propose language independent features that can be used to train multilingual and cross-lingual CWI models. We show that the performance of cross-lingual CWI systems (using a model trained on one language and applying it on the other languages) is comparable to the performance of monolingual CWI systems. | |
Tasks | Complex Word Identification, Lexical Simplification |
Published | 2017-09-01 |
URL | https://www.aclweb.org/anthology/R17-1104/ |
https://doi.org/10.26615/978-954-452-049-6_104 | |
PWC | https://paperswithcode.com/paper/multilingual-and-cross-lingual-complex-word |
Repo | |
Framework | |
Detecting Untranslated Content for Neural Machine Translation
Title | Detecting Untranslated Content for Neural Machine Translation |
Authors | Isao Goto, Hideki Tanaka |
Abstract | Despite its promise, neural machine translation (NMT) has a serious problem in that source content may be mistakenly left untranslated. The ability to detect untranslated content is important for the practical use of NMT. We evaluate two types of probability with which to detect untranslated content: the cumulative attention (ATN) probability and back translation (BT) probability from the target sentence to the source sentence. Experiments on detecting untranslated content in Japanese-English patent translations show that ATN and BT are each more effective than random choice, BT is more effective than ATN, and the combination of the two provides further improvements. We also confirmed the effectiveness of using ATN and BT to rerank the n-best NMT outputs. |
Tasks | Machine Translation |
Published | 2017-08-01 |
URL | https://www.aclweb.org/anthology/W17-3206/ |
https://www.aclweb.org/anthology/W17-3206 | |
PWC | https://paperswithcode.com/paper/detecting-untranslated-content-for-neural |
Repo | |
Framework | |
Demonstration of interactive teaching for end-to-end dialog control with hybrid code networks
Title | Demonstration of interactive teaching for end-to-end dialog control with hybrid code networks |
Authors | Jason D. Williams, Lars Liden |
Abstract | This is a demonstration of interactive teaching for practical end-to-end dialog systems driven by a recurrent neural network. In this approach, a developer teaches the network by interacting with the system and providing on-the-spot corrections. Once a system is deployed, a developer can also correct mistakes in logged dialogs. This demonstration shows both of these teaching methods applied to dialog systems in three domains: pizza ordering, restaurant information, and weather forecasts. |
Tasks | Dialog Learning, Entity Extraction, Intent Detection |
Published | 2017-08-01 |
URL | https://www.aclweb.org/anthology/W17-5511/ |
https://www.aclweb.org/anthology/W17-5511 | |
PWC | https://paperswithcode.com/paper/demonstration-of-interactive-teaching-for-end |
Repo | |
Framework | |