May 5, 2019

1678 words 8 mins read

Paper Group NANR 116

Paper Group NANR 116

Classifying ReachOut posts with a radial basis function SVM. A Hybrid Approach to Generation of Missing Abstracts in Biomedical Literature. Sub-Word Similarity based Search for Embeddings: Inducing Rare-Word Embeddings for Word Similarity Tasks and Language Modelling. An Open Web Platform for Rule-Based Speech-to-Sign Translation. Domain Adaptation …

Classifying ReachOut posts with a radial basis function SVM

Title Classifying ReachOut posts with a radial basis function SVM
Authors Chris Brew
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0315/
PDF https://www.aclweb.org/anthology/W16-0315
PWC https://paperswithcode.com/paper/classifying-reachout-posts-with-a-radial
Repo
Framework

A Hybrid Approach to Generation of Missing Abstracts in Biomedical Literature

Title A Hybrid Approach to Generation of Missing Abstracts in Biomedical Literature
Authors Suchet Chachra, Asma Ben Abacha, Sonya Shooshan, Laritza Rodriguez, Dina Demner-Fushman
Abstract Readers usually rely on abstracts to identify relevant medical information from scientific articles. Abstracts are also essential to advanced information retrieval methods. More than 50 thousand scientific publications in PubMed lack author-generated abstracts, and the relevancy judgements for these papers have to be based on their titles alone. In this paper, we propose a hybrid summarization technique that aims to select the most pertinent sentences from articles to generate an extractive summary in lieu of a missing abstract. We combine i) health outcome detection, ii) keyphrase extraction, and iii) textual entailment recognition between sentences. We evaluate our hybrid approach and analyze the improvements of multi-factor summarization over techniques that rely on a single method, using a collection of 295 manually generated reference summaries. The obtained results show that the hybrid approach outperforms the baseline techniques with an improvement of 13{%} in recall and 4{%} in F1 score.
Tasks Information Retrieval, Natural Language Inference, Text Summarization
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1104/
PDF https://www.aclweb.org/anthology/C16-1104
PWC https://paperswithcode.com/paper/a-hybrid-approach-to-generation-of-missing
Repo
Framework

Sub-Word Similarity based Search for Embeddings: Inducing Rare-Word Embeddings for Word Similarity Tasks and Language Modelling

Title Sub-Word Similarity based Search for Embeddings: Inducing Rare-Word Embeddings for Word Similarity Tasks and Language Modelling
Authors Mittul Singh, Clayton Greenberg, Youssef Oualil, Dietrich Klakow
Abstract Training good word embeddings requires large amounts of data. Out-of-vocabulary words will still be encountered at test-time, leaving these words without embeddings. To overcome this lack of embeddings for rare words, existing methods leverage morphological features to generate embeddings. While the existing methods use computationally-intensive rule-based (Soricut and Och, 2015) or tool-based (Botha and Blunsom, 2014) morphological analysis to generate embeddings, our system applies a computationally-simpler sub-word search on words that have existing embeddings. Embeddings of the sub-word search results are then combined using string similarity functions to generate rare word embeddings. We augmented pre-trained word embeddings with these novel embeddings and evaluated on a rare word similarity task, obtaining up to 3 times improvement in correlation over the original set of embeddings. Applying our technique to embeddings trained on larger datasets led to on-par performance with the existing state-of-the-art for this task. Additionally, while analysing augmented embeddings in a log-bilinear language model, we observed up to 50{%} reduction in rare word perplexity in comparison to other more complex language models.
Tasks Language Modelling, Morphological Analysis, Word Embeddings
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1194/
PDF https://www.aclweb.org/anthology/C16-1194
PWC https://paperswithcode.com/paper/sub-word-similarity-based-search-for
Repo
Framework

An Open Web Platform for Rule-Based Speech-to-Sign Translation

Title An Open Web Platform for Rule-Based Speech-to-Sign Translation
Authors Manny Rayner, Pierrette Bouillon, Sarah Ebling, Johanna Gerlach, Irene Strasly, Nikos Tsourakis
Abstract
Tasks Machine Translation, Sign Language Recognition, Sign Language Translation, Speech Recognition
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-2027/
PDF https://www.aclweb.org/anthology/P16-2027
PWC https://paperswithcode.com/paper/an-open-web-platform-for-rule-based-speech-to
Repo
Framework

Domain Adaptation of Polarity Lexicon combining Term Frequency and Bootstrapping

Title Domain Adaptation of Polarity Lexicon combining Term Frequency and Bootstrapping
Authors Salud Mar{'\i}a Jim{'e}nez-Zafra, Maite Martin, M. Dolores Molina-Gonzalez, L. Alfonso Ure{~n}a-L{'o}pez
Abstract
Tasks Domain Adaptation, Sentiment Analysis
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0422/
PDF https://www.aclweb.org/anthology/W16-0422
PWC https://paperswithcode.com/paper/domain-adaptation-of-polarity-lexicon
Repo
Framework

How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment

Title How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
Authors Patrick Holthaus, Christian Leichsenring, Jasmin Bernotat, Viktor Richter, Marian Pohling, Birte Carlmeyer, Norman K{"o}ster, Sebastian Meyer zu Borgsen, Ren{'e} Zorn, Birte Schiffhauer, Kai Frederic Engelmann, Florian Lier, Simon Schulz, Philipp Cimiano, Friederike Eyssel, Thomas Hermann, Franz Kummert, David Schlangen, Sven Wachsmuth, Petra Wagner, Britta Wrede, Sebastian Wrede
Abstract In order to explore intuitive verbal and non-verbal interfaces in smart environments we recorded user interactions with an intelligent apartment. Besides offering various interactive capabilities itself, the apartment is also inhabited by a social robot that is available as a humanoid interface. This paper presents a multi-modal corpus that contains goal-directed actions of naive users in attempts to solve a number of predefined tasks. Alongside audio and video recordings, our data-set consists of large amount of temporally aligned sensory data and system behavior provided by the environment and its interactive components. Non-verbal system responses such as changes in light or display contents, as well as robot and apartment utterances and gestures serve as a rich basis for later in-depth analysis. Manual annotations provide further information about meta data like the current course of study and user behavior including the incorporated modality, all literal utterances, language features, emotional expressions, foci of attention, and addressees.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1549/
PDF https://www.aclweb.org/anthology/L16-1549
PWC https://paperswithcode.com/paper/how-to-address-smart-homes-with-a-social
Repo
Framework

SimpleScience: Lexical Simplification of Scientific Terminology

Title SimpleScience: Lexical Simplification of Scientific Terminology
Authors Yea-Seul Kim, Jessica Hullman, Matthew Burgess, Eytan Adar
Abstract
Tasks Lexical Simplification, Word Embeddings
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1114/
PDF https://www.aclweb.org/anthology/D16-1114
PWC https://paperswithcode.com/paper/simplescience-lexical-simplification-of
Repo
Framework

Nonparametric Bayesian Models for Spoken Language Understanding

Title Nonparametric Bayesian Models for Spoken Language Understanding
Authors Kei Wakabayashi, Johane Takeuchi, Kotaro Funakoshi, Mikio Nakano
Abstract
Tasks Slot Filling, Spoken Language Understanding
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1232/
PDF https://www.aclweb.org/anthology/D16-1232
PWC https://paperswithcode.com/paper/nonparametric-bayesian-models-for-spoken
Repo
Framework

Task Lineages: Dialog State Tracking for Flexible Interaction

Title Task Lineages: Dialog State Tracking for Flexible Interaction
Authors Sungjin Lee, Am Stent, a
Abstract
Tasks Speech Recognition, Spoken Language Understanding
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3602/
PDF https://www.aclweb.org/anthology/W16-3602
PWC https://paperswithcode.com/paper/task-lineages-dialog-state-tracking-for
Repo
Framework

Stochastic Online AUC Maximization

Title Stochastic Online AUC Maximization
Authors Yiming Ying, Longyin Wen, Siwei Lyu
Abstract Area under ROC (AUC) is a metric which is widely used for measuring the classification performance for imbalanced data. It is of theoretical and practical interest to develop online learning algorithms that maximizes AUC for large-scale data. A specific challenge in developing online AUC maximization algorithm is that the learning objective function is usually defined over a pair of training examples of opposite classes, and existing methods achieves on-line processing with higher space and time complexity. In this work, we propose a new stochastic online algorithm for AUC maximization. In particular, we show that AUC optimization can be equivalently formulated as a convex-concave saddle point problem. From this saddle representation, a stochastic online algorithm (SOLAM) is proposed which has time and space complexity of one datum. We establish theoretical convergence of SOLAM with high probability and demonstrate its effectiveness and efficiency on standard benchmark datasets.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6065-stochastic-online-auc-maximization
PDF http://papers.nips.cc/paper/6065-stochastic-online-auc-maximization.pdf
PWC https://paperswithcode.com/paper/stochastic-online-auc-maximization
Repo
Framework

Reddit Temporal N-gram Corpus and its Applications on Paraphrase and Semantic Similarity in Social Media using a Topic-based Latent Semantic Analysis

Title Reddit Temporal N-gram Corpus and its Applications on Paraphrase and Semantic Similarity in Social Media using a Topic-based Latent Semantic Analysis
Authors Anh Dang, Abidalrahman Moh{'}d, Aminul Islam, Rosane Minghim, Michael Smit, Evangelos Milios
Abstract This paper introduces a new large-scale n-gram corpus that is created specifically from social media text. Two distinguishing characteristics of this corpus are its monthly temporal attribute and that it is created from 1.65 billion comments of user-generated text in Reddit. The usefulness of this corpus is exemplified and evaluated by a novel Topic-based Latent Semantic Analysis (TLSA) algorithm. The experimental results show that unsupervised TLSA outperforms all the state-of-the-art unsupervised and semi-supervised methods in SEMEVAL 2015: paraphrase and semantic similarity in Twitter tasks.
Tasks Information Retrieval, Language Modelling, Machine Translation, Named Entity Recognition, Paraphrase Identification, Rumour Detection, Semantic Similarity, Semantic Textual Similarity, Speech Recognition
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1335/
PDF https://www.aclweb.org/anthology/C16-1335
PWC https://paperswithcode.com/paper/reddit-temporal-n-gram-corpus-and-its
Repo
Framework

Transforming Dependency Structures to Logical Forms for Semantic Parsing

Title Transforming Dependency Structures to Logical Forms for Semantic Parsing
Authors Siva Reddy, Oscar T{"a}ckstr{"o}m, Michael Collins, Tom Kwiatkowski, Dipanjan Das, Mark Steedman, Mirella Lapata
Abstract The strongly typed syntax of grammar formalisms such as CCG, TAG, LFG and HPSG offers a synchronous framework for deriving syntactic structures and semantic logical forms. In contrast{—}partly due to the lack of a strong type system{—}dependency structures are easy to annotate and have become a widely used form of syntactic analysis for many languages. However, the lack of a type system makes a formal mechanism for deriving logical forms from dependency structures challenging. We address this by introducing a robust system based on the lambda calculus for deriving neo-Davidsonian logical forms from dependency trees. These logical forms are then used for semantic parsing of natural language to Freebase. Experiments on the Free917 and Web-Questions datasets show that our representation is superior to the original dependency trees and that it outperforms a CCG-based representation on this task. Compared to prior work, we obtain the strongest result to date on Free917 and competitive results on WebQuestions.
Tasks Question Answering, Semantic Parsing
Published 2016-01-01
URL https://www.aclweb.org/anthology/Q16-1010/
PDF https://www.aclweb.org/anthology/Q16-1010
PWC https://paperswithcode.com/paper/transforming-dependency-structures-to-logical
Repo
Framework

Improving Document Ranking using Query Expansion and Classification Techniques for Mixed Script Information Retrieval

Title Improving Document Ranking using Query Expansion and Classification Techniques for Mixed Script Information Retrieval
Authors Subham Kumar, Anwesh Sinha Ray, Sabyasachi Kamila, Asif Ekbal, Sriparna Saha, Pushpak Bhattacharyya
Abstract
Tasks Document Ranking, Information Retrieval, Transliteration
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-6311/
PDF https://www.aclweb.org/anthology/W16-6311
PWC https://paperswithcode.com/paper/improving-document-ranking-using-query
Repo
Framework

Linguistic Issues in the Machine Transliteration of Chinese, Japanese and Arabic Names

Title Linguistic Issues in the Machine Transliteration of Chinese, Japanese and Arabic Names
Authors Jack Halpern
Abstract
Tasks Entity Extraction, Transliteration
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2707/
PDF https://www.aclweb.org/anthology/W16-2707
PWC https://paperswithcode.com/paper/linguistic-issues-in-the-machine
Repo
Framework

Scalable Semi-Supervised Query Classification Using Matrix Sketching

Title Scalable Semi-Supervised Query Classification Using Matrix Sketching
Authors Young-Bum Kim, Karl Stratos, Ruhi Sarikaya
Abstract
Tasks Intent Classification, Representation Learning, Sentence Classification
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-2002/
PDF https://www.aclweb.org/anthology/P16-2002
PWC https://paperswithcode.com/paper/scalable-semi-supervised-query-classification
Repo
Framework
comments powered by Disqus