May 5, 2019

1752 words 9 mins read

Paper Group NANR 60

Paper Group NANR 60

UoB-UK at SemEval-2016 Task 1: A Flexible and Extendable System for Semantic Text Similarity using Types, Surprise and Phrase Linking. UNBNLP at SemEval-2016 Task 1: Semantic Textual Similarity: A Unified Framework for Semantic Processing and Evaluation. Fast recovery from a union of subspaces. Annotating Logical Forms for EHR Questions. Semantic T …

UoB-UK at SemEval-2016 Task 1: A Flexible and Extendable System for Semantic Text Similarity using Types, Surprise and Phrase Linking

Title UoB-UK at SemEval-2016 Task 1: A Flexible and Extendable System for Semantic Text Similarity using Types, Surprise and Phrase Linking
Authors Harish Tayyar Madabushi, Mark Buhagiar, Mark Lee
Abstract
Tasks Machine Translation
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1104/
PDF https://www.aclweb.org/anthology/S16-1104
PWC https://paperswithcode.com/paper/uob-uk-at-semeval-2016-task-1-a-flexible-and
Repo
Framework

UNBNLP at SemEval-2016 Task 1: Semantic Textual Similarity: A Unified Framework for Semantic Processing and Evaluation

Title UNBNLP at SemEval-2016 Task 1: Semantic Textual Similarity: A Unified Framework for Semantic Processing and Evaluation
Authors Milton King, Waseem Gharbieh, SoHyun Park, Paul Cook
Abstract
Tasks Semantic Textual Similarity, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1113/
PDF https://www.aclweb.org/anthology/S16-1113
PWC https://paperswithcode.com/paper/unbnlp-at-semeval-2016-task-1-semantic
Repo
Framework

Fast recovery from a union of subspaces

Title Fast recovery from a union of subspaces
Authors Chinmay Hegde, Piotr Indyk, Ludwig Schmidt
Abstract We address the problem of recovering a high-dimensional but structured vector from linear observations in a general setting where the vector can come from an arbitrary union of subspaces. This setup includes well-studied problems such as compressive sensing and low-rank matrix recovery. We show how to design more efficient algorithms for the union-of subspace recovery problem by using approximate projections. Instantiating our general framework for the low-rank matrix recovery problem gives the fastest provable running time for an algorithm with optimal sample complexity. Moreover, we give fast approximate projections for 2D histograms, another well-studied low-dimensional model of data. We complement our theoretical results with experiments demonstrating that our framework also leads to improved time and sample complexity empirically.
Tasks Compressive Sensing
Published 2016-12-01
URL http://papers.nips.cc/paper/6484-fast-recovery-from-a-union-of-subspaces
PDF http://papers.nips.cc/paper/6484-fast-recovery-from-a-union-of-subspaces.pdf
PWC https://paperswithcode.com/paper/fast-recovery-from-a-union-of-subspaces
Repo
Framework

Annotating Logical Forms for EHR Questions

Title Annotating Logical Forms for EHR Questions
Authors Kirk Roberts, Dina Demner-Fushman
Abstract This paper discusses the creation of a semantically annotated corpus of questions about patient data in electronic health records (EHRs). The goal is provide the training data necessary for semantic parsers to automatically convert EHR questions into a structured query. A layered annotation strategy is used which mirrors a typical natural language processing (NLP) pipeline. First, questions are syntactically analyzed to identify multi-part questions. Second, medical concepts are recognized and normalized to a clinical ontology. Finally, logical forms are created using a lambda calculus representation. We use a corpus of 446 questions asking for patient-specific information. From these, 468 specific questions are found containing 259 unique medical concepts and requiring 53 unique predicates to represent the logical forms. We further present detailed characteristics of the corpus, including inter-annotator agreement results, and describe the challenges automatic NLP systems will face on this task.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1598/
PDF https://www.aclweb.org/anthology/L16-1598
PWC https://paperswithcode.com/paper/annotating-logical-forms-for-ehr-questions
Repo
Framework

Semantic Textual Similarity in Quality Estimation

Title Semantic Textual Similarity in Quality Estimation
Authors Hanna Bechara, Carla Parra Escartin, Constantin Orasan, Lucia Specia
Abstract
Tasks Machine Translation, Semantic Textual Similarity
Published 2016-01-01
URL https://www.aclweb.org/anthology/W16-3413/
PDF https://www.aclweb.org/anthology/W16-3413
PWC https://paperswithcode.com/paper/semantic-textual-similarity-in-quality
Repo
Framework

Keynote - More than meets the ear: Processes that shape dialogue

Title Keynote - More than meets the ear: Processes that shape dialogue
Authors Susan Brennan
Abstract
Tasks Spoken Dialogue Systems
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3607/
PDF https://www.aclweb.org/anthology/W16-3607
PWC https://paperswithcode.com/paper/keynote-more-than-meets-the-ear-processes
Repo
Framework

Unsupervised Learning from Noisy Networks with Applications to Hi-C Data

Title Unsupervised Learning from Noisy Networks with Applications to Hi-C Data
Authors Bo Wang, Junjie Zhu, Armin Pourshafeie, Oana Ursu, Serafim Batzoglou, Anshul Kundaje
Abstract Complex networks play an important role in a plethora of disciplines in natural sciences. Cleaning up noisy observed networks, poses an important challenge in network analysis Existing methods utilize labeled data to alleviate the noise effect in the network. However, labeled data is usually expensive to collect while unlabeled data can be gathered cheaply. In this paper, we propose an optimization framework to mine useful structures from noisy networks in an unsupervised manner. The key feature of our optimization framework is its ability to utilize local structures as well as global patterns in the network. We extend our method to incorporate multi-resolution networks in order to add further resistance to high-levels of noise. We also generalize our framework to utilize partial labels to enhance the performance. We specifically focus our method on multi-resolution Hi-C data by recovering clusters of genomic regions that co-localize in 3D space. Additionally, we use Capture-C-generated partial labels to further denoise the Hi-C network. We empirically demonstrate the effectiveness of our framework in denoising the network and improving community detection results.
Tasks Community Detection, Denoising
Published 2016-12-01
URL http://papers.nips.cc/paper/6291-unsupervised-learning-from-noisy-networks-with-applications-to-hi-c-data
PDF http://papers.nips.cc/paper/6291-unsupervised-learning-from-noisy-networks-with-applications-to-hi-c-data.pdf
PWC https://paperswithcode.com/paper/unsupervised-learning-from-noisy-networks
Repo
Framework

Parallel Sentence Extraction from Comparable Corpora with Neural Network Features

Title Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
Authors Chenhui Chu, Raj Dabre, Sadao Kurohashi
Abstract Parallel corpora are crucial for machine translation (MT), however they are quite scarce for most language pairs and domains. As comparable corpora are far more available, many studies have been conducted to extract parallel sentences from them for MT. In this paper, we exploit the neural network features acquired from neural MT for parallel sentence extraction. We observe significant improvements for both accuracy in sentence extraction and MT performance.
Tasks Machine Translation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1468/
PDF https://www.aclweb.org/anthology/L16-1468
PWC https://paperswithcode.com/paper/parallel-sentence-extraction-from-comparable
Repo
Framework

Summ-it++: an Enriched Version of the Summ-it Corpus

Title Summ-it++: an Enriched Version of the Summ-it Corpus
Authors Ev Fonseca, ro, Andr{'e} Antonitsch, S Collovini, ra, Daniela Amaral, Renata Vieira, Anny Figueira
Abstract This paper presents Summ-it++, an enriched version the Summ-it corpus. In this new version, the corpus has received new semantic layers, named entity categories and relations between named entities, adding to the previous coreference annotation. In addition, we change the original Summ-it format to SemEval
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1324/
PDF https://www.aclweb.org/anthology/L16-1324
PWC https://paperswithcode.com/paper/summ-it-an-enriched-version-of-the-summ-it
Repo
Framework

Japanese-English Machine Translation of Recipe Texts

Title Japanese-English Machine Translation of Recipe Texts
Authors Takayuki Sato, Jun Harashima, Mamoru Komachi
Abstract Concomitant with the globalization of food culture, demand for the recipes of specialty dishes has been increasing. The recent growth in recipe sharing websites and food blogs has resulted in numerous recipe texts being available for diverse foods in various languages. However, little work has been done on machine translation of recipe texts. In this paper, we address the task of translating recipes and investigate the advantages and disadvantages of traditional phrase-based statistical machine translation and more recent neural machine translation. Specifically, we translate Japanese recipes into English, analyze errors in the translated recipes, and discuss available room for improvements.
Tasks Information Retrieval, Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4603/
PDF https://www.aclweb.org/anthology/W16-4603
PWC https://paperswithcode.com/paper/japanese-english-machine-translation-of
Repo
Framework

Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)

Title Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)
Authors
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1000/
PDF https://www.aclweb.org/anthology/S16-1000
PWC https://paperswithcode.com/paper/proceedings-of-the-10th-international-3
Repo
Framework

iUBC at SemEval-2016 Task 2: RNNs and LSTMs for interpretable STS

Title iUBC at SemEval-2016 Task 2: RNNs and LSTMs for interpretable STS
Authors I{~n}igo Lopez-Gazpio, Eneko Agirre, Montse Maritxalar
Abstract
Tasks Chunking, Semantic Textual Similarity
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1119/
PDF https://www.aclweb.org/anthology/S16-1119
PWC https://paperswithcode.com/paper/iubc-at-semeval-2016-task-2-rnns-and-lstms
Repo
Framework

SCALE: A Scalable Language Engineering Toolkit

Title SCALE: A Scalable Language Engineering Toolkit
Authors Joris Pelemans, Lyan Verwimp, Kris Demuynck, Hugo Van hamme, Patrick Wambacq
Abstract In this paper we present SCALE, a new Python toolkit that contains two extensions to n-gram language models. The first extension is a novel technique to model compound words called Semantic Head Mapping (SHM). The second extension, Bag-of-Words Language Modeling (BagLM), bundles popular models such as Latent Semantic Analysis and Continuous Skip-grams. Both extensions scale to large data and allow the integration into first-pass ASR decoding. The toolkit is open source, includes working examples and can be found on http://github.com/jorispelemans/scale.
Tasks Language Modelling
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1612/
PDF https://www.aclweb.org/anthology/L16-1612
PWC https://paperswithcode.com/paper/scale-a-scalable-language-engineering-toolkit
Repo
Framework

Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis

Title Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis
Authors S Brognaux, rine, Thomas Fran{\c{c}}ois, Marco Saerens
Abstract Text-to-speech has long been centered on the production of an intelligible message of good quality. More recently, interest has shifted to the generation of more natural and expressive speech. A major issue of existing approaches is that they usually rely on a manual annotation in expressive styles, which tends to be rather subjective. A typical related issue is that the annotation is strongly influenced ― and possibly biased ― by the semantic content of the text (e.g. a shot or a fault may incite the annotator to tag that sequence as expressing a high degree of excitation, independently of its acoustic realization). This paper investigates the assumption that human annotation of basketball commentaries in excitation levels can be automatically improved on the basis of acoustic features. It presents two techniques for label correction exploiting a Gaussian mixture and a proportional-odds logistic regression. The automatically re-annotated corpus is then used to train HMM-based expressive speech synthesizers, the performance of which is assessed through subjective evaluations. The results indicate that the automatic correction of the annotation with Gaussian mixture helps to synthesize more contrasted excitation levels, while preserving naturalness.
Tasks Speech Synthesis
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1613/
PDF https://www.aclweb.org/anthology/L16-1613
PWC https://paperswithcode.com/paper/combining-manual-and-automatic-prosodic
Repo
Framework

BAS Speech Science Web Services - an Update of Current Developments

Title BAS Speech Science Web Services - an Update of Current Developments
Authors Thomas Kisler, Uwe Reichel, Florian Schiel, Christoph Draxler, Bernhard Jackl, Nina P{"o}rner
Abstract In 2012 the Bavarian Archive for Speech Signals started providing some of its tools from the field of spoken language in the form of Software as a Service (SaaS). This means users access the processing functionality over a web browser and therefore do not have to install complex software packages on a local computer. Amongst others, these tools include segmentation {&} labeling, grapheme-to-phoneme conversion, text alignment, syllabification and metadata generation, where all but the last are available for a variety of languages. Since its creation the number of available services and the web interface have changed considerably. We give an overview and a detailed description of the system architecture, the available web services and their functionality. Furthermore, we show how the number of files processed over the system developed in the last four years.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1614/
PDF https://www.aclweb.org/anthology/L16-1614
PWC https://paperswithcode.com/paper/bas-speech-science-web-services-an-update-of
Repo
Framework
comments powered by Disqus