May 4, 2019

1960 words 10 mins read

Paper Group NANR 204

Paper Group NANR 204

Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding. Identification, characterization, and grounding of gradable terms in clinical text. Comparison of Emotional Understanding in Modality-Controlled Environments using Multimodal Online Emotional Communication Corpus. Strategy and Policy Learning for Non-Task-Oriented …

Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding

Title Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding
Authors Yangyang Shi, Kaisheng Yao, Hu Chen, Dong Yu, Yi-Cheng Pan, Mei-Yuh Hwang
Abstract
Tasks Chunking, Spoken Language Understanding
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1044/
PDF https://www.aclweb.org/anthology/N16-1044
PWC https://paperswithcode.com/paper/recurrent-support-vector-machines-for-slot
Repo
Framework

Identification, characterization, and grounding of gradable terms in clinical text

Title Identification, characterization, and grounding of gradable terms in clinical text
Authors Chaitanya Shivade, Marie-Catherine de Marneffe, Eric Fosler-Lussier, Albert M. Lai
Abstract
Tasks Decision Making
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2903/
PDF https://www.aclweb.org/anthology/W16-2903
PWC https://paperswithcode.com/paper/identification-characterization-and-grounding
Repo
Framework

Comparison of Emotional Understanding in Modality-Controlled Environments using Multimodal Online Emotional Communication Corpus

Title Comparison of Emotional Understanding in Modality-Controlled Environments using Multimodal Online Emotional Communication Corpus
Authors Yoshiko Arimoto, Kazuo Okanoya
Abstract In online computer-mediated communication, speakers were considered to have experienced difficulties in catching their partner{'}s emotions and in conveying their own emotions. To explain why online emotional communication is so difficult and to investigate how this problem should be solved, multimodal online emotional communication corpus was constructed by recording approximately 100 speakers{'} emotional expressions and reactions in a modality-controlled environment. Speakers communicated over the Internet using video chat, voice chat or text chat; their face-to-face conversations were used for comparison purposes. The corpora incorporated emotional labels by evaluating the speaker{'}s dynamic emotional states and the measurements of the speaker{'}s facial expression, vocal expression and autonomic nervous system activity. For the initial study of this project, which used a large-scale emotional communication corpus, the accuracy of online emotional understanding was assessed to demonstrate the emotional labels evaluated by the speakers and to summarize the speaker{'}s answers on the questionnaire regarding the difference between an online chat and face-to-face conversations in which they actually participated. The results revealed that speakers have difficulty communicating their emotions in online communication environments, regardless of the type of communication modality and that inaccurate emotional understanding occurs more frequently in online computer-mediated communication than in face-to-face communication.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1343/
PDF https://www.aclweb.org/anthology/L16-1343
PWC https://paperswithcode.com/paper/comparison-of-emotional-understanding-in
Repo
Framework

Strategy and Policy Learning for Non-Task-Oriented Conversational Systems

Title Strategy and Policy Learning for Non-Task-Oriented Conversational Systems
Authors Zhou Yu, Ziyu Xu, Alan W Black, Alex Rudnicky, er
Abstract
Tasks Machine Translation
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3649/
PDF https://www.aclweb.org/anthology/W16-3649
PWC https://paperswithcode.com/paper/strategy-and-policy-learning-for-non-task
Repo
Framework

Deep LSTM based Feature Mapping for Query Classification

Title Deep LSTM based Feature Mapping for Query Classification
Authors Yangyang Shi, Kaisheng Yao, Le Tian, Daxin Jiang
Abstract
Tasks Speech Recognition
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-1176/
PDF https://www.aclweb.org/anthology/N16-1176
PWC https://paperswithcode.com/paper/deep-lstm-based-feature-mapping-for-query
Repo
Framework

Sinhala Short Sentence Similarity Calculation using Corpus-Based and Knowledge-Based Similarity Measures

Title Sinhala Short Sentence Similarity Calculation using Corpus-Based and Knowledge-Based Similarity Measures
Authors Jcs Kadupitiya, Surangika Ranathunga, Gihan Dias
Abstract Currently, corpus based-similarity, string-based similarity, and knowledge-based similarity techniques are used to compare short phrases. However, no work has been conducted on the similarity of phrases in Sinhala language. In this paper, we present a hybrid methodology to compute the similarity between two Sinhala sentences using a Semantic Similarity Measurement technique (corpus-based similarity measurement plus knowledge-based similarity measurement) that makes use of word order information. Since Sinhala WordNet is still under construction, we used lexical resources in performing this semantic similarity calculation. Evaluation using 4000 sentence pairs yielded an average MSE of 0.145 and a Pearson correla-tion factor of 0.832.
Tasks Semantic Similarity, Semantic Textual Similarity, Word Sense Disambiguation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3705/
PDF https://www.aclweb.org/anthology/W16-3705
PWC https://paperswithcode.com/paper/sinhala-short-sentence-similarity-calculation
Repo
Framework

WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words

Title WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words
Authors Luisa Bentivogli, Mauro Cettolo, M. Amin Farajian, Marcello Federico
Abstract This paper presents WAGS (Word Alignment Gold Standard), a novel benchmark which allows extensive evaluation of WA tools on out-of-vocabulary (OOV) and rare words. WAGS is a subset of the Common Test section of the Europarl English-Italian parallel corpus, and is specifically tailored to OOV and rare words. WAGS is composed of 6,715 sentence pairs containing 11,958 occurrences of OOV and rare words up to frequency 15 in the Europarl Training set (5,080 English words and 6,878 Italian words), representing almost 3{%} of the whole text. Since WAGS is focused on OOV/rare words, manual alignments are provided for these words only, and not for the whole sentences. Two off-the-shelf word aligners have been evaluated on WAGS, and results have been compared to those obtained on an existing benchmark tailored to full text alignment. The results obtained confirm that WAGS is a valuable resource, which allows a statistically sound evaluation of WA systems{'} performance on OOV and rare words, as well as extensive data analyses. WAGS is publicly released under a Creative Commons Attribution license.
Tasks Word Alignment
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1562/
PDF https://www.aclweb.org/anthology/L16-1562
PWC https://paperswithcode.com/paper/wags-a-beautiful-english-italian-benchmark
Repo
Framework

Crowdsourcing-based Annotation of Emotions in Filipino and English Tweets

Title Crowdsourcing-based Annotation of Emotions in Filipino and English Tweets
Authors Fermin Roberto Lapitan, Riza Theresa Batista-Navarro, Eliezer Albacea
Abstract The automatic analysis of emotions conveyed in social media content, e.g., tweets, has many beneficial applications. In the Philippines, one of the most disaster-prone countries in the world, such methods could potentially enable first responders to make timely decisions despite the risk of data deluge. However, recognising emotions expressed in Philippine-generated tweets, which are mostly written in Filipino, English or a mix of both, is a non-trivial task. In order to facilitate the development of natural language processing (NLP) methods that will automate such type of analysis, we have built a corpus of tweets whose predominant emotions have been manually annotated by means of crowdsourcing. Defining measures ensuring that only high-quality annotations were retained, we have produced a gold standard corpus of 1,146 emotion-labelled Filipino and English tweets. We validate the value of this manually produced resource by demonstrating that an automatic emotion-prediction method based on the use of a publicly available word-emotion association lexicon was unable to reproduce the labels assigned via crowdsourcing. While we are planning to make a few extensions to the corpus in the near future, its current version has been made publicly available in order to foster the development of emotion analysis methods based on advanced Filipino and English NLP.
Tasks Emotion Recognition
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3708/
PDF https://www.aclweb.org/anthology/W16-3708
PWC https://paperswithcode.com/paper/crowdsourcing-based-annotation-of-emotions-in
Repo
Framework

Automatic Creation of a Sentence Aligned Sinhala-Tamil Parallel Corpus

Title Automatic Creation of a Sentence Aligned Sinhala-Tamil Parallel Corpus
Authors Riyafa Abdul Hameed, Nadeeshani Pathirennehelage, Anusha Ihalapathirana, Maryam Ziyad Mohamed, Surangika Ranathunga, Sanath Jayasena, Gihan Dias, Fern, S o, areka
Abstract A sentence aligned parallel corpus is an important prerequisite in statistical machine translation. However, manual creation of such a parallel corpus is time consuming, and requires experts fluent in both languages. Automatic creation of a sentence aligned parallel corpus using parallel text is the solution to this problem. In this paper, we present the first ever empirical evaluation carried out to identify the best method to automatically create a sentence aligned Sinhala-Tamil parallel corpus. Annual reports from Sri Lankan government institutions were used as the parallel text for aligning. Despite both Sinhala and Tamil being under-resourced languages, we were able to achieve an F-score value of 0.791 using a hybrid approach that makes use of a bilingual dictionary.
Tasks Machine Translation, Word Alignment
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3713/
PDF https://www.aclweb.org/anthology/W16-3713
PWC https://paperswithcode.com/paper/automatic-creation-of-a-sentence-aligned
Repo
Framework

Clustering-based Phonetic Projection in Mismatched Crowdsourcing Channels for Low-resourced ASR

Title Clustering-based Phonetic Projection in Mismatched Crowdsourcing Channels for Low-resourced ASR
Authors Wenda Chen, Mark Hasegawa-Johnson, Nancy Chen, Preethi Jyothi, Lav Varshney
Abstract Acquiring labeled speech for low-resource languages is a difficult task in the absence of native speakers of the language. One solution to this problem involves collecting speech transcriptions from crowd workers who are foreign or non-native speakers of a given target language. From these mismatched transcriptions, one can derive probabilistic phone transcriptions that are defined over the set of all target language phones using a noisy channel model. This paper extends prior work on deriving probabilistic transcriptions (PTs) from mismatched transcriptions by 1) modelling multilingual channels and 2) introducing a clustering-based phonetic mapping technique to improve the quality of PTs. Mismatched crowdsourcing for multilingual channels has certain properties of projection mapping, e.g., it can be interpreted as a clustering based on singular value decomposition of the segment alignments. To this end, we explore the use of distinctive feature weights, lexical tone confusions, and a two-step clustering algorithm to learn projections of phoneme segments from mismatched multilingual transcriber languages to the target language. We evaluate our techniques using mismatched transcriptions for Cantonese speech acquired from native English and Mandarin speakers. We observe a 5-9{%} relative reduction in phone error rate for the predicted Cantonese phone transcriptions using our proposed techniques compared with the previous PT method.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-3714/
PDF https://www.aclweb.org/anthology/W16-3714
PWC https://paperswithcode.com/paper/clustering-based-phonetic-projection-in
Repo
Framework

DISCO: A System Leveraging Semantic Search in Document Review

Title DISCO: A System Leveraging Semantic Search in Document Review
Authors Ngoc Phuoc An Vo, Fabien Guillot, Caroline Privault
Abstract This paper presents Disco, a prototype for supporting knowledge workers in exploring, reviewing and sorting collections of textual data. The goal is to facilitate, accelerate and improve the discovery of information. To this end, it combines Semantic Relatedness techniques with a review workflow developed in a tangible environment. Disco uses a semantic model that is leveraged on-line in the course of search sessions, and accessed through natural hand-gesture, in a simple and intuitive way.
Tasks Text Categorization, Text Clustering
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2014/
PDF https://www.aclweb.org/anthology/C16-2014
PWC https://paperswithcode.com/paper/disco-a-system-leveraging-semantic-search-in
Repo
Framework

Accounting ngrams and multi-word terms can improve topic models

Title Accounting ngrams and multi-word terms can improve topic models
Authors Michael Nokel, Natalia Loukachevitch
Abstract
Tasks Information Retrieval, Text Clustering, Topic Models, Word Sense Disambiguation
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-1806/
PDF https://www.aclweb.org/anthology/W16-1806
PWC https://paperswithcode.com/paper/accounting-ngrams-and-multi-word-terms-can
Repo
Framework

Improve Sentiment Analysis of Citations with Author Modelling

Title Improve Sentiment Analysis of Citations with Author Modelling
Authors Zheng Ma, Jinseok Nam, Karsten Weihe
Abstract
Tasks Sentiment Analysis
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0420/
PDF https://www.aclweb.org/anthology/W16-0420
PWC https://paperswithcode.com/paper/improve-sentiment-analysis-of-citations-with
Repo
Framework

Active Learning with Oracle Epiphany

Title Active Learning with Oracle Epiphany
Authors Tzu-Kuo Huang, Lihong Li, Ara Vartanian, Saleema Amershi, Jerry Zhu
Abstract We present a theoretical analysis of active learning with more realistic interactions with human oracles. Previous empirical studies have shown oracles abstaining on difficult queries until accumulating enough information to make label decisions. We formalize this phenomenon with an “oracle epiphany model” and analyze active learning query complexity under such oracles for both the realizable and the agnos- tic cases. Our analysis shows that active learning is possible with oracle epiphany, but incurs an additional cost depending on when the epiphany happens. Our results suggest new, principled active learning approaches with realistic oracles.
Tasks Active Learning
Published 2016-12-01
URL http://papers.nips.cc/paper/6155-active-learning-with-oracle-epiphany
PDF http://papers.nips.cc/paper/6155-active-learning-with-oracle-epiphany.pdf
PWC https://paperswithcode.com/paper/active-learning-with-oracle-epiphany
Repo
Framework

Coreference Annotation Scheme and Relation Types for Hindi

Title Coreference Annotation Scheme and Relation Types for Hindi
Authors V Mujadia, an, Palash Gupta, Dipti Misra Sharma
Abstract This paper describes a coreference annotation scheme, coreference annotation specific issues and their solutions through our proposed annotation scheme for Hindi. We introduce different co-reference relation types between continuous mentions of the same coreference chain such as {}Part-of{''}, {}Function-value pair{''} etc. We used Jaccard similarity based Krippendorff{`}s{'} alpha to demonstrate consistency in annotation scheme, annotation and corpora. To ease the coreference annotation process, we built a semi-automatic Coreference Annotation Tool (CAT). We also provide statistics of coreference annotation on Hindi Dependency Treebank (HDTB). |
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1025/
PDF https://www.aclweb.org/anthology/L16-1025
PWC https://paperswithcode.com/paper/coreference-annotation-scheme-and-relation
Repo
Framework
comments powered by Disqus