May 5, 2019

1557 words 8 mins read

Paper Group NANR 67

Paper Group NANR 67

The language of mental health problems in social media. Grammatical Error Correction: Machine Translation and Classifiers. Mining Call Center Conversations Exhibiting Similar Affective States. On the Possessor Interpretation of Non-Agentive Subjects. RALI System Description for CL-SciSumm 2016 Shared Task. Medical Concept Embeddings via Labeled Bac …

The language of mental health problems in social media

Title The language of mental health problems in social media
Authors George Gkotsis, Anika Oellrich, Tim Hubbard, Richard Dobson, Maria Liakata, Sumithra Velupillai, Rina Dutta
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0307/
PDF https://www.aclweb.org/anthology/W16-0307
PWC https://paperswithcode.com/paper/the-language-of-mental-health-problems-in
Repo
Framework

Grammatical Error Correction: Machine Translation and Classifiers

Title Grammatical Error Correction: Machine Translation and Classifiers
Authors Alla Rozovskaya, Dan Roth
Abstract
Tasks Grammatical Error Correction, Machine Translation
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-1208/
PDF https://www.aclweb.org/anthology/P16-1208
PWC https://paperswithcode.com/paper/grammatical-error-correction-machine
Repo
Framework

Mining Call Center Conversations Exhibiting Similar Affective States

Title Mining Call Center Conversations Exhibiting Similar Affective States
Authors Rupayan Chakraborty, P, harip, Meghna e, Sunil Kumar Kopparapu
Abstract
Tasks Emotion Recognition
Published 2016-10-01
URL https://www.aclweb.org/anthology/Y16-3028/
PDF https://www.aclweb.org/anthology/Y16-3028
PWC https://paperswithcode.com/paper/mining-call-center-conversations-exhibiting
Repo
Framework

On the Possessor Interpretation of Non-Agentive Subjects

Title On the Possessor Interpretation of Non-Agentive Subjects
Authors Tomokazu Takehisa
Abstract
Tasks
Published 2016-10-01
URL https://www.aclweb.org/anthology/Y16-3014/
PDF https://www.aclweb.org/anthology/Y16-3014
PWC https://paperswithcode.com/paper/on-the-possessor-interpretation-of-non
Repo
Framework

RALI System Description for CL-SciSumm 2016 Shared Task

Title RALI System Description for CL-SciSumm 2016 Shared Task
Authors Bruno Malenfant, Guy Lapalme
Abstract
Tasks Information Retrieval
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1517/
PDF https://www.aclweb.org/anthology/W16-1517
PWC https://paperswithcode.com/paper/rali-system-description-for-cl-scisumm-2016
Repo
Framework

Medical Concept Embeddings via Labeled Background Corpora

Title Medical Concept Embeddings via Labeled Background Corpora
Authors Eneldo Loza Menc{'\i}a, Gerard de Melo, Jinseok Nam
Abstract In recent years, we have seen an increasing amount of interest in low-dimensional vector representations of words. Among other things, these facilitate computing word similarity and relatedness scores. The most well-known example of algorithms to produce representations of this sort are the word2vec approaches. In this paper, we investigate a new model to induce such vector spaces for medical concepts, based on a joint objective that exploits not only word co-occurrences but also manually labeled documents, as available from sources such as PubMed. Our extensive experimental analysis shows that our embeddings lead to significantly higher correlations with human similarity and relatedness assessments than previous work. Due to the simplicity and versatility of vector representations, these findings suggest that our resource can easily be used as a drop-in replacement to improve any systems relying on medical concept similarity measures.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1733/
PDF https://www.aclweb.org/anthology/L16-1733
PWC https://paperswithcode.com/paper/medical-concept-embeddings-via-labeled
Repo
Framework

Mining Inference Formulas by Goal-Directed Random Walks

Title Mining Inference Formulas by Goal-Directed Random Walks
Authors Zhuoyu Wei, Jun Zhao, Kang Liu
Abstract
Tasks Knowledge Base Completion
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1145/
PDF https://www.aclweb.org/anthology/D16-1145
PWC https://paperswithcode.com/paper/mining-inference-formulas-by-goal-directed
Repo
Framework

Automatic Verification and Augmentation of Multilingual Lexicons

Title Automatic Verification and Augmentation of Multilingual Lexicons
Authors Maryam Aminian, Mohamed Al-Badrashiny, Mona Diab
Abstract We present an approach for automatic verification and augmentation of multilingual lexica. We exploit existing parallel and monolingual corpora to extract multilingual correspondents via tri-angulation. We demonstrate the efficacy of our approach on two publicly available resources: Tharwa, a three-way lexicon comprising Dialectal Arabic, Modern Standard Arabic and English lemmas among other information (Diab et al., 2014); and BabelNet, a multilingual thesaurus comprising over 276 languages including Arabic variant entries (Navigli and Ponzetto, 2012). Our automated approach yields an F1-score of 71.71{%} in generating correct multilingual correspondents against gold Tharwa, and 54.46{%} against gold BabelNet without any human intervention.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4810/
PDF https://www.aclweb.org/anthology/W16-4810
PWC https://paperswithcode.com/paper/automatic-verification-and-augmentation-of
Repo
Framework

Cultural Communication Idiosyncrasies in Human-Computer Interaction

Title Cultural Communication Idiosyncrasies in Human-Computer Interaction
Authors Juliana Miehle, Koichiro Yoshino, Louisa Pragst, Stefan Ultes, Satoshi Nakamura, Wolfgang Minker
Abstract
Tasks Spoken Dialogue Systems
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3610/
PDF https://www.aclweb.org/anthology/W16-3610
PWC https://paperswithcode.com/paper/cultural-communication-idiosyncrasies-in
Repo
Framework

LingoTurk: managing crowdsourced tasks for psycholinguistics

Title LingoTurk: managing crowdsourced tasks for psycholinguistics
Authors Florian Pusse, Asad Sayeed, Vera Demberg
Abstract
Tasks Machine Translation
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-3012/
PDF https://www.aclweb.org/anthology/N16-3012
PWC https://paperswithcode.com/paper/lingoturk-managing-crowdsourced-tasks-for
Repo
Framework

A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research

Title A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
Authors Emre Yilmaz, Maaike Andringa, Sigrid Kingma, Jelske Dijkstra, Frits van der Kuip, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel, David van Leeuwen
Abstract We present a new speech database containing 18.5 hours of annotated radio broadcasts in the Frisian language. Frisian is mostly spoken in the province Fryslan and it is the second official language of the Netherlands. The recordings are collected from the archives of Omrop Fryslan, the regional public broadcaster of the province Fryslan. The database covers almost a 50-year time span. The native speakers of Frisian are mostly bilingual and often code-switch in daily conversations due to the extensive influence of the Dutch language. Considering the longitudinal and code-switching nature of the data, an appropriate annotation protocol has been designed and the data is manually annotated with the orthographic transcription, speaker identities, dialect information, code-switching details and background noise/music information.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1739/
PDF https://www.aclweb.org/anthology/L16-1739
PWC https://paperswithcode.com/paper/a-longitudinal-bilingual-frisian-dutch-radio
Repo
Framework

Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese

Title Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese
Authors Yurie Iribe, Norihide Kitaoka, Shuhei Segawa
Abstract We have constructed a new speech data corpus, using the utterances of 100 elderly Japanese people, to improve speech recognition accuracy of the speech of older people. Humanoid robots are being developed for use in elder care nursing homes. Interaction with such robots is expected to help maintain the cognitive abilities of nursing home residents, as well as providing them with companionship. In order for these robots to interact with elderly people through spoken dialogue, a high performance speech recognition system for speech of elderly people is needed. To develop such a system, we recorded speech uttered by 100 elderly Japanese, most of them are living in nursing homes, with an average age of 77.2. Previously, a seniors{'} speech corpus named S-JNAS was developed, but the average age of the participants was 67.6 years, but the target age for nursing home care is around 75 years old, much higher than that of the S-JNAS samples. In this paper we compare our new corpus with an existing Japanese read speech corpus, JNAS, which consists of adult speech, and with the above mentioned S-JNAS, the senior version of JNAS.
Tasks Speech Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1741/
PDF https://www.aclweb.org/anthology/L16-1741
PWC https://paperswithcode.com/paper/speech-corpus-spoken-by-young-old-old-old-and
Repo
Framework

Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis

Title Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis
Authors Agnieszka Wagner, Katarzyna Klessa, Jolanta Bachan
Abstract This paper reports on a new database ― Polish rhythmic database and tools developed with the aim of investigating timing phenomena and rhythmic structure in Polish including topics such as, inter alia, the effect of speaking style and tempo on timing patterns, phonotactic and phrasal properties of speech rhythm and stability of rhythm metrics. So far, 19 native and 12 non-native speakers with different first languages have been recorded. The collected speech data (5 h 14 min.) represents five different speaking styles and five different tempi. For the needs of speech corpus management, annotation and analysis, a database was developed and integrated with Annotation Pro (Klessa et al., 2013, Klessa, 2016). Currently, the database is the only resource for Polish which allows for a systematic study of a broad range of phenomena related to speech timing and rhythm. The paper also introduces new tools and methods developed to facilitate the database annotation and analysis with respect to various timing and rhythm measures. In the end, the results of an ongoing research and first experimental results using the new resources are reported and future work is sketched.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1742/
PDF https://www.aclweb.org/anthology/L16-1742
PWC https://paperswithcode.com/paper/polish-rhythmic-database-new-resources-for
Repo
Framework

Analyzing Post-dialogue Comments by Speakers – How Do Humans Personalize Their Utterances in Dialogue? –

Title Analyzing Post-dialogue Comments by Speakers – How Do Humans Personalize Their Utterances in Dialogue? –
Authors Toru Hirano, Ryuichiro Higashinaka, Yoshihiro Matsuo
Abstract
Tasks
Published 2016-09-01
URL https://www.aclweb.org/anthology/W16-3619/
PDF https://www.aclweb.org/anthology/W16-3619
PWC https://paperswithcode.com/paper/analyzing-post-dialogue-comments-by-speakers
Repo
Framework

Providing a Catalogue of Language Resources for Commercial Users

Title Providing a Catalogue of Language Resources for Commercial Users
Authors Bente Maegaard, Lina Henriksen, Andrew Joscelyne, Vesna Lusicky, Margaretha Mazura, Sussi Olsen, Claus Povlsen, Philippe Wacker
Abstract Language resources (LR) are indispensable for the development of tools for machine translation (MT) or various kinds of computer-assisted translation (CAT). In particular language corpora, both parallel and monolingual are considered most important for instance for MT, not only SMT but also hybrid MT. The Language Technology Observatory will provide easy access to information about LRs deemed to be useful for MT and other translation tools through its LR Catalogue. In order to determine what aspects of an LR are useful for MT practitioners, a user study was made, providing a guide to the most relevant metadata and the most relevant quality criteria. We have seen that many resources exist which are useful for MT and similar work, but the majority are for (academic) research or educational use only, and as such not available for commercial use. Our work has revealed a list of gaps: coverage gap, awareness gap, quality gap, quantity gap. The paper ends with recommendations for a forward-looking strategy.
Tasks Machine Translation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1072/
PDF https://www.aclweb.org/anthology/L16-1072
PWC https://paperswithcode.com/paper/providing-a-catalogue-of-language-resources
Repo
Framework
comments powered by Disqus