May 5, 2019

1557 words 8 mins read

Paper Group NANR 67

The language of mental health problems in social media. Grammatical Error Correction: Machine Translation and Classifiers. Mining Call Center Conversations Exhibiting Similar Affective States. On the Possessor Interpretation of Non-Agentive Subjects. RALI System Description for CL-SciSumm 2016 Shared Task. Medical Concept Embeddings via Labeled Bac …


Title	The language of mental health problems in social media
Authors	George Gkotsis, Anika Oellrich, Tim Hubbard, Richard Dobson, Maria Liakata, Sumithra Velupillai, Rina Dutta
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0307/
PDF	https://www.aclweb.org/anthology/W16-0307
PWC	https://paperswithcode.com/paper/the-language-of-mental-health-problems-in
Repo
Framework

Grammatical Error Correction: Machine Translation and Classifiers


Title	Grammatical Error Correction: Machine Translation and Classifiers
Authors	Alla Rozovskaya, Dan Roth
Abstract
Tasks	Grammatical Error Correction, Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1208/
PDF	https://www.aclweb.org/anthology/P16-1208
PWC	https://paperswithcode.com/paper/grammatical-error-correction-machine
Repo
Framework

Mining Call Center Conversations Exhibiting Similar Affective States


Title	Mining Call Center Conversations Exhibiting Similar Affective States
Authors	Rupayan Chakraborty, P, harip, Meghna e, Sunil Kumar Kopparapu
Abstract
Tasks	Emotion Recognition
Published	2016-10-01
URL	https://www.aclweb.org/anthology/Y16-3028/
PDF	https://www.aclweb.org/anthology/Y16-3028
PWC	https://paperswithcode.com/paper/mining-call-center-conversations-exhibiting
Repo
Framework

On the Possessor Interpretation of Non-Agentive Subjects


Title	On the Possessor Interpretation of Non-Agentive Subjects
Authors	Tomokazu Takehisa
Abstract
Tasks
Published	2016-10-01
URL	https://www.aclweb.org/anthology/Y16-3014/
PDF	https://www.aclweb.org/anthology/Y16-3014
PWC	https://paperswithcode.com/paper/on-the-possessor-interpretation-of-non
Repo
Framework

RALI System Description for CL-SciSumm 2016 Shared Task


Title	RALI System Description for CL-SciSumm 2016 Shared Task
Authors	Bruno Malenfant, Guy Lapalme
Abstract
Tasks	Information Retrieval
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-1517/
PDF	https://www.aclweb.org/anthology/W16-1517
PWC	https://paperswithcode.com/paper/rali-system-description-for-cl-scisumm-2016
Repo
Framework

Medical Concept Embeddings via Labeled Background Corpora


Title	Medical Concept Embeddings via Labeled Background Corpora
Authors	Eneldo Loza Menc{'\i}a, Gerard de Melo, Jinseok Nam
Abstract	In recent years, we have seen an increasing amount of interest in low-dimensional vector representations of words. Among other things, these facilitate computing word similarity and relatedness scores. The most well-known example of algorithms to produce representations of this sort are the word2vec approaches. In this paper, we investigate a new model to induce such vector spaces for medical concepts, based on a joint objective that exploits not only word co-occurrences but also manually labeled documents, as available from sources such as PubMed. Our extensive experimental analysis shows that our embeddings lead to significantly higher correlations with human similarity and relatedness assessments than previous work. Due to the simplicity and versatility of vector representations, these findings suggest that our resource can easily be used as a drop-in replacement to improve any systems relying on medical concept similarity measures.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1733/
PDF	https://www.aclweb.org/anthology/L16-1733
PWC	https://paperswithcode.com/paper/medical-concept-embeddings-via-labeled
Repo
Framework

Mining Inference Formulas by Goal-Directed Random Walks


Title	Mining Inference Formulas by Goal-Directed Random Walks
Authors	Zhuoyu Wei, Jun Zhao, Kang Liu
Abstract
Tasks	Knowledge Base Completion
Published	2016-11-01
URL	https://www.aclweb.org/anthology/D16-1145/
PDF	https://www.aclweb.org/anthology/D16-1145
PWC	https://paperswithcode.com/paper/mining-inference-formulas-by-goal-directed
Repo
Framework

Automatic Verification and Augmentation of Multilingual Lexicons


Title	Automatic Verification and Augmentation of Multilingual Lexicons
Authors	Maryam Aminian, Mohamed Al-Badrashiny, Mona Diab
Abstract	We present an approach for automatic verification and augmentation of multilingual lexica. We exploit existing parallel and monolingual corpora to extract multilingual correspondents via tri-angulation. We demonstrate the efficacy of our approach on two publicly available resources: Tharwa, a three-way lexicon comprising Dialectal Arabic, Modern Standard Arabic and English lemmas among other information (Diab et al., 2014); and BabelNet, a multilingual thesaurus comprising over 276 languages including Arabic variant entries (Navigli and Ponzetto, 2012). Our automated approach yields an F1-score of 71.71{%} in generating correct multilingual correspondents against gold Tharwa, and 54.46{%} against gold BabelNet without any human intervention.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4810/
PDF	https://www.aclweb.org/anthology/W16-4810
PWC	https://paperswithcode.com/paper/automatic-verification-and-augmentation-of
Repo
Framework

Cultural Communication Idiosyncrasies in Human-Computer Interaction


Title	Cultural Communication Idiosyncrasies in Human-Computer Interaction
Authors	Juliana Miehle, Koichiro Yoshino, Louisa Pragst, Stefan Ultes, Satoshi Nakamura, Wolfgang Minker
Abstract
Tasks	Spoken Dialogue Systems
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-3610/
PDF	https://www.aclweb.org/anthology/W16-3610
PWC	https://paperswithcode.com/paper/cultural-communication-idiosyncrasies-in
Repo
Framework

LingoTurk: managing crowdsourced tasks for psycholinguistics


Title	LingoTurk: managing crowdsourced tasks for psycholinguistics
Authors	Florian Pusse, Asad Sayeed, Vera Demberg
Abstract
Tasks	Machine Translation
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-3012/
PDF	https://www.aclweb.org/anthology/N16-3012
PWC	https://paperswithcode.com/paper/lingoturk-managing-crowdsourced-tasks-for
Repo
Framework

A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research


Title	A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
Authors	Emre Yilmaz, Maaike Andringa, Sigrid Kingma, Jelske Dijkstra, Frits van der Kuip, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel, David van Leeuwen
Abstract	We present a new speech database containing 18.5 hours of annotated radio broadcasts in the Frisian language. Frisian is mostly spoken in the province Fryslan and it is the second official language of the Netherlands. The recordings are collected from the archives of Omrop Fryslan, the regional public broadcaster of the province Fryslan. The database covers almost a 50-year time span. The native speakers of Frisian are mostly bilingual and often code-switch in daily conversations due to the extensive influence of the Dutch language. Considering the longitudinal and code-switching nature of the data, an appropriate annotation protocol has been designed and the data is manually annotated with the orthographic transcription, speaker identities, dialect information, code-switching details and background noise/music information.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1739/
PDF	https://www.aclweb.org/anthology/L16-1739
PWC	https://paperswithcode.com/paper/a-longitudinal-bilingual-frisian-dutch-radio
Repo
Framework

Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese


Title	Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese
Authors	Yurie Iribe, Norihide Kitaoka, Shuhei Segawa
Abstract	We have constructed a new speech data corpus, using the utterances of 100 elderly Japanese people, to improve speech recognition accuracy of the speech of older people. Humanoid robots are being developed for use in elder care nursing homes. Interaction with such robots is expected to help maintain the cognitive abilities of nursing home residents, as well as providing them with companionship. In order for these robots to interact with elderly people through spoken dialogue, a high performance speech recognition system for speech of elderly people is needed. To develop such a system, we recorded speech uttered by 100 elderly Japanese, most of them are living in nursing homes, with an average age of 77.2. Previously, a seniors{'} speech corpus named S-JNAS was developed, but the average age of the participants was 67.6 years, but the target age for nursing home care is around 75 years old, much higher than that of the S-JNAS samples. In this paper we compare our new corpus with an existing Japanese read speech corpus, JNAS, which consists of adult speech, and with the above mentioned S-JNAS, the senior version of JNAS.
Tasks	Speech Recognition
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1741/
PDF	https://www.aclweb.org/anthology/L16-1741
PWC	https://paperswithcode.com/paper/speech-corpus-spoken-by-young-old-old-old-and
Repo
Framework

Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis


Title	Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis
Authors	Agnieszka Wagner, Katarzyna Klessa, Jolanta Bachan
Abstract	This paper reports on a new database ― Polish rhythmic database and tools developed with the aim of investigating timing phenomena and rhythmic structure in Polish including topics such as, inter alia, the effect of speaking style and tempo on timing patterns, phonotactic and phrasal properties of speech rhythm and stability of rhythm metrics. So far, 19 native and 12 non-native speakers with different first languages have been recorded. The collected speech data (5 h 14 min.) represents five different speaking styles and five different tempi. For the needs of speech corpus management, annotation and analysis, a database was developed and integrated with Annotation Pro (Klessa et al., 2013, Klessa, 2016). Currently, the database is the only resource for Polish which allows for a systematic study of a broad range of phenomena related to speech timing and rhythm. The paper also introduces new tools and methods developed to facilitate the database annotation and analysis with respect to various timing and rhythm measures. In the end, the results of an ongoing research and first experimental results using the new resources are reported and future work is sketched.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1742/
PDF	https://www.aclweb.org/anthology/L16-1742
PWC	https://paperswithcode.com/paper/polish-rhythmic-database-new-resources-for
Repo
Framework

Analyzing Post-dialogue Comments by Speakers – How Do Humans Personalize Their Utterances in Dialogue? –


Title	Analyzing Post-dialogue Comments by Speakers – How Do Humans Personalize Their Utterances in Dialogue? –
Authors	Toru Hirano, Ryuichiro Higashinaka, Yoshihiro Matsuo
Abstract
Tasks
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-3619/
PDF	https://www.aclweb.org/anthology/W16-3619
PWC	https://paperswithcode.com/paper/analyzing-post-dialogue-comments-by-speakers
Repo
Framework

Providing a Catalogue of Language Resources for Commercial Users


Title	Providing a Catalogue of Language Resources for Commercial Users
Authors	Bente Maegaard, Lina Henriksen, Andrew Joscelyne, Vesna Lusicky, Margaretha Mazura, Sussi Olsen, Claus Povlsen, Philippe Wacker
Abstract	Language resources (LR) are indispensable for the development of tools for machine translation (MT) or various kinds of computer-assisted translation (CAT). In particular language corpora, both parallel and monolingual are considered most important for instance for MT, not only SMT but also hybrid MT. The Language Technology Observatory will provide easy access to information about LRs deemed to be useful for MT and other translation tools through its LR Catalogue. In order to determine what aspects of an LR are useful for MT practitioners, a user study was made, providing a guide to the most relevant metadata and the most relevant quality criteria. We have seen that many resources exist which are useful for MT and similar work, but the majority are for (academic) research or educational use only, and as such not available for commercial use. Our work has revealed a list of gaps: coverage gap, awareness gap, quality gap, quantity gap. The paper ends with recommendations for a forward-looking strategy.
Tasks	Machine Translation
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1072/
PDF	https://www.aclweb.org/anthology/L16-1072
PWC	https://paperswithcode.com/paper/providing-a-catalogue-of-language-resources
Repo
Framework