Paper Group NANR 67
The language of mental health problems in social media. Grammatical Error Correction: Machine Translation and Classifiers. Mining Call Center Conversations Exhibiting Similar Affective States. On the Possessor Interpretation of Non-Agentive Subjects. RALI System Description for CL-SciSumm 2016 Shared Task. Medical Concept Embeddings via Labeled Bac …
The language of mental health problems in social media
Title | The language of mental health problems in social media |
Authors | George Gkotsis, Anika Oellrich, Tim Hubbard, Richard Dobson, Maria Liakata, Sumithra Velupillai, Rina Dutta |
Abstract | |
Tasks | |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0307/ |
https://www.aclweb.org/anthology/W16-0307 | |
PWC | https://paperswithcode.com/paper/the-language-of-mental-health-problems-in |
Repo | |
Framework | |
Grammatical Error Correction: Machine Translation and Classifiers
Title | Grammatical Error Correction: Machine Translation and Classifiers |
Authors | Alla Rozovskaya, Dan Roth |
Abstract | |
Tasks | Grammatical Error Correction, Machine Translation |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/P16-1208/ |
https://www.aclweb.org/anthology/P16-1208 | |
PWC | https://paperswithcode.com/paper/grammatical-error-correction-machine |
Repo | |
Framework | |
Mining Call Center Conversations Exhibiting Similar Affective States
Title | Mining Call Center Conversations Exhibiting Similar Affective States |
Authors | Rupayan Chakraborty, P, harip, Meghna e, Sunil Kumar Kopparapu |
Abstract | |
Tasks | Emotion Recognition |
Published | 2016-10-01 |
URL | https://www.aclweb.org/anthology/Y16-3028/ |
https://www.aclweb.org/anthology/Y16-3028 | |
PWC | https://paperswithcode.com/paper/mining-call-center-conversations-exhibiting |
Repo | |
Framework | |
On the Possessor Interpretation of Non-Agentive Subjects
Title | On the Possessor Interpretation of Non-Agentive Subjects |
Authors | Tomokazu Takehisa |
Abstract | |
Tasks | |
Published | 2016-10-01 |
URL | https://www.aclweb.org/anthology/Y16-3014/ |
https://www.aclweb.org/anthology/Y16-3014 | |
PWC | https://paperswithcode.com/paper/on-the-possessor-interpretation-of-non |
Repo | |
Framework | |
RALI System Description for CL-SciSumm 2016 Shared Task
Title | RALI System Description for CL-SciSumm 2016 Shared Task |
Authors | Bruno Malenfant, Guy Lapalme |
Abstract | |
Tasks | Information Retrieval |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-1517/ |
https://www.aclweb.org/anthology/W16-1517 | |
PWC | https://paperswithcode.com/paper/rali-system-description-for-cl-scisumm-2016 |
Repo | |
Framework | |
Medical Concept Embeddings via Labeled Background Corpora
Title | Medical Concept Embeddings via Labeled Background Corpora |
Authors | Eneldo Loza Menc{'\i}a, Gerard de Melo, Jinseok Nam |
Abstract | In recent years, we have seen an increasing amount of interest in low-dimensional vector representations of words. Among other things, these facilitate computing word similarity and relatedness scores. The most well-known example of algorithms to produce representations of this sort are the word2vec approaches. In this paper, we investigate a new model to induce such vector spaces for medical concepts, based on a joint objective that exploits not only word co-occurrences but also manually labeled documents, as available from sources such as PubMed. Our extensive experimental analysis shows that our embeddings lead to significantly higher correlations with human similarity and relatedness assessments than previous work. Due to the simplicity and versatility of vector representations, these findings suggest that our resource can easily be used as a drop-in replacement to improve any systems relying on medical concept similarity measures. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1733/ |
https://www.aclweb.org/anthology/L16-1733 | |
PWC | https://paperswithcode.com/paper/medical-concept-embeddings-via-labeled |
Repo | |
Framework | |
Mining Inference Formulas by Goal-Directed Random Walks
Title | Mining Inference Formulas by Goal-Directed Random Walks |
Authors | Zhuoyu Wei, Jun Zhao, Kang Liu |
Abstract | |
Tasks | Knowledge Base Completion |
Published | 2016-11-01 |
URL | https://www.aclweb.org/anthology/D16-1145/ |
https://www.aclweb.org/anthology/D16-1145 | |
PWC | https://paperswithcode.com/paper/mining-inference-formulas-by-goal-directed |
Repo | |
Framework | |
Automatic Verification and Augmentation of Multilingual Lexicons
Title | Automatic Verification and Augmentation of Multilingual Lexicons |
Authors | Maryam Aminian, Mohamed Al-Badrashiny, Mona Diab |
Abstract | We present an approach for automatic verification and augmentation of multilingual lexica. We exploit existing parallel and monolingual corpora to extract multilingual correspondents via tri-angulation. We demonstrate the efficacy of our approach on two publicly available resources: Tharwa, a three-way lexicon comprising Dialectal Arabic, Modern Standard Arabic and English lemmas among other information (Diab et al., 2014); and BabelNet, a multilingual thesaurus comprising over 276 languages including Arabic variant entries (Navigli and Ponzetto, 2012). Our automated approach yields an F1-score of 71.71{%} in generating correct multilingual correspondents against gold Tharwa, and 54.46{%} against gold BabelNet without any human intervention. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4810/ |
https://www.aclweb.org/anthology/W16-4810 | |
PWC | https://paperswithcode.com/paper/automatic-verification-and-augmentation-of |
Repo | |
Framework | |
Cultural Communication Idiosyncrasies in Human-Computer Interaction
Title | Cultural Communication Idiosyncrasies in Human-Computer Interaction |
Authors | Juliana Miehle, Koichiro Yoshino, Louisa Pragst, Stefan Ultes, Satoshi Nakamura, Wolfgang Minker |
Abstract | |
Tasks | Spoken Dialogue Systems |
Published | 2016-09-01 |
URL | https://www.aclweb.org/anthology/W16-3610/ |
https://www.aclweb.org/anthology/W16-3610 | |
PWC | https://paperswithcode.com/paper/cultural-communication-idiosyncrasies-in |
Repo | |
Framework | |
LingoTurk: managing crowdsourced tasks for psycholinguistics
Title | LingoTurk: managing crowdsourced tasks for psycholinguistics |
Authors | Florian Pusse, Asad Sayeed, Vera Demberg |
Abstract | |
Tasks | Machine Translation |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/N16-3012/ |
https://www.aclweb.org/anthology/N16-3012 | |
PWC | https://paperswithcode.com/paper/lingoturk-managing-crowdsourced-tasks-for |
Repo | |
Framework | |
A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
Title | A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research |
Authors | Emre Yilmaz, Maaike Andringa, Sigrid Kingma, Jelske Dijkstra, Frits van der Kuip, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel, David van Leeuwen |
Abstract | We present a new speech database containing 18.5 hours of annotated radio broadcasts in the Frisian language. Frisian is mostly spoken in the province Fryslan and it is the second official language of the Netherlands. The recordings are collected from the archives of Omrop Fryslan, the regional public broadcaster of the province Fryslan. The database covers almost a 50-year time span. The native speakers of Frisian are mostly bilingual and often code-switch in daily conversations due to the extensive influence of the Dutch language. Considering the longitudinal and code-switching nature of the data, an appropriate annotation protocol has been designed and the data is manually annotated with the orthographic transcription, speaker identities, dialect information, code-switching details and background noise/music information. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1739/ |
https://www.aclweb.org/anthology/L16-1739 | |
PWC | https://paperswithcode.com/paper/a-longitudinal-bilingual-frisian-dutch-radio |
Repo | |
Framework | |
Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese
Title | Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese |
Authors | Yurie Iribe, Norihide Kitaoka, Shuhei Segawa |
Abstract | We have constructed a new speech data corpus, using the utterances of 100 elderly Japanese people, to improve speech recognition accuracy of the speech of older people. Humanoid robots are being developed for use in elder care nursing homes. Interaction with such robots is expected to help maintain the cognitive abilities of nursing home residents, as well as providing them with companionship. In order for these robots to interact with elderly people through spoken dialogue, a high performance speech recognition system for speech of elderly people is needed. To develop such a system, we recorded speech uttered by 100 elderly Japanese, most of them are living in nursing homes, with an average age of 77.2. Previously, a seniors{'} speech corpus named S-JNAS was developed, but the average age of the participants was 67.6 years, but the target age for nursing home care is around 75 years old, much higher than that of the S-JNAS samples. In this paper we compare our new corpus with an existing Japanese read speech corpus, JNAS, which consists of adult speech, and with the above mentioned S-JNAS, the senior version of JNAS. |
Tasks | Speech Recognition |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1741/ |
https://www.aclweb.org/anthology/L16-1741 | |
PWC | https://paperswithcode.com/paper/speech-corpus-spoken-by-young-old-old-old-and |
Repo | |
Framework | |
Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis
Title | Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis |
Authors | Agnieszka Wagner, Katarzyna Klessa, Jolanta Bachan |
Abstract | This paper reports on a new database ― Polish rhythmic database and tools developed with the aim of investigating timing phenomena and rhythmic structure in Polish including topics such as, inter alia, the effect of speaking style and tempo on timing patterns, phonotactic and phrasal properties of speech rhythm and stability of rhythm metrics. So far, 19 native and 12 non-native speakers with different first languages have been recorded. The collected speech data (5 h 14 min.) represents five different speaking styles and five different tempi. For the needs of speech corpus management, annotation and analysis, a database was developed and integrated with Annotation Pro (Klessa et al., 2013, Klessa, 2016). Currently, the database is the only resource for Polish which allows for a systematic study of a broad range of phenomena related to speech timing and rhythm. The paper also introduces new tools and methods developed to facilitate the database annotation and analysis with respect to various timing and rhythm measures. In the end, the results of an ongoing research and first experimental results using the new resources are reported and future work is sketched. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1742/ |
https://www.aclweb.org/anthology/L16-1742 | |
PWC | https://paperswithcode.com/paper/polish-rhythmic-database-new-resources-for |
Repo | |
Framework | |
Analyzing Post-dialogue Comments by Speakers – How Do Humans Personalize Their Utterances in Dialogue? –
Title | Analyzing Post-dialogue Comments by Speakers – How Do Humans Personalize Their Utterances in Dialogue? – |
Authors | Toru Hirano, Ryuichiro Higashinaka, Yoshihiro Matsuo |
Abstract | |
Tasks | |
Published | 2016-09-01 |
URL | https://www.aclweb.org/anthology/W16-3619/ |
https://www.aclweb.org/anthology/W16-3619 | |
PWC | https://paperswithcode.com/paper/analyzing-post-dialogue-comments-by-speakers |
Repo | |
Framework | |
Providing a Catalogue of Language Resources for Commercial Users
Title | Providing a Catalogue of Language Resources for Commercial Users |
Authors | Bente Maegaard, Lina Henriksen, Andrew Joscelyne, Vesna Lusicky, Margaretha Mazura, Sussi Olsen, Claus Povlsen, Philippe Wacker |
Abstract | Language resources (LR) are indispensable for the development of tools for machine translation (MT) or various kinds of computer-assisted translation (CAT). In particular language corpora, both parallel and monolingual are considered most important for instance for MT, not only SMT but also hybrid MT. The Language Technology Observatory will provide easy access to information about LRs deemed to be useful for MT and other translation tools through its LR Catalogue. In order to determine what aspects of an LR are useful for MT practitioners, a user study was made, providing a guide to the most relevant metadata and the most relevant quality criteria. We have seen that many resources exist which are useful for MT and similar work, but the majority are for (academic) research or educational use only, and as such not available for commercial use. Our work has revealed a list of gaps: coverage gap, awareness gap, quality gap, quantity gap. The paper ends with recommendations for a forward-looking strategy. |
Tasks | Machine Translation |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1072/ |
https://www.aclweb.org/anthology/L16-1072 | |
PWC | https://paperswithcode.com/paper/providing-a-catalogue-of-language-resources |
Repo | |
Framework | |