Paper Group NANR 102
Crazy Mad Nutters: The Language of Mental Health
Title | Crazy Mad Nutters: The Language of Mental Health |
Authors | Jena D. Hwang, Kristy Hollingshead |
Abstract | |
Tasks | |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0306/ |
PDF | https://www.aclweb.org/anthology/W16-0306 |
PWC | https://paperswithcode.com/paper/crazy-mad-nutters-the-language-of-mental |
Repo | |
Framework | |
Exploratory Analysis of Social Media Prior to a Suicide Attempt
Title | Exploratory Analysis of Social Media Prior to a Suicide Attempt |
Authors | Glen Coppersmith, Kim Ngo, Ryan Leary, Anthony Wood |
Abstract | |
Tasks | Emotion Classification |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0311/ |
PDF | https://www.aclweb.org/anthology/W16-0311 |
PWC | https://paperswithcode.com/paper/exploratory-analysis-of-social-media-prior-to |
Repo | |
Framework | |
Mental Distress Detection and Triage in Forum Posts: The LT3 CLPsych 2016 Shared Task System
Title | Mental Distress Detection and Triage in Forum Posts: The LT3 CLPsych 2016 Shared Task System |
Authors | Bart Desmet, Gilles Jacobs, Véronique Hoste |
Abstract | |
Tasks | |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0317/ |
PDF | https://www.aclweb.org/anthology/W16-0317 |
PWC | https://paperswithcode.com/paper/mental-distress-detection-and-triage-in-forum |
Repo | |
Framework | |
Sentiment, Subjectivity, and Social Analysis Go To Work: An Industry View - Invited Talk
Title | Sentiment, Subjectivity, and Social Analysis Go To Work: An Industry View - Invited Talk |
Authors | Seth Grimes |
Abstract | |
Tasks | Sentiment Analysis |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-0402/ |
PDF | https://www.aclweb.org/anthology/W16-0402 |
PWC | https://paperswithcode.com/paper/sentiment-subjectivity-and-social-analysis-go |
Repo | |
Framework | |
Kernel Observers: Systems-Theoretic Modeling and Inference of Spatiotemporally Evolving Processes
Title | Kernel Observers: Systems-Theoretic Modeling and Inference of Spatiotemporally Evolving Processes |
Authors | Hassan A. Kingravi, Harshal R. Maske, Girish Chowdhary |
Abstract | We consider the problem of estimating the latent state of a spatiotemporally evolving continuous function using very few sensor measurements. We show that layering a dynamical systems prior over temporal evolution of weights of a kernel model is a valid approach to spatiotemporal modeling that does not necessarily require the design of complex nonstationary kernels. Furthermore, we show that such a predictive model can be utilized to determine sensing locations that guarantee that the hidden state of the phenomena can be recovered with very few measurements. We provide sufficient conditions on the number and spatial location of samples required to guarantee state recovery, and provide a lower bound on the minimum number of samples required to robustly infer the hidden states. Our approach outperforms existing methods in numerical experiments. |
Tasks | |
Published | 2016-12-01 |
URL | http://papers.nips.cc/paper/6189-kernel-observers-systems-theoretic-modeling-and-inference-of-spatiotemporally-evolving-processes |
PDF | http://papers.nips.cc/paper/6189-kernel-observers-systems-theoretic-modeling-and-inference-of-spatiotemporally-evolving-processes.pdf |
PWC | https://paperswithcode.com/paper/kernel-observers-systems-theoretic-modeling |
Repo | |
Framework | |
The OpenCourseWare Metadiscourse (OCWMD) Corpus
Title | The OpenCourseWare Metadiscourse (OCWMD) Corpus |
Authors | Ghada Alharbi, Thomas Hain |
Abstract | This study describes a new corpus of over 60,000 hand-annotated metadiscourse acts from 106 OpenCourseWare lectures, from two different disciplines: Physics and Economics. Metadiscourse is a set of linguistic expressions that signal different functions in the discourse. This type of language is hypothesised to be helpful in finding structure in unstructured text, such as lecture discourse. A brief summary is provided of the annotation scheme and labelling procedures, inter-annotator reliability statistics, overall distributional statistics, the auxiliary data that will be distributed with the corpus, and how to obtain the data. The results provide a deeper understanding of lecture structure and confirm the reliable coding of metadiscursive acts in academic lectures across different disciplines. The next stage of our research will be to build a classification model to automate the tagging process, replacing manual annotation, which takes time and effort. This is in addition to the use of these tags as indicators of the higher-level structure of lecture discourse. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1279/ |
PDF | https://www.aclweb.org/anthology/L16-1279 |
PWC | https://paperswithcode.com/paper/the-opencourseware-metadiscourse-ocwmd-corpus |
Repo | |
Framework | |
Automatic Speech Recognition Errors as a Predictor of L2 Listening Difficulties
Title | Automatic Speech Recognition Errors as a Predictor of L2 Listening Difficulties |
Authors | Maryam Sadat Mirzaei, Kourosh Meshgi, Tatsuya Kawahara |
Abstract | This paper investigates the use of automatic speech recognition (ASR) errors as indicators of second language (L2) learners' listening difficulties and, in doing so, strives to overcome the shortcomings of the Partial and Synchronized Caption (PSC) system. PSC is a system that generates a partial caption including difficult words detected based on high speech rate, low frequency, and specificity. To improve the choice of words in this system and explore a better method to detect speech challenges, ASR errors were investigated as a model of the L2 listener, hypothesizing that some of these errors are similar to those of language learners when transcribing the videos. To investigate this hypothesis, ASR errors in the transcription of several TED talks were analyzed and compared with PSC's selected words. Both the overlapping and mismatching cases were analyzed to investigate possible improvements for the PSC system. Those ASR errors that were not detected by PSC as cases of learners' difficulties were further analyzed and classified into four categories: homophones, minimal pairs, breached boundaries and negatives. These errors were embedded into the baseline PSC to make the enhanced version and were evaluated in an experiment with L2 learners. The results indicated that the enhanced version, which encompasses the ASR errors, addresses most of the L2 learners' difficulties and better assists them in comprehending challenging video segments as compared with the baseline. |
Tasks | Speech Recognition |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4122/ |
PDF | https://www.aclweb.org/anthology/W16-4122 |
PWC | https://paperswithcode.com/paper/automatic-speech-recognition-errors-as-a |
Repo | |
Framework | |
Learning pressures reduce morphological complexity: Linking corpus, computational and experimental evidence
Title | Learning pressures reduce morphological complexity: Linking corpus, computational and experimental evidence |
Authors | Christian Bentz, Aleksandrs Berdicevskis |
Abstract | The morphological complexity of languages differs widely and changes over time. Pathways of change are often driven by the interplay of multiple competing factors, and are hard to disentangle. We here focus on a paradigmatic scenario of language change: the reduction of morphological complexity from Latin towards the Romance languages. To establish a causal explanation for this phenomenon, we employ three lines of evidence: 1) analyses of parallel corpora to measure the complexity of words in actual language production, 2) applications of NLP tools to further tease apart the contribution of inflectional morphology to word complexity, and 3) experimental data from artificial language learning, which illustrate the learning pressures at play when morphology simplifies. These three lines of evidence converge to show that pressures associated with imperfect language learning are good candidates to causally explain the reduction in morphological complexity in the Latin-to-Romance scenario. More generally, we argue that combining corpus, computational and experimental evidence is the way forward in historical linguistics and linguistic typology. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4125/ |
PDF | https://www.aclweb.org/anthology/W16-4125 |
PWC | https://paperswithcode.com/paper/learning-pressures-reduce-morphological |
Repo | |
Framework | |
CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research
Title | CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research |
Authors | Santanu Pal, Sudip Kumar Naskar, Marcos Zampieri, Tapas Nayak, Josef van Genabith |
Abstract | We present a free web-based CAT tool called CATaLog Online which provides a novel and user-friendly online CAT environment for post-editors/translators. The goal is to support distributed translation, reduce post-editing time and effort, improve the post-editing experience and capture data for incremental MT/APE (automatic post-editing) and translation process research. The tool supports individual as well as batch mode file translation and provides translations from three engines: translation memory (TM), MT and APE. TM suggestions are color coded to accelerate the post-editing task. The users can integrate their personal TM/MT outputs. The tool remotely monitors and records post-editing activities, generating an extensive range of post-editing logs. |
Tasks | Automatic Post-Editing, Machine Translation |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-2021/ |
PDF | https://www.aclweb.org/anthology/C16-2021 |
PWC | https://paperswithcode.com/paper/catalog-online-a-web-based-cat-tool-for |
Repo | |
Framework | |
SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision
Title | SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision |
Authors | Jan Deriu, Maurice Gonzenbach, Fatih Uzdilli, Aurelien Lucchi, Valeria De Luca, Martin Jaggi |
Abstract | |
Tasks | Sentence Embedding, Sentiment Analysis, Word Embeddings |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/S16-1173/ |
PDF | https://www.aclweb.org/anthology/S16-1173 |
PWC | https://paperswithcode.com/paper/swisscheese-at-semeval-2016-task-4-sentiment |
Repo | |
Framework | |
Cross-lingual Dependency Transfer : What Matters? Assessing the Impact of Pre- and Post-processing
Title | Cross-lingual Dependency Transfer : What Matters? Assessing the Impact of Pre- and Post-processing |
Authors | Ophélie Lacroix, Guillaume Wisniewski, François Yvon |
Abstract | |
Tasks | |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/W16-1203/ |
PDF | https://www.aclweb.org/anthology/W16-1203 |
PWC | https://paperswithcode.com/paper/cross-lingual-dependency-transfer-what |
Repo | |
Framework | |
The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Title | The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People |
Authors | Michel Vacher, Saïda Bouakaz, Marc-Eric Bobillier Chaumon, Frédéric Aman, R. A. Khan, Slima Bekkadja, François Portet, Erwan Guillou, Solange Rossato, Benjamin Lecouteux |
Abstract | Ambient Assisted Living aims at enhancing the quality of life of older and disabled people at home thanks to Smart Homes. In particular, regarding elderly people living alone at home, the detection of distress situations after a fall is very important to reassure this population. However, many studies do not include tests in real settings, because data collection in this domain is very expensive and challenging and because few data sets are available. The CIRDO corpus is a dataset recorded in realistic conditions in DOMUS, a fully equipped Smart Home with microphones and home automation sensors, in which participants performed scenarios including real falls on a carpet and calls for help. These scenarios were elaborated thanks to a field study involving elderly persons. Experiments are presented relating, in a first part, to real-time distress detection using audio and speech analysis and, in a second part, to fall detection using video analysis. Results show the difficulty of the task. The database can be used as a standardized database by researchers to evaluate and compare their systems for assisting elderly persons. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1221/ |
PDF | https://www.aclweb.org/anthology/L16-1221 |
PWC | https://paperswithcode.com/paper/the-cirdo-corpus-comprehensive-audiovideo |
Repo | |
Framework | |
Geographical Visualization of Search Results in Historical Corpora
Title | Geographical Visualization of Search Results in Historical Corpora |
Authors | Florian Petran |
Abstract | We present ANNISVis, a webapp for comparative visualization of geographical distribution of linguistic data, as well as a sample deployment for a corpus of Middle High German texts. Unlike existing geographical visualization solutions, which work with pre-existing data sets, or are bound to specific corpora, ANNISVis allows the user to formulate multiple ad-hoc queries and visualizes them on a map, and it can be configured for any corpus that can be imported into ANNIS. This enables explorative queries of the quantitative aspects of a corpus with geographical features. The tool will be made available to download in open source. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4013/ |
PDF | https://www.aclweb.org/anthology/W16-4013 |
PWC | https://paperswithcode.com/paper/geographical-visualization-of-search-results |
Repo | |
Framework | |
Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset
Title | Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset |
Authors | Vuk Batanović, Boško Nikolić, Milan Milosavljević |
Abstract | Collecting data for sentiment analysis in resource-limited languages carries a significant risk of sample selection bias, since the small quantities of available data are most likely not representative of the whole population. Ignoring this bias leads to less robust machine learning classifiers and less reliable evaluation results. In this paper we present a dataset balancing algorithm that minimizes the sample selection bias by eliminating irrelevant systematic differences between the sentiment classes. We prove its superiority over the random sampling method and we use it to create the Serbian movie review dataset (SerbMR), the first balanced and topically uniform sentiment analysis dataset in Serbian. In addition, we propose an incremental way of finding the optimal combination of simple text processing options and machine learning features for sentiment classification. Several popular classifiers are used in conjunction with this evaluation approach in order to establish strong but reliable baselines for sentiment analysis in Serbian. |
Tasks | Sentiment Analysis |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1427/ |
PDF | https://www.aclweb.org/anthology/L16-1427 |
PWC | https://paperswithcode.com/paper/reliable-baselines-for-sentiment-analysis-in |
Repo | |
Framework | |
Using Learning-To-Rank to Enhance NLM Medical Text Indexer Results
Title | Using Learning-To-Rank to Enhance NLM Medical Text Indexer Results |
Authors | Ilya Zavorin, James Mork, Dina Demner-Fushman |
Abstract | |
Tasks | Learning-To-Rank |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-3102/ |
PDF | https://www.aclweb.org/anthology/W16-3102 |
PWC | https://paperswithcode.com/paper/using-learning-to-rank-to-enhance-nlm-medical |
Repo | |
Framework | |