May 5, 2019


Paper Group NANR 102

Crazy Mad Nutters: The Language of Mental Health. Exploratory Analysis of Social Media Prior to a Suicide Attempt. Mental Distress Detection and Triage in Forum Posts: The LT3 CLPsych 2016 Shared Task System. Sentiment, Subjectivity, and Social Analysis Go to Work: An Industry View - Invited Talk. Kernel Observers: Systems-Theoretic Modeling and Inf …

Crazy Mad Nutters: The Language of Mental Health


Title	Crazy Mad Nutters: The Language of Mental Health
Authors	Jena D. Hwang, Kristy Hollingshead
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0306/
PDF	https://www.aclweb.org/anthology/W16-0306
PWC	https://paperswithcode.com/paper/crazy-mad-nutters-the-language-of-mental
Repo
Framework

Exploratory Analysis of Social Media Prior to a Suicide Attempt


Title	Exploratory Analysis of Social Media Prior to a Suicide Attempt
Authors	Glen Coppersmith, Kim Ngo, Ryan Leary, Anthony Wood
Abstract
Tasks	Emotion Classification
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0311/
PDF	https://www.aclweb.org/anthology/W16-0311
PWC	https://paperswithcode.com/paper/exploratory-analysis-of-social-media-prior-to
Repo
Framework

Mental Distress Detection and Triage in Forum Posts: The LT3 CLPsych 2016 Shared Task System


Title	Mental Distress Detection and Triage in Forum Posts: The LT3 CLPsych 2016 Shared Task System
Authors	Bart Desmet, Gilles Jacobs, Véronique Hoste
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0317/
PDF	https://www.aclweb.org/anthology/W16-0317
PWC	https://paperswithcode.com/paper/mental-distress-detection-and-triage-in-forum
Repo
Framework

Sentiment, Subjectivity, and Social Analysis Go to Work: An Industry View - Invited Talk


Title	Sentiment, Subjectivity, and Social Analysis Go to Work: An Industry View - Invited Talk
Authors	Seth Grimes
Abstract
Tasks	Sentiment Analysis
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-0402/
PDF	https://www.aclweb.org/anthology/W16-0402
PWC	https://paperswithcode.com/paper/sentiment-subjectivity-and-social-analysis-go
Repo
Framework

Kernel Observers: Systems-Theoretic Modeling and Inference of Spatiotemporally Evolving Processes


Title	Kernel Observers: Systems-Theoretic Modeling and Inference of Spatiotemporally Evolving Processes
Authors	Hassan A. Kingravi, Harshal R. Maske, Girish Chowdhary
Abstract	We consider the problem of estimating the latent state of a spatiotemporally evolving continuous function using very few sensor measurements. We show that layering a dynamical systems prior over temporal evolution of weights of a kernel model is a valid approach to spatiotemporal modeling that does not necessarily require the design of complex nonstationary kernels. Furthermore, we show that such a predictive model can be utilized to determine sensing locations that guarantee that the hidden state of the phenomena can be recovered with very few measurements. We provide sufficient conditions on the number and spatial location of samples required to guarantee state recovery, and provide a lower bound on the minimum number of samples required to robustly infer the hidden states. Our approach outperforms existing methods in numerical experiments.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6189-kernel-observers-systems-theoretic-modeling-and-inference-of-spatiotemporally-evolving-processes
PDF	http://papers.nips.cc/paper/6189-kernel-observers-systems-theoretic-modeling-and-inference-of-spatiotemporally-evolving-processes.pdf
PWC	https://paperswithcode.com/paper/kernel-observers-systems-theoretic-modeling
Repo
Framework
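
The abstract above describes placing a dynamical-systems prior on the weights of a kernel model and asking when the hidden state can be recovered from a few sensor readings. A minimal sketch of that idea follows, assuming linear weight dynamics w_{t+1} = A w_t and measurements y_t = K w_t, where K holds kernel evaluations between sensor locations and fixed kernel centers. The RBF kernel, the cyclic-shift matrix A, the sensor location, and all function names are illustrative assumptions, not the authors' implementation; the check used here is the standard observability-matrix rank test from linear systems theory.

```python
import numpy as np

def rbf_features(points, centers, bandwidth=0.5):
    """Gaussian (RBF) kernel matrix between 1-D points and kernel centers."""
    diff = points[:, None] - centers[None, :]
    return np.exp(-(diff ** 2) / (2.0 * bandwidth ** 2))

def observability_matrix(A, K, horizon):
    """Stack K, K A, K A^2, ... for `horizon` time steps."""
    blocks, M = [], K.copy()
    for _ in range(horizon):
        blocks.append(M)
        M = M @ A
    return np.vstack(blocks)

# Hypothetical setup: 5 kernel centers on [0, 1]; the weights shift cyclically.
centers = np.linspace(0.0, 1.0, 5)
A = np.roll(np.eye(5), 1, axis=0)    # toy dynamics, chosen for illustration
sensors = np.array([0.1])            # a single sensor location (assumed)
K = rbf_features(sensors, centers)   # 1 x 5 measurement operator

# The weights are recoverable if the observability matrix has full column rank.
O = observability_matrix(A, K, horizon=5)
observable = np.linalg.matrix_rank(O) == len(centers)

# Simulate 5 noiseless scalar measurements and recover the initial weights.
w0 = np.array([1.0, -0.5, 0.3, 0.0, 2.0])
w, ys = w0.copy(), []
for _ in range(5):
    ys.append(K @ w)
    w = A @ w
y = np.concatenate(ys)
w0_hat, *_ = np.linalg.lstsq(O, y, rcond=None)

print("observable:", observable)
print("max recovery error:", np.max(np.abs(w0_hat - w0)))
```

In this toy case one sensor is enough because the dynamics move every weight past the sensor over time; with A set to the identity, the same single sensor would leave the state unobservable.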

The OpenCourseWare Metadiscourse (OCWMD) Corpus


Title	The OpenCourseWare Metadiscourse (OCWMD) Corpus
Authors	Ghada Alharbi, Thomas Hain
Abstract	This study describes a new corpus of over 60,000 hand-annotated metadiscourse acts from 106 OpenCourseWare lectures, from two different disciplines: Physics and Economics. Metadiscourse is a set of linguistic expressions that signal different functions in the discourse. This type of language is hypothesised to be helpful in finding structure in unstructured text, such as lecture discourse. A brief summary is provided of the annotation scheme and labelling procedures, inter-annotator reliability statistics, overall distributional statistics, a description of auxiliary data that will be distributed with the corpus, and information on how to obtain the data. The results provide a deeper understanding of lecture structure and confirm the reliable coding of metadiscursive acts in academic lectures across different disciplines. The next stage of our research will be to build a classification model to automate the tagging process, replacing manual annotation, which takes time and effort. This is in addition to the use of these tags as indicators of the higher-level structure of lecture discourse.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1279/
PDF	https://www.aclweb.org/anthology/L16-1279
PWC	https://paperswithcode.com/paper/the-opencourseware-metadiscourse-ocwmd-corpus
Repo
Framework

Automatic Speech Recognition Errors as a Predictor of L2 Listening Difficulties


Title	Automatic Speech Recognition Errors as a Predictor of L2 Listening Difficulties
Authors	Maryam Sadat Mirzaei, Kourosh Meshgi, Tatsuya Kawahara
Abstract	This paper investigates the use of automatic speech recognition (ASR) errors as indicators of second language (L2) learners' listening difficulties and, in doing so, strives to overcome the shortcomings of the Partial and Synchronized Caption (PSC) system. PSC is a system that generates a partial caption including difficult words detected based on high speech rate, low frequency, and specificity. To improve the choice of words in this system, and to explore a better method of detecting speech challenges, ASR errors were investigated as a model of the L2 listener, hypothesizing that some of these errors are similar to those of language learners when transcribing the videos. To investigate this hypothesis, ASR errors in the transcription of several TED talks were analyzed and compared with PSC's selected words. Both the overlapping and mismatching cases were analyzed to investigate possible improvements for the PSC system. Those ASR errors that were not detected by PSC as cases of learners' difficulties were further analyzed and classified into four categories: homophones, minimal pairs, breached boundaries and negatives. These errors were embedded into the baseline PSC to make the enhanced version and were evaluated in an experiment with L2 learners. The results indicated that the enhanced version, which encompasses the ASR errors, addresses most of the L2 learners' difficulties and better assists them in comprehending challenging video segments as compared with the baseline.
Tasks	Speech Recognition
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4122/
PDF	https://www.aclweb.org/anthology/W16-4122
PWC	https://paperswithcode.com/paper/automatic-speech-recognition-errors-as-a
Repo
Framework

Learning pressures reduce morphological complexity: Linking corpus, computational and experimental evidence


Title	Learning pressures reduce morphological complexity: Linking corpus, computational and experimental evidence
Authors	Christian Bentz, Aleks Berdicevskis, rs
Abstract	The morphological complexity of languages differs widely and changes over time. Pathways of change are often driven by the interplay of multiple competing factors, and are hard to disentangle. We here focus on a paradigmatic scenario of language change: the reduction of morphological complexity from Latin towards the Romance languages. To establish a causal explanation for this phenomenon, we employ three lines of evidence: 1) analyses of parallel corpora to measure the complexity of words in actual language production, 2) applications of NLP tools to further tease apart the contribution of inflectional morphology to word complexity, and 3) experimental data from artificial language learning, which illustrate the learning pressures at play when morphology simplifies. These three lines of evidence converge to show that pressures associated with imperfect language learning are good candidates to causally explain the reduction in morphological complexity in the Latin-to-Romance scenario. More generally, we argue that combining corpus, computational and experimental evidence is the way forward in historical linguistics and linguistic typology.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4125/
PDF	https://www.aclweb.org/anthology/W16-4125
PWC	https://paperswithcode.com/paper/learning-pressures-reduce-morphological
Repo
Framework

CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research


Title	CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research
Authors	Santanu Pal, Sudip Kumar Naskar, Marcos Zampieri, Tapas Nayak, Josef van Genabith
Abstract	We present a free web-based CAT tool called CATaLog Online which provides a novel and user-friendly online CAT environment for post-editors/translators. The goal is to support distributed translation, reduce post-editing time and effort, improve the post-editing experience and capture data for incremental MT/APE (automatic post-editing) and translation process research. The tool supports individual as well as batch mode file translation and provides translations from three engines: translation memory (TM), MT and APE. TM suggestions are color coded to accelerate the post-editing task. The users can integrate their personal TM/MT outputs. The tool remotely monitors and records post-editing activities generating an extensive range of post-editing logs.
Tasks	Automatic Post-Editing, Machine Translation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-2021/
PDF	https://www.aclweb.org/anthology/C16-2021
PWC	https://paperswithcode.com/paper/catalog-online-a-web-based-cat-tool-for
Repo
Framework

SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision


Title	SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision
Authors	Jan Deriu, Maurice Gonzenbach, Fatih Uzdilli, Aurelien Lucchi, Valeria De Luca, Martin Jaggi
Abstract
Tasks	Sentence Embedding, Sentiment Analysis, Word Embeddings
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1173/
PDF	https://www.aclweb.org/anthology/S16-1173
PWC	https://paperswithcode.com/paper/swisscheese-at-semeval-2016-task-4-sentiment
Repo
Framework

Cross-lingual Dependency Transfer : What Matters? Assessing the Impact of Pre- and Post-processing


Title	Cross-lingual Dependency Transfer : What Matters? Assessing the Impact of Pre- and Post-processing
Authors	Ophélie Lacroix, Guillaume Wisniewski, François Yvon
Abstract
Tasks
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-1203/
PDF	https://www.aclweb.org/anthology/W16-1203
PWC	https://paperswithcode.com/paper/cross-lingual-dependency-transfer-what
Repo
Framework

The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People


Title	The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Authors	Michel Vacher, Saïda Bouakaz, Marc-Eric Bobillier Chaumon, Frédéric Aman, R. A. Khan, Slima Bekkadja, François Portet, Erwan Guillou, Solange Rossato, Benjamin Lecouteux
Abstract	Ambient Assisted Living aims at enhancing the quality of life of older and disabled people at home thanks to Smart Homes. In particular, regarding elderly people living alone at home, the detection of a distress situation after a fall is very important to reassure this kind of population. However, many studies do not include tests in real settings, because data collection in this domain is very expensive and challenging and because few data sets are available. The CIRDO corpus is a dataset recorded in realistic conditions in DOMUS, a fully equipped Smart Home with microphones and home automation sensors, in which participants performed scenarios including real falls on a carpet and calls for help. These scenarios were elaborated thanks to a field study involving elderly persons. Experiments are presented relating, in a first part, to real-time distress detection using audio and speech analysis and, in a second part, to fall detection using video analysis. Results show the difficulty of the task. The database can be used as a standardized database by researchers to evaluate and compare their systems for assisting elderly persons.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1221/
PDF	https://www.aclweb.org/anthology/L16-1221
PWC	https://paperswithcode.com/paper/the-cirdo-corpus-comprehensive-audiovideo
Repo
Framework

Geographical Visualization of Search Results in Historical Corpora


Title	Geographical Visualization of Search Results in Historical Corpora
Authors	Florian Petran
Abstract	We present ANNISVis, a webapp for comparative visualization of geographical distribution of linguistic data, as well as a sample deployment for a corpus of Middle High German texts. Unlike existing geographical visualization solutions, which work with pre-existing data sets, or are bound to specific corpora, ANNISVis allows the user to formulate multiple ad-hoc queries and visualizes them on a map, and it can be configured for any corpus that can be imported into ANNIS. This enables explorative queries of the quantitative aspects of a corpus with geographical features. The tool will be made available to download in open source.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4013/
PDF	https://www.aclweb.org/anthology/W16-4013
PWC	https://paperswithcode.com/paper/geographical-visualization-of-search-results
Repo
Framework

Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset


Title	Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset
Authors	Vuk Batanović, Boško Nikolić, Milan Milosavljević
Abstract	Collecting data for sentiment analysis in resource-limited languages carries a significant risk of sample selection bias, since the small quantities of available data are most likely not representative of the whole population. Ignoring this bias leads to less robust machine learning classifiers and less reliable evaluation results. In this paper we present a dataset balancing algorithm that minimizes the sample selection bias by eliminating irrelevant systematic differences between the sentiment classes. We prove its superiority over the random sampling method and we use it to create the Serbian movie review dataset ― SerbMR ― the first balanced and topically uniform sentiment analysis dataset in Serbian. In addition, we propose an incremental way of finding the optimal combination of simple text processing options and machine learning features for sentiment classification. Several popular classifiers are used in conjunction with this evaluation approach in order to establish strong but reliable baselines for sentiment analysis in Serbian.
Tasks	Sentiment Analysis
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1427/
PDF	https://www.aclweb.org/anthology/L16-1427
PWC	https://paperswithcode.com/paper/reliable-baselines-for-sentiment-analysis-in
Repo
Framework

Using Learning-To-Rank to Enhance NLM Medical Text Indexer Results


Title	Using Learning-To-Rank to Enhance NLM Medical Text Indexer Results
Authors	Ilya Zavorin, James Mork, Dina Demner-Fushman
Abstract
Tasks	Learning-To-Rank
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-3102/
PDF	https://www.aclweb.org/anthology/W16-3102
PWC	https://paperswithcode.com/paper/using-learning-to-rank-to-enhance-nlm-medical
Repo
Framework