May 5, 2019

Paper Group NANR 102

Crazy Mad Nutters: The Language of Mental Health

Title Crazy Mad Nutters: The Language of Mental Health
Authors Jena D. Hwang, Kristy Hollingshead
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0306/
PDF https://www.aclweb.org/anthology/W16-0306
PWC https://paperswithcode.com/paper/crazy-mad-nutters-the-language-of-mental
Repo
Framework

Exploratory Analysis of Social Media Prior to a Suicide Attempt

Title Exploratory Analysis of Social Media Prior to a Suicide Attempt
Authors Glen Coppersmith, Kim Ngo, Ryan Leary, Anthony Wood
Abstract
Tasks Emotion Classification
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0311/
PDF https://www.aclweb.org/anthology/W16-0311
PWC https://paperswithcode.com/paper/exploratory-analysis-of-social-media-prior-to
Repo
Framework

Mental Distress Detection and Triage in Forum Posts: The LT3 CLPsych 2016 Shared Task System

Title Mental Distress Detection and Triage in Forum Posts: The LT3 CLPsych 2016 Shared Task System
Authors Bart Desmet, Gilles Jacobs, Véronique Hoste
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0317/
PDF https://www.aclweb.org/anthology/W16-0317
PWC https://paperswithcode.com/paper/mental-distress-detection-and-triage-in-forum
Repo
Framework

Sentiment, Subjectivity, and Social Analysis Go To Work: An Industry View - Invited Talk

Title Sentiment, Subjectivity, and Social Analysis Go To Work: An Industry View - Invited Talk
Authors Seth Grimes
Abstract
Tasks Sentiment Analysis
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-0402/
PDF https://www.aclweb.org/anthology/W16-0402
PWC https://paperswithcode.com/paper/sentiment-subjectivity-and-social-analysis-go
Repo
Framework

Kernel Observers: Systems-Theoretic Modeling and Inference of Spatiotemporally Evolving Processes

Title Kernel Observers: Systems-Theoretic Modeling and Inference of Spatiotemporally Evolving Processes
Authors Hassan A. Kingravi, Harshal R. Maske, Girish Chowdhary
Abstract We consider the problem of estimating the latent state of a spatiotemporally evolving continuous function using very few sensor measurements. We show that layering a dynamical systems prior over temporal evolution of weights of a kernel model is a valid approach to spatiotemporal modeling that does not necessarily require the design of complex nonstationary kernels. Furthermore, we show that such a predictive model can be utilized to determine sensing locations that guarantee that the hidden state of the phenomena can be recovered with very few measurements. We provide sufficient conditions on the number and spatial location of samples required to guarantee state recovery, and provide a lower bound on the minimum number of samples required to robustly infer the hidden states. Our approach outperforms existing methods in numerical experiments.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6189-kernel-observers-systems-theoretic-modeling-and-inference-of-spatiotemporally-evolving-processes
PDF http://papers.nips.cc/paper/6189-kernel-observers-systems-theoretic-modeling-and-inference-of-spatiotemporally-evolving-processes.pdf
PWC https://paperswithcode.com/paper/kernel-observers-systems-theoretic-modeling
Repo
Framework
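
The abstract above does not give the model equations. As a hedged sketch only, one way to read "a dynamical systems prior over temporal evolution of weights of a kernel model" is the linear state-space form below. The linear transition matrix $A$, the centers $c_i$, the sensor locations $s_j$, and the noise-free setting are all assumptions made for illustration, not details taken from the paper.

```latex
% Illustrative sketch; A, c_i, s_j and the noise-free setting are assumptions.
% Requires amsmath.
\begin{align}
f_t(x)  &= \sum_{i=1}^{M} w_{t,i}\, k(x, c_i)
        && \text{kernel model with $M$ fixed centers } c_i \\
w_{t+1} &= A\, w_t
        && \text{dynamical prior on the weight vector } w_t \in \mathbb{R}^M \\
y_t     &= K\, w_t, \qquad K_{ji} = k(s_j, c_i)
        && \text{readings at $N$ sensor locations } s_j \\
\mathcal{O} &= \begin{bmatrix} K \\ KA \\ \vdots \\ KA^{M-1} \end{bmatrix},
        \qquad \operatorname{rank}(\mathcal{O}) = M
        && \text{condition for recovering } w_0 \text{ from } y_0, \dots, y_{M-1}
\end{align}
```

Under these assumptions, the question of where to place sensors becomes the question of which locations $s_j$ make the pair $(A, K)$ observable, which is how a small $N$ can still suffice to recover the hidden state.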

The OpenCourseWare Metadiscourse (OCWMD) Corpus

Title The OpenCourseWare Metadiscourse (OCWMD) Corpus
Authors Ghada Alharbi, Thomas Hain
Abstract This study describes a new corpus of over 60,000 hand-annotated metadiscourse acts from 106 OpenCourseWare lectures, from two different disciplines: Physics and Economics. Metadiscourse is a set of linguistic expressions that signal different functions in the discourse. This type of language is hypothesised to be helpful in finding a structure in unstructured text, such as lecture discourse. A brief summary is provided of the annotation scheme and labelling procedures, inter-annotator reliability statistics, overall distributional statistics, a description of auxiliary data that will be distributed with the corpus, and information on how to obtain the data. The results provide a deeper understanding of lecture structure and confirm the reliable coding of metadiscursive acts in academic lectures across different disciplines. The next stage of our research will be to build a classification model to automate the tagging process, replacing manual annotation, which takes time and effort. In addition, these tags will be used as indicators of the higher-level structure of lecture discourse.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1279/
PDF https://www.aclweb.org/anthology/L16-1279
PWC https://paperswithcode.com/paper/the-opencourseware-metadiscourse-ocwmd-corpus
Repo
Framework

Automatic Speech Recognition Errors as a Predictor of L2 Listening Difficulties

Title Automatic Speech Recognition Errors as a Predictor of L2 Listening Difficulties
Authors Maryam Sadat Mirzaei, Kourosh Meshgi, Tatsuya Kawahara
Abstract This paper investigates the use of automatic speech recognition (ASR) errors as indicators of second language (L2) learners' listening difficulties and in doing so strives to overcome the shortcomings of the Partial and Synchronized Caption (PSC) system. PSC is a system that generates a partial caption including difficult words detected based on high speech rate, low frequency, and specificity. To improve the choice of words in this system, and explore a better method to detect speech challenges, ASR errors were investigated as a model of the L2 listener, hypothesizing that some of these errors are similar to those of language learners when transcribing the videos. To investigate this hypothesis, ASR errors in transcription of several TED talks were analyzed and compared with PSC's selected words. Both the overlapping and mismatching cases were analyzed to investigate possible improvement for the PSC system. Those ASR errors that were not detected by PSC as cases of learners' difficulties were further analyzed and classified into four categories: homophones, minimal pairs, breached boundaries and negatives. These errors were embedded into the baseline PSC to make the enhanced version and were evaluated in an experiment with L2 learners. The results indicated that the enhanced version, which encompasses the ASR errors, addresses most of the L2 learners' difficulties and better assists them in comprehending challenging video segments as compared with the baseline.
Tasks Speech Recognition
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4122/
PDF https://www.aclweb.org/anthology/W16-4122
PWC https://paperswithcode.com/paper/automatic-speech-recognition-errors-as-a
Repo
Framework

Learning pressures reduce morphological complexity: Linking corpus, computational and experimental evidence

Title Learning pressures reduce morphological complexity: Linking corpus, computational and experimental evidence
Authors Christian Bentz, Aleksandrs Berdicevskis
Abstract The morphological complexity of languages differs widely and changes over time. Pathways of change are often driven by the interplay of multiple competing factors, and are hard to disentangle. We here focus on a paradigmatic scenario of language change: the reduction of morphological complexity from Latin towards the Romance languages. To establish a causal explanation for this phenomenon, we employ three lines of evidence: 1) analyses of parallel corpora to measure the complexity of words in actual language production, 2) applications of NLP tools to further tease apart the contribution of inflectional morphology to word complexity, and 3) experimental data from artificial language learning, which illustrate the learning pressures at play when morphology simplifies. These three lines of evidence converge to show that pressures associated with imperfect language learning are good candidates to causally explain the reduction in morphological complexity in the Latin-to-Romance scenario. More generally, we argue that combining corpus, computational and experimental evidence is the way forward in historical linguistics and linguistic typology.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4125/
PDF https://www.aclweb.org/anthology/W16-4125
PWC https://paperswithcode.com/paper/learning-pressures-reduce-morphological
Repo
Framework

CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research

Title CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research
Authors Santanu Pal, Sudip Kumar Naskar, Marcos Zampieri, Tapas Nayak, Josef van Genabith
Abstract We present a free web-based CAT tool called CATaLog Online which provides a novel and user-friendly online CAT environment for post-editors/translators. The goal is to support distributed translation, reduce post-editing time and effort, improve the post-editing experience and capture data for incremental MT/APE (automatic post-editing) and translation process research. The tool supports individual as well as batch mode file translation and provides translations from three engines: translation memory (TM), MT and APE. TM suggestions are color coded to accelerate the post-editing task. The users can integrate their personal TM/MT outputs. The tool remotely monitors and records post-editing activities, generating an extensive range of post-editing logs.
Tasks Automatic Post-Editing, Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2021/
PDF https://www.aclweb.org/anthology/C16-2021
PWC https://paperswithcode.com/paper/catalog-online-a-web-based-cat-tool-for
Repo
Framework

SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision

Title SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision
Authors Jan Deriu, Maurice Gonzenbach, Fatih Uzdilli, Aurelien Lucchi, Valeria De Luca, Martin Jaggi
Abstract
Tasks Sentence Embedding, Sentiment Analysis, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1173/
PDF https://www.aclweb.org/anthology/S16-1173
PWC https://paperswithcode.com/paper/swisscheese-at-semeval-2016-task-4-sentiment
Repo
Framework

Cross-lingual Dependency Transfer : What Matters? Assessing the Impact of Pre- and Post-processing

Title Cross-lingual Dependency Transfer : What Matters? Assessing the Impact of Pre- and Post-processing
Authors Ophélie Lacroix, Guillaume Wisniewski, François Yvon
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1203/
PDF https://www.aclweb.org/anthology/W16-1203
PWC https://paperswithcode.com/paper/cross-lingual-dependency-transfer-what
Repo
Framework

The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People

Title The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
Authors Michel Vacher, Saïda Bouakaz, Marc-Eric Bobillier Chaumon, Frédéric Aman, R. A. Khan, Slima Bekkadja, François Portet, Erwan Guillou, Solange Rossato, Benjamin Lecouteux
Abstract Ambient Assisted Living aims at enhancing the quality of life of older and disabled people at home thanks to Smart Homes. In particular, for elderly people living alone at home, the detection of distress situations after a fall is very important to reassure this population. However, many studies do not include tests in real settings, because data collection in this domain is very expensive and challenging and because few data sets are available. The CIRDO corpus is a dataset recorded in realistic conditions in DOMUS, a fully equipped Smart Home with microphones and home automation sensors, in which participants performed scenarios including real falls on a carpet and calls for help. These scenarios were elaborated thanks to a field study involving elderly persons. Experiments are presented that relate, in a first part, to real-time distress detection using audio and speech analysis and, in a second part, to fall detection using video analysis. Results show the difficulty of the task. The database can be used as a standardized database by researchers to evaluate and compare their systems for elderly person's assistance.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1221/
PDF https://www.aclweb.org/anthology/L16-1221
PWC https://paperswithcode.com/paper/the-cirdo-corpus-comprehensive-audiovideo
Repo
Framework

Geographical Visualization of Search Results in Historical Corpora

Title Geographical Visualization of Search Results in Historical Corpora
Authors Florian Petran
Abstract We present ANNISVis, a webapp for comparative visualization of geographical distribution of linguistic data, as well as a sample deployment for a corpus of Middle High German texts. Unlike existing geographical visualization solutions, which work with pre-existing data sets, or are bound to specific corpora, ANNISVis allows the user to formulate multiple ad-hoc queries and visualizes them on a map, and it can be configured for any corpus that can be imported into ANNIS. This enables explorative queries of the quantitative aspects of a corpus with geographical features. The tool will be made available to download in open source.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4013/
PDF https://www.aclweb.org/anthology/W16-4013
PWC https://paperswithcode.com/paper/geographical-visualization-of-search-results
Repo
Framework

Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset

Title Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset
Authors Vuk Batanović, Boško Nikolić, Milan Milosavljević
Abstract Collecting data for sentiment analysis in resource-limited languages carries a significant risk of sample selection bias, since the small quantities of available data are most likely not representative of the whole population. Ignoring this bias leads to less robust machine learning classifiers and less reliable evaluation results. In this paper we present a dataset balancing algorithm that minimizes the sample selection bias by eliminating irrelevant systematic differences between the sentiment classes. We prove its superiority over the random sampling method and we use it to create the Serbian movie review dataset ― SerbMR ― the first balanced and topically uniform sentiment analysis dataset in Serbian. In addition, we propose an incremental way of finding the optimal combination of simple text processing options and machine learning features for sentiment classification. Several popular classifiers are used in conjunction with this evaluation approach in order to establish strong but reliable baselines for sentiment analysis in Serbian.
Tasks Sentiment Analysis
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1427/
PDF https://www.aclweb.org/anthology/L16-1427
PWC https://paperswithcode.com/paper/reliable-baselines-for-sentiment-analysis-in
Repo
Framework

Using Learning-To-Rank to Enhance NLM Medical Text Indexer Results

Title Using Learning-To-Rank to Enhance NLM Medical Text Indexer Results
Authors Ilya Zavorin, James Mork, Dina Demner-Fushman
Abstract
Tasks Learning-To-Rank
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-3102/
PDF https://www.aclweb.org/anthology/W16-3102
PWC https://paperswithcode.com/paper/using-learning-to-rank-to-enhance-nlm-medical
Repo
Framework