May 5, 2019

1934 words 10 mins read

Paper Group NANR 142

Paper Group NANR 142

N-ary Biographical Relation Extraction using Shortest Path Dependencies. Multilingual Multimodal Language Processing Using Neural Networks. Phrase Level Segmentation and Labelling of Machine Translation Errors. Multimodal Mood Classification - A Case Study of Differences in Hindi and Western Songs. Mining linguistic tone patterns with symbolic repr …

N-ary Biographical Relation Extraction using Shortest Path Dependencies

Title N-ary Biographical Relation Extraction using Shortest Path Dependencies
Authors Gitansh Khirbat, Jianzhong Qi, Rui Zhang
Abstract
Tasks Entity Extraction, Named Entity Recognition, Question Answering, Relation Extraction
Published 2016-12-01
URL https://www.aclweb.org/anthology/U16-1008/
PDF https://www.aclweb.org/anthology/U16-1008
PWC https://paperswithcode.com/paper/n-ary-biographical-relation-extraction-using
Repo
Framework

Multilingual Multimodal Language Processing Using Neural Networks

Title Multilingual Multimodal Language Processing Using Neural Networks
Authors Mitesh M Khapra, Ch, Sarath ar
Abstract
Tasks Machine Translation, Representation Learning
Published 2016-06-01
URL https://www.aclweb.org/anthology/N16-4002/
PDF https://www.aclweb.org/anthology/N16-4002
PWC https://paperswithcode.com/paper/multilingual-multimodal-language-processing
Repo
Framework

Phrase Level Segmentation and Labelling of Machine Translation Errors

Title Phrase Level Segmentation and Labelling of Machine Translation Errors
Authors Fr{'e}d{'e}ric Blain, Varvara Logacheva, Lucia Specia
Abstract This paper presents our work towards a novel approach for Quality Estimation (QE) of machine translation based on sequences of adjacent words, the so-called phrases. This new level of QE aims to provide a natural balance between QE at word and sentence-level, which are either too fine grained or too coarse levels for some applications. However, phrase-level QE implies an intrinsic challenge: how to segment a machine translation into sequence of words (contiguous or not) that represent an error. We discuss three possible segmentation strategies to automatically extract erroneous phrases. We evaluate these strategies against annotations at phrase-level produced by humans, using a new dataset collected for this purpose.
Tasks Machine Translation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1356/
PDF https://www.aclweb.org/anthology/L16-1356
PWC https://paperswithcode.com/paper/phrase-level-segmentation-and-labelling-of
Repo
Framework

Multimodal Mood Classification - A Case Study of Differences in Hindi and Western Songs

Title Multimodal Mood Classification - A Case Study of Differences in Hindi and Western Songs
Authors Braja Gopal Patra, Dipankar Das, B, Sivaji yopadhyay
Abstract Music information retrieval has emerged as a mainstream research area in the past two decades. Experiments on music mood classification have been performed mainly on Western music based on audio, lyrics and a combination of both. Unfortunately, due to the scarcity of digitalized resources, Indian music fares poorly in music mood retrieval research. In this paper, we identified the mood taxonomy and prepared multimodal mood annotated datasets for Hindi and Western songs. We identified important audio and lyric features using correlation based feature selection technique. Finally, we developed mood classification systems using Support Vector Machines and Feed Forward Neural Networks based on the features collected from audio, lyrics, and a combination of both. The best performing multimodal systems achieved F-measures of 75.1 and 83.5 for classifying the moods of the Hindi and Western songs respectively using Feed Forward Neural Networks. A comparative analysis indicates that the selected features work well for mood classification of the Western songs and produces better results as compared to the mood classification systems for Hindi songs.
Tasks Feature Selection, Information Retrieval, Music Information Retrieval
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1186/
PDF https://www.aclweb.org/anthology/C16-1186
PWC https://paperswithcode.com/paper/multimodal-mood-classification-a-case-study
Repo
Framework

Mining linguistic tone patterns with symbolic representation

Title Mining linguistic tone patterns with symbolic representation
Authors Shuo Zhang
Abstract
Tasks Information Retrieval, Music Information Retrieval, Speech Recognition, Time Series, Time Series Classification
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2001/
PDF https://www.aclweb.org/anthology/W16-2001
PWC https://paperswithcode.com/paper/mining-linguistic-tone-patterns-with-symbolic
Repo
Framework

Constructing a Dictionary Describing Feature Changes of Arguments in Event Sentences

Title Constructing a Dictionary Describing Feature Changes of Arguments in Event Sentences
Authors Tetsuaki Nakamura, Daisuke Kawahara
Abstract
Tasks Common Sense Reasoning
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-1006/
PDF https://www.aclweb.org/anthology/W16-1006
PWC https://paperswithcode.com/paper/constructing-a-dictionary-describing-feature
Repo
Framework

Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP

Title Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP
Authors
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-2500/
PDF https://www.aclweb.org/anthology/W16-2500
PWC https://paperswithcode.com/paper/proceedings-of-the-1st-workshop-on-evaluating
Repo
Framework

Discriminative Analysis of Linguistic Features for Typological Study

Title Discriminative Analysis of Linguistic Features for Typological Study
Authors Hiroya Takamura, Ryo Nagata, Yoshifumi Kawasaki
Abstract We address the task of automatically estimating the missing values of linguistic features by making use of the fact that some linguistic features in typological databases are informative to each other. The questions to address in this work are (i) how much predictive power do features have on the value of another feature? (ii) to what extent can we attribute this predictive power to genealogical or areal factors, as opposed to being provided by tendencies or implicational universals? To address these questions, we conduct a discriminative or predictive analysis on the typological database. Specifically, we use a machine-learning classifier to estimate the value of each feature of each language using the values of the other features, under different choices of training data: all the other languages, or all the other languages except for the ones having the same origin or area with the target language.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1011/
PDF https://www.aclweb.org/anthology/L16-1011
PWC https://paperswithcode.com/paper/discriminative-analysis-of-linguistic
Repo
Framework

Improving Topic Model Clustering of Newspaper Comments for Summarisation

Title Improving Topic Model Clustering of Newspaper Comments for Summarisation
Authors Clare Llewellyn, Claire Grover, Oberl, Jon er
Abstract
Tasks Topic Models
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3007/
PDF https://www.aclweb.org/anthology/P16-3007
PWC https://paperswithcode.com/paper/improving-topic-model-clustering-of-newspaper
Repo
Framework

Adapting an Entity Centric Model for Portuguese Coreference Resolution

Title Adapting an Entity Centric Model for Portuguese Coreference Resolution
Authors Ev Fonseca, ro, Renata Vieira, Aline Vanin
Abstract This paper presents the adaptation of an Entity Centric Model for Portuguese coreference resolution, considering 10 named entity categories. The model was evaluated on named e using the HAREM Portuguese corpus and the results are 81.0{%} of precision and 58.3{%} of recall overall, the resulting system is freely available
Tasks Coreference Resolution
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1023/
PDF https://www.aclweb.org/anthology/L16-1023
PWC https://paperswithcode.com/paper/adapting-an-entity-centric-model-for
Repo
Framework

Task-Oriented Intrinsic Evaluation of Semantic Textual Similarity

Title Task-Oriented Intrinsic Evaluation of Semantic Textual Similarity
Authors Nils Reimers, Philip Beyer, Iryna Gurevych
Abstract Semantic Textual Similarity (STS) is a foundational NLP task and can be used in a wide range of tasks. To determine the STS of two texts, hundreds of different STS systems exist, however, for an NLP system designer, it is hard to decide which system is the best one. To answer this question, an intrinsic evaluation of the STS systems is conducted by comparing the output of the system to human judgments on semantic similarity. The comparison is usually done using Pearson correlation. In this work, we show that relying on intrinsic evaluations with Pearson correlation can be misleading. In three common STS based tasks we could observe that the Pearson correlation was especially ill-suited to detect the best STS system for the task and other evaluation measures were much better suited. In this work we define how the validity of an intrinsic evaluation can be assessed and compare different intrinsic evaluation methods. Understanding of the properties of the targeted task is crucial and we propose a framework for conducting the intrinsic evaluation which takes the properties of the targeted task into account.
Tasks Question Answering, Semantic Similarity, Semantic Textual Similarity, Text Summarization
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1009/
PDF https://www.aclweb.org/anthology/C16-1009
PWC https://paperswithcode.com/paper/task-oriented-intrinsic-evaluation-of
Repo
Framework

Expanding wordnets to new languages with multilingual sense disambiguation

Title Expanding wordnets to new languages with multilingual sense disambiguation
Authors Mihael Arcan, John Philip McCrae, Paul Buitelaar
Abstract Princeton WordNet is one of the most important resources for natural language processing, but is only available for English. While it has been translated using the expand approach to many other languages, this is an expensive manual process. Therefore it would be beneficial to have a high-quality automatic translation approach that would support NLP techniques, which rely on WordNet in new languages. The translation of wordnets is fundamentally complex because of the need to translate all senses of a word including low frequency senses, which is very challenging for current machine translation approaches. For this reason we leverage existing translations of WordNet in other languages to identify contextual information for wordnet senses from a large set of generic parallel corpora. We evaluate our approach using 10 translated wordnets for European languages. Our experiment shows a significant improvement over translation without any contextual information. Furthermore, we evaluate how the choice of pivot languages affects performance of multilingual word sense disambiguation.
Tasks Information Retrieval, Machine Translation, Sentiment Analysis, Word Sense Disambiguation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1010/
PDF https://www.aclweb.org/anthology/C16-1010
PWC https://paperswithcode.com/paper/expanding-wordnets-to-new-languages-with
Repo
Framework

Deceptive Opinion Spam Detection Using Neural Network

Title Deceptive Opinion Spam Detection Using Neural Network
Authors Yafeng Ren, Yue Zhang
Abstract Deceptive opinion spam detection has attracted significant attention from both business and research communities. Existing approaches are based on manual discrete features, which can capture linguistic and psychological cues. However, such features fail to encode the semantic meaning of a document from the discourse perspective, which limits the performance. In this paper, we empirically explore a neural network model to learn document-level representation for detecting deceptive opinion spam. In particular, given a document, the model learns sentence representations with a convolutional neural network, which are combined using a gated recurrent neural network with attention mechanism to model discourse information and yield a document vector. Finally, the document representation is used directly as features to identify deceptive opinion spam. Experimental results on three domains (Hotel, Restaurant, and Doctor) show that our proposed method outperforms state-of-the-art methods.
Tasks Word Embeddings
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1014/
PDF https://www.aclweb.org/anthology/C16-1014
PWC https://paperswithcode.com/paper/deceptive-opinion-spam-detection-using-neural
Repo
Framework

Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data

Title Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data
Authors Julian Bleicken, Thomas Hanke, Uta Salden, Sven Wagner
Abstract For publishing sign language corpus data on the web, anonymization is crucial even if it is impossible to hide the visual appearance of the signers: In a small community, even vague references to third persons may be enough to identify those persons. In the case of the DGS Korpus (German Sign Language corpus) project, we want to publish data as a contribution to the cultural heritage of the sign language community while annotation of the data is still ongoing. This poses the question how well anonymization can be achieved given that no full linguistic analysis of the data is available. Basically, we combine analysis of all data that we have, including named entity recognition on translations into German. For this, we use the WebLicht language technology infrastructure. We report on the reliability of these methods in this special context and also illustrate how the anonymization of the video data is technically achieved in order to minimally disturb the viewer.
Tasks Named Entity Recognition
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1526/
PDF https://www.aclweb.org/anthology/L16-1526
PWC https://paperswithcode.com/paper/using-a-language-technology-infrastructure
Repo
Framework
Title Exploring Text Links for Coherent Multi-Document Summarization
Authors Xun Wang, Masaaki Nishino, Tsutomu Hirao, Katsuhito Sudoh, Masaaki Nagata
Abstract Summarization aims to represent source documents by a shortened passage. Existing methods focus on the extraction of key information, but often neglect coherence. Hence the generated summaries suffer from a lack of readability. To address this problem, we have developed a graph-based method by exploring the links between text to produce coherent summaries. Our approach involves finding a sequence of sentences that best represent the key information in a coherent way. In contrast to the previous methods that focus only on salience, the proposed method addresses both coherence and informativeness based on textual linkages. We conduct experiments on the DUC2004 summarization task data set. A performance comparison reveals that the summaries generated by the proposed system achieve comparable results in terms of the ROUGE metric, and show improvements in readability by human evaluation.
Tasks Document Summarization, Multi-Document Summarization, Text Generation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1021/
PDF https://www.aclweb.org/anthology/C16-1021
PWC https://paperswithcode.com/paper/exploring-text-links-for-coherent-multi
Repo
Framework
comments powered by Disqus