May 4, 2019

1776 words 9 mins read

Paper Group NANR 183

Paper Group NANR 183

Generation of Conceptual-Level Text Cloud with Graph Diffusion. Multivariate tests of association based on univariate tests. Proceedings of the Fourth BioASQ workshop. SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis. Identifying News from Tweets. Constructing an Annotated Corpus for Protest Event Mining. U …

Generation of Conceptual-Level Text Cloud with Graph Diffusion

Title Generation of Conceptual-Level Text Cloud with Graph Diffusion
Authors Ying-Chun Lin, Po-An Yang, Yen-Kuan Lee, Kun-Ta Chuang
Abstract
Tasks Keyword Extraction
Published 2016-10-01
URL https://www.aclweb.org/anthology/O16-1034/
PDF https://www.aclweb.org/anthology/O16-1034
PWC https://paperswithcode.com/paper/generation-of-conceptual-level-text-cloud
Repo
Framework

Multivariate tests of association based on univariate tests

Title Multivariate tests of association based on univariate tests
Authors Ruth Heller, Yair Heller
Abstract For testing two vector random variables for independence, we propose testing whether the distance of one vector from an arbitrary center point is independent from the distance of the other vector from another arbitrary center point by a univariate test. We prove that under minimal assumptions, it is enough to have a consistent univariate independence test on the distances, to guarantee that the power to detect dependence between the random vectors increases to one with sample size. If the univariate test is distribution-free, the multivariate test will also be distribution-free. If we consider multiple center points and aggregate the center-specific univariate tests, the power may be further improved, and the resulting multivariate test may be distribution-free for specific aggregation methods (if the univariate test is distribution-free). We show that certain multivariate tests recently proposed in the literature can be viewed as instances of this general approach. Moreover, we show in experiments that novel tests constructed using our approach can have better power and computational time than competing approaches.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6220-multivariate-tests-of-association-based-on-univariate-tests
PDF http://papers.nips.cc/paper/6220-multivariate-tests-of-association-based-on-univariate-tests.pdf
PWC https://paperswithcode.com/paper/multivariate-tests-of-association-based-on
Repo
Framework

Proceedings of the Fourth BioASQ workshop

Title Proceedings of the Fourth BioASQ workshop
Authors
Abstract
Tasks Question Answering
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-3100/
PDF https://www.aclweb.org/anthology/W16-3100
PWC https://paperswithcode.com/paper/proceedings-of-the-fourth-bioasq-workshop
Repo
Framework

SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis

Title SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis
Authors Mickael Rouvier, Benoit Favre
Abstract
Tasks Sentence Classification, Sentiment Analysis, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1030/
PDF https://www.aclweb.org/anthology/S16-1030
PWC https://paperswithcode.com/paper/sensei-lif-at-semeval-2016-task-4-polarity
Repo
Framework

Identifying News from Tweets

Title Identifying News from Tweets
Authors Jesse Freitas, Heng Ji
Abstract
Tasks Active Learning
Published 2016-11-01
URL https://www.aclweb.org/anthology/W16-5602/
PDF https://www.aclweb.org/anthology/W16-5602
PWC https://paperswithcode.com/paper/identifying-news-from-tweets
Repo
Framework

Constructing an Annotated Corpus for Protest Event Mining

Title Constructing an Annotated Corpus for Protest Event Mining
Authors Peter Makarov, Jasmine Lorenzini, Hanspeter Kriesi
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/W16-5613/
PDF https://www.aclweb.org/anthology/W16-5613
PWC https://paperswithcode.com/paper/constructing-an-annotated-corpus-for-protest
Repo
Framework

Universal Dependencies for Turkish

Title Universal Dependencies for Turkish
Authors Umut Sulubacak, Memduh Gokirmak, Francis Tyers, {\c{C}}a{\u{g}}r{\i} {\c{C}}{"o}ltekin, Joakim Nivre, G{"u}l{\c{s}}en Eryi{\u{g}}it
Abstract The Universal Dependencies (UD) project was conceived after the substantial recent interest in unifying annotation schemes across languages. With its own annotation principles and abstract inventory for parts of speech, morphosyntactic features and dependency relations, UD aims to facilitate multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. This paper presents the Turkish IMST-UD Treebank, the first Turkish treebank to be in a UD release. The IMST-UD Treebank was automatically converted from the IMST Treebank, which was also recently released. We describe this conversion procedure in detail, complete with mapping tables. We also present our evaluation of the parsing performances of both versions of the IMST Treebank. Our findings suggest that the UD framework is at least as viable for Turkish as the original annotation framework of the IMST Treebank.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1325/
PDF https://www.aclweb.org/anthology/C16-1325
PWC https://paperswithcode.com/paper/universal-dependencies-for-turkish
Repo
Framework

Automatic Syllabification for Manipuri language

Title Automatic Syllabification for Manipuri language
Authors Loitongbam Gyanendro Singh, Lenin Laitonjam, Sanasam Ranbir Singh
Abstract Development of hand crafted rule for syllabifying words of a language is an expensive task. This paper proposes several data-driven methods for automatic syllabification of words written in Manipuri language. Manipuri is one of the scheduled Indian languages. First, we propose a language-independent rule-based approach formulated using entropy based phonotactic segmentation. Second, we project the syllabification problem as a sequence labeling problem and investigate its effect using various sequence labeling approaches. Third, we combine the effect of sequence labeling and rule-based method and investigate the performance of the hybrid approach. From various experimental observations, it is evident that the proposed methods outperform the baseline rule-based method. The entropy based phonotactic segmentation provides a word accuracy of 96{%}, CRF (sequence labeling approach) provides 97{%} and hybrid approach provides 98{%} word accuracy.
Tasks Speech Recognition, Speech Synthesis, Text-To-Speech Synthesis
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1034/
PDF https://www.aclweb.org/anthology/C16-1034
PWC https://paperswithcode.com/paper/automatic-syllabification-for-manipuri
Repo
Framework

SRDF: Extracting Lexical Knowledge Graph for Preserving Sentence Meaning

Title SRDF: Extracting Lexical Knowledge Graph for Preserving Sentence Meaning
Authors Sangha Nam, GyuHyeon Choi, Younggyun Hahm, Key-Sun Choi
Abstract In this paper, we present an open information extraction system so-called SRDF that generates lexical knowledge graphs from unstructured texts. In semantic web, knowledge is expressed in the RDF triple form but the natural language text consist of multiple relations between arguments. For this reason, we combine open information extraction with the reification for the full text extraction to preserve meaning of sentence in our knowledge graph. And also our knowledge graph is designed to adapt for many existing semantic web applications. At the end of this paper, we introduce the result of the experiment and a Korean template generation module developed using SRDF.
Tasks Dependency Parsing, Knowledge Graphs, Open Information Extraction, Question Answering, Reading Comprehension
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4411/
PDF https://www.aclweb.org/anthology/W16-4411
PWC https://paperswithcode.com/paper/srdf-extracting-lexical-knowledge-graph-for
Repo
Framework

Combining Heterogeneous User Generated Data to Sense Well-being

Title Combining Heterogeneous User Generated Data to Sense Well-being
Authors Adam Tsakalidis, Maria Liakata, Theo Damoulas, Brigitte Jellinek, Weisi Guo, Alex Cristea, ra
Abstract In this paper we address a new problem of predicting affect and well-being scales in a real-world setting of heterogeneous, longitudinal and non-synchronous textual as well as non-linguistic data that can be harvested from on-line media and mobile phones. We describe the method for collecting the heterogeneous longitudinal data, how features are extracted to address missing information and differences in temporal alignment, and how the latter are combined to yield promising predictions of affect and well-being on the basis of widely used psychological scales. We achieve a coefficient of determination ($R^2$) of 0.71-0.76 and a correlation coefficient of 0.68-0.87 which is higher than the state-of-the art in equivalent multi-modal tasks for affect.
Tasks Emotion Recognition
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1283/
PDF https://www.aclweb.org/anthology/C16-1283
PWC https://paperswithcode.com/paper/combining-heterogeneous-user-generated-data
Repo
Framework

Neural Network Language Models for Candidate Scoring in Hybrid Multi-System Machine Translation

Title Neural Network Language Models for Candidate Scoring in Hybrid Multi-System Machine Translation
Authors Mat{=\i}ss Rikters
Abstract This paper presents the comparison of how using different neural network based language modeling tools for selecting the best candidate fragments affects the final output translation quality in a hybrid multi-system machine translation setup. Experiments were conducted by comparing perplexity and BLEU scores on common test cases using the same training data set. A 12-gram statistical language model was selected as a baseline to oppose three neural network based models of different characteristics. The models were integrated in a hybrid system that depends on the perplexity score of a sentence fragment to produce the best fitting translations. The results show a correlation between language model perplexity and BLEU scores as well as overall improvements in BLEU.
Tasks Language Modelling, Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4502/
PDF https://www.aclweb.org/anthology/W16-4502
PWC https://paperswithcode.com/paper/neural-network-language-models-for-candidate
Repo
Framework

Image-Image Search for Comparable Corpora Construction

Title Image-Image Search for Comparable Corpora Construction
Authors Yu Hong, Liang Yao, Mengyi Liu, Tongtao Zhang, Wenxuan Zhou, Jianmin Yao, Heng Ji
Abstract We present a novel method of comparable corpora construction. Unlike the traditional methods which heavily rely on linguistic features, our method only takes image similarity into consid-eration. We use an image-image search engine to obtain similar images, together with the cap-tions in source language and target language. On the basis, we utilize captions of similar imag-es to construct sentence-level bilingual corpora. Experiments on 10,371 target captions show that our method achieves a precision of 0.85 in the top search results.
Tasks Image Retrieval
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4503/
PDF https://www.aclweb.org/anthology/W16-4503
PWC https://paperswithcode.com/paper/image-image-search-for-comparable-corpora
Repo
Framework

Significance of an Accurate Sandhi-Splitter in Shallow Parsing of Dravidian Languages

Title Significance of an Accurate Sandhi-Splitter in Shallow Parsing of Dravidian Languages
Authors Devadath V V, Dipti Misra Sharma
Abstract
Tasks
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3006/
PDF https://www.aclweb.org/anthology/P16-3006
PWC https://paperswithcode.com/paper/significance-of-an-accurate-sandhi-splitter
Repo
Framework

Predicting Translation Equivalents in Linked WordNets

Title Predicting Translation Equivalents in Linked WordNets
Authors Krasimir Angelov, Gleb Lobanov
Abstract We present an algorithm for predicting translation equivalents between two languages, based on the corresponding WordNets. The assumption is that all synsets of one of the languages are linked to the corresponding synsets in the other language. In theory, given the exact sense of a word in a context it must be possible to translate it as any of the words in the linked synset. In practice, however, this does not work well since automatic and accurate sense disambiguation is difficult. Instead it is possible to define a more robust translation relation between the lexemes of the two languages. As far as we know the Finnish WordNet is the only one that includes that relation. Our algorithm can be used to predict the relation for other languages as well. This is useful for instance in hybrid machine translation systems which are usually more dependent on high-quality translation dictionaries.
Tasks Machine Translation
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4504/
PDF https://www.aclweb.org/anthology/W16-4504
PWC https://paperswithcode.com/paper/predicting-translation-equivalents-in-linked
Repo
Framework

PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors

Title PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors
Authors G{"o}zde {"O}zbal, Carlo Strapparava, Serra Sinem Tekiro{\u{g}}lu
Abstract Proverbs are commonly metaphoric in nature and the mapping across domains is commonly established in proverbs. The abundance of proverbs in terms of metaphors makes them an extremely valuable linguistic resource since they can be utilized as a gold standard for various metaphor related linguistic tasks such as metaphor identification or interpretation. Besides, a collection of proverbs fromvarious languages annotated with metaphors would also be essential for social scientists to explore the cultural differences betweenthose languages. In this paper, we introduce PROMETHEUS, a dataset consisting of English proverbs and their equivalents in Italian.In addition to the word-level metaphor annotations for each proverb, PROMETHEUS contains other types of information such as the metaphoricity degree of the overall proverb, its meaning, the century that it was first recorded in and a pair of subjective questions responded by the annotators. To the best of our knowledge, this is the first multi-lingual and open-domain corpus of proverbs annotated with word-level metaphors.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1600/
PDF https://www.aclweb.org/anthology/L16-1600
PWC https://paperswithcode.com/paper/prometheus-a-corpus-of-proverbs-annotated
Repo
Framework
comments powered by Disqus