May 4, 2019

1776 words 9 mins read

Paper Group NANR 183

Generation of Conceptual-Level Text Cloud with Graph Diffusion. Multivariate tests of association based on univariate tests. Proceedings of the Fourth BioASQ workshop. SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis. Identifying News from Tweets. Constructing an Annotated Corpus for Protest Event Mining. U …

Generation of Conceptual-Level Text Cloud with Graph Diffusion


Title	Generation of Conceptual-Level Text Cloud with Graph Diffusion
Authors	Ying-Chun Lin, Po-An Yang, Yen-Kuan Lee, Kun-Ta Chuang
Abstract
Tasks	Keyword Extraction
Published	2016-10-01
URL	https://www.aclweb.org/anthology/O16-1034/
PDF	https://www.aclweb.org/anthology/O16-1034
PWC	https://paperswithcode.com/paper/generation-of-conceptual-level-text-cloud
Repo
Framework

Multivariate tests of association based on univariate tests


Title	Multivariate tests of association based on univariate tests
Authors	Ruth Heller, Yair Heller
Abstract	For testing two vector random variables for independence, we propose testing whether the distance of one vector from an arbitrary center point is independent from the distance of the other vector from another arbitrary center point by a univariate test. We prove that under minimal assumptions, it is enough to have a consistent univariate independence test on the distances, to guarantee that the power to detect dependence between the random vectors increases to one with sample size. If the univariate test is distribution-free, the multivariate test will also be distribution-free. If we consider multiple center points and aggregate the center-specific univariate tests, the power may be further improved, and the resulting multivariate test may be distribution-free for specific aggregation methods (if the univariate test is distribution-free). We show that certain multivariate tests recently proposed in the literature can be viewed as instances of this general approach. Moreover, we show in experiments that novel tests constructed using our approach can have better power and computational time than competing approaches.
Tasks
Published	2016-12-01
URL	http://papers.nips.cc/paper/6220-multivariate-tests-of-association-based-on-univariate-tests
PDF	http://papers.nips.cc/paper/6220-multivariate-tests-of-association-based-on-univariate-tests.pdf
PWC	https://paperswithcode.com/paper/multivariate-tests-of-association-based-on
Repo
Framework

Proceedings of the Fourth BioASQ workshop


Title	Proceedings of the Fourth BioASQ workshop
Authors
Abstract
Tasks	Question Answering
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-3100/
PDF	https://www.aclweb.org/anthology/W16-3100
PWC	https://paperswithcode.com/paper/proceedings-of-the-fourth-bioasq-workshop
Repo
Framework

SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis


Title	SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis
Authors	Mickael Rouvier, Benoit Favre
Abstract
Tasks	Sentence Classification, Sentiment Analysis, Word Embeddings
Published	2016-06-01
URL	https://www.aclweb.org/anthology/S16-1030/
PDF	https://www.aclweb.org/anthology/S16-1030
PWC	https://paperswithcode.com/paper/sensei-lif-at-semeval-2016-task-4-polarity
Repo
Framework

Identifying News from Tweets


Title	Identifying News from Tweets
Authors	Jesse Freitas, Heng Ji
Abstract
Tasks	Active Learning
Published	2016-11-01
URL	https://www.aclweb.org/anthology/W16-5602/
PDF	https://www.aclweb.org/anthology/W16-5602
PWC	https://paperswithcode.com/paper/identifying-news-from-tweets
Repo
Framework

Constructing an Annotated Corpus for Protest Event Mining


Title	Constructing an Annotated Corpus for Protest Event Mining
Authors	Peter Makarov, Jasmine Lorenzini, Hanspeter Kriesi
Abstract
Tasks
Published	2016-11-01
URL	https://www.aclweb.org/anthology/W16-5613/
PDF	https://www.aclweb.org/anthology/W16-5613
PWC	https://paperswithcode.com/paper/constructing-an-annotated-corpus-for-protest
Repo
Framework

Universal Dependencies for Turkish


Title	Universal Dependencies for Turkish
Authors	Umut Sulubacak, Memduh Gokirmak, Francis Tyers, {\c{C}}a{\u{g}}r{\i} {\c{C}}{"o}ltekin, Joakim Nivre, G{"u}l{\c{s}}en Eryi{\u{g}}it
Abstract	The Universal Dependencies (UD) project was conceived after the substantial recent interest in unifying annotation schemes across languages. With its own annotation principles and abstract inventory for parts of speech, morphosyntactic features and dependency relations, UD aims to facilitate multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. This paper presents the Turkish IMST-UD Treebank, the first Turkish treebank to be in a UD release. The IMST-UD Treebank was automatically converted from the IMST Treebank, which was also recently released. We describe this conversion procedure in detail, complete with mapping tables. We also present our evaluation of the parsing performances of both versions of the IMST Treebank. Our findings suggest that the UD framework is at least as viable for Turkish as the original annotation framework of the IMST Treebank.
Tasks
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1325/
PDF	https://www.aclweb.org/anthology/C16-1325
PWC	https://paperswithcode.com/paper/universal-dependencies-for-turkish
Repo
Framework

Automatic Syllabification for Manipuri language


Title	Automatic Syllabification for Manipuri language
Authors	Loitongbam Gyanendro Singh, Lenin Laitonjam, Sanasam Ranbir Singh
Abstract	Development of hand crafted rule for syllabifying words of a language is an expensive task. This paper proposes several data-driven methods for automatic syllabification of words written in Manipuri language. Manipuri is one of the scheduled Indian languages. First, we propose a language-independent rule-based approach formulated using entropy based phonotactic segmentation. Second, we project the syllabification problem as a sequence labeling problem and investigate its effect using various sequence labeling approaches. Third, we combine the effect of sequence labeling and rule-based method and investigate the performance of the hybrid approach. From various experimental observations, it is evident that the proposed methods outperform the baseline rule-based method. The entropy based phonotactic segmentation provides a word accuracy of 96{%}, CRF (sequence labeling approach) provides 97{%} and hybrid approach provides 98{%} word accuracy.
Tasks	Speech Recognition, Speech Synthesis, Text-To-Speech Synthesis
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1034/
PDF	https://www.aclweb.org/anthology/C16-1034
PWC	https://paperswithcode.com/paper/automatic-syllabification-for-manipuri
Repo
Framework

SRDF: Extracting Lexical Knowledge Graph for Preserving Sentence Meaning


Title	SRDF: Extracting Lexical Knowledge Graph for Preserving Sentence Meaning
Authors	Sangha Nam, GyuHyeon Choi, Younggyun Hahm, Key-Sun Choi
Abstract	In this paper, we present an open information extraction system so-called SRDF that generates lexical knowledge graphs from unstructured texts. In semantic web, knowledge is expressed in the RDF triple form but the natural language text consist of multiple relations between arguments. For this reason, we combine open information extraction with the reification for the full text extraction to preserve meaning of sentence in our knowledge graph. And also our knowledge graph is designed to adapt for many existing semantic web applications. At the end of this paper, we introduce the result of the experiment and a Korean template generation module developed using SRDF.
Tasks	Dependency Parsing, Knowledge Graphs, Open Information Extraction, Question Answering, Reading Comprehension
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4411/
PDF	https://www.aclweb.org/anthology/W16-4411
PWC	https://paperswithcode.com/paper/srdf-extracting-lexical-knowledge-graph-for
Repo
Framework

Combining Heterogeneous User Generated Data to Sense Well-being


Title	Combining Heterogeneous User Generated Data to Sense Well-being
Authors	Adam Tsakalidis, Maria Liakata, Theo Damoulas, Brigitte Jellinek, Weisi Guo, Alex Cristea, ra
Abstract	In this paper we address a new problem of predicting affect and well-being scales in a real-world setting of heterogeneous, longitudinal and non-synchronous textual as well as non-linguistic data that can be harvested from on-line media and mobile phones. We describe the method for collecting the heterogeneous longitudinal data, how features are extracted to address missing information and differences in temporal alignment, and how the latter are combined to yield promising predictions of affect and well-being on the basis of widely used psychological scales. We achieve a coefficient of determination ($R^2$) of 0.71-0.76 and a correlation coefficient of 0.68-0.87 which is higher than the state-of-the art in equivalent multi-modal tasks for affect.
Tasks	Emotion Recognition
Published	2016-12-01
URL	https://www.aclweb.org/anthology/C16-1283/
PDF	https://www.aclweb.org/anthology/C16-1283
PWC	https://paperswithcode.com/paper/combining-heterogeneous-user-generated-data
Repo
Framework

Neural Network Language Models for Candidate Scoring in Hybrid Multi-System Machine Translation


Title	Neural Network Language Models for Candidate Scoring in Hybrid Multi-System Machine Translation
Authors	Mat{=\i}ss Rikters
Abstract	This paper presents the comparison of how using different neural network based language modeling tools for selecting the best candidate fragments affects the final output translation quality in a hybrid multi-system machine translation setup. Experiments were conducted by comparing perplexity and BLEU scores on common test cases using the same training data set. A 12-gram statistical language model was selected as a baseline to oppose three neural network based models of different characteristics. The models were integrated in a hybrid system that depends on the perplexity score of a sentence fragment to produce the best fitting translations. The results show a correlation between language model perplexity and BLEU scores as well as overall improvements in BLEU.
Tasks	Language Modelling, Machine Translation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4502/
PDF	https://www.aclweb.org/anthology/W16-4502
PWC	https://paperswithcode.com/paper/neural-network-language-models-for-candidate
Repo
Framework

Image-Image Search for Comparable Corpora Construction


Title	Image-Image Search for Comparable Corpora Construction
Authors	Yu Hong, Liang Yao, Mengyi Liu, Tongtao Zhang, Wenxuan Zhou, Jianmin Yao, Heng Ji
Abstract	We present a novel method of comparable corpora construction. Unlike the traditional methods which heavily rely on linguistic features, our method only takes image similarity into consid-eration. We use an image-image search engine to obtain similar images, together with the cap-tions in source language and target language. On the basis, we utilize captions of similar imag-es to construct sentence-level bilingual corpora. Experiments on 10,371 target captions show that our method achieves a precision of 0.85 in the top search results.
Tasks	Image Retrieval
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4503/
PDF	https://www.aclweb.org/anthology/W16-4503
PWC	https://paperswithcode.com/paper/image-image-search-for-comparable-corpora
Repo
Framework

Significance of an Accurate Sandhi-Splitter in Shallow Parsing of Dravidian Languages


Title	Significance of an Accurate Sandhi-Splitter in Shallow Parsing of Dravidian Languages
Authors	Devadath V V, Dipti Misra Sharma
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-3006/
PDF	https://www.aclweb.org/anthology/P16-3006
PWC	https://paperswithcode.com/paper/significance-of-an-accurate-sandhi-splitter
Repo
Framework

Predicting Translation Equivalents in Linked WordNets


Title	Predicting Translation Equivalents in Linked WordNets
Authors	Krasimir Angelov, Gleb Lobanov
Abstract	We present an algorithm for predicting translation equivalents between two languages, based on the corresponding WordNets. The assumption is that all synsets of one of the languages are linked to the corresponding synsets in the other language. In theory, given the exact sense of a word in a context it must be possible to translate it as any of the words in the linked synset. In practice, however, this does not work well since automatic and accurate sense disambiguation is difficult. Instead it is possible to define a more robust translation relation between the lexemes of the two languages. As far as we know the Finnish WordNet is the only one that includes that relation. Our algorithm can be used to predict the relation for other languages as well. This is useful for instance in hybrid machine translation systems which are usually more dependent on high-quality translation dictionaries.
Tasks	Machine Translation
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4504/
PDF	https://www.aclweb.org/anthology/W16-4504
PWC	https://paperswithcode.com/paper/predicting-translation-equivalents-in-linked
Repo
Framework

PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors


Title	PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors
Authors	G{"o}zde {"O}zbal, Carlo Strapparava, Serra Sinem Tekiro{\u{g}}lu
Abstract	Proverbs are commonly metaphoric in nature and the mapping across domains is commonly established in proverbs. The abundance of proverbs in terms of metaphors makes them an extremely valuable linguistic resource since they can be utilized as a gold standard for various metaphor related linguistic tasks such as metaphor identification or interpretation. Besides, a collection of proverbs fromvarious languages annotated with metaphors would also be essential for social scientists to explore the cultural differences betweenthose languages. In this paper, we introduce PROMETHEUS, a dataset consisting of English proverbs and their equivalents in Italian.In addition to the word-level metaphor annotations for each proverb, PROMETHEUS contains other types of information such as the metaphoricity degree of the overall proverb, its meaning, the century that it was first recorded in and a pair of subjective questions responded by the annotators. To the best of our knowledge, this is the first multi-lingual and open-domain corpus of proverbs annotated with word-level metaphors.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1600/
PDF	https://www.aclweb.org/anthology/L16-1600
PWC	https://paperswithcode.com/paper/prometheus-a-corpus-of-proverbs-annotated
Repo
Framework