Paper Group NANR 183
Generation of Conceptual-Level Text Cloud with Graph Diffusion. Multivariate tests of association based on univariate tests. Proceedings of the Fourth BioASQ workshop. SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis. Identifying News from Tweets. Constructing an Annotated Corpus for Protest Event Mining. U …
Generation of Conceptual-Level Text Cloud with Graph Diffusion
Title | Generation of Conceptual-Level Text Cloud with Graph Diffusion |
Authors | Ying-Chun Lin, Po-An Yang, Yen-Kuan Lee, Kun-Ta Chuang |
Abstract | |
Tasks | Keyword Extraction |
Published | 2016-10-01 |
URL | https://www.aclweb.org/anthology/O16-1034/ |
https://www.aclweb.org/anthology/O16-1034 | |
PWC | https://paperswithcode.com/paper/generation-of-conceptual-level-text-cloud |
Repo | |
Framework | |
Multivariate tests of association based on univariate tests
Title | Multivariate tests of association based on univariate tests |
Authors | Ruth Heller, Yair Heller |
Abstract | For testing two vector random variables for independence, we propose testing whether the distance of one vector from an arbitrary center point is independent from the distance of the other vector from another arbitrary center point by a univariate test. We prove that under minimal assumptions, it is enough to have a consistent univariate independence test on the distances, to guarantee that the power to detect dependence between the random vectors increases to one with sample size. If the univariate test is distribution-free, the multivariate test will also be distribution-free. If we consider multiple center points and aggregate the center-specific univariate tests, the power may be further improved, and the resulting multivariate test may be distribution-free for specific aggregation methods (if the univariate test is distribution-free). We show that certain multivariate tests recently proposed in the literature can be viewed as instances of this general approach. Moreover, we show in experiments that novel tests constructed using our approach can have better power and computational time than competing approaches. |
Tasks | |
Published | 2016-12-01 |
URL | http://papers.nips.cc/paper/6220-multivariate-tests-of-association-based-on-univariate-tests |
http://papers.nips.cc/paper/6220-multivariate-tests-of-association-based-on-univariate-tests.pdf | |
PWC | https://paperswithcode.com/paper/multivariate-tests-of-association-based-on |
Repo | |
Framework | |
Proceedings of the Fourth BioASQ workshop
Title | Proceedings of the Fourth BioASQ workshop |
Authors | |
Abstract | |
Tasks | Question Answering |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/W16-3100/ |
https://www.aclweb.org/anthology/W16-3100 | |
PWC | https://paperswithcode.com/paper/proceedings-of-the-fourth-bioasq-workshop |
Repo | |
Framework | |
SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis
Title | SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis |
Authors | Mickael Rouvier, Benoit Favre |
Abstract | |
Tasks | Sentence Classification, Sentiment Analysis, Word Embeddings |
Published | 2016-06-01 |
URL | https://www.aclweb.org/anthology/S16-1030/ |
https://www.aclweb.org/anthology/S16-1030 | |
PWC | https://paperswithcode.com/paper/sensei-lif-at-semeval-2016-task-4-polarity |
Repo | |
Framework | |
Identifying News from Tweets
Title | Identifying News from Tweets |
Authors | Jesse Freitas, Heng Ji |
Abstract | |
Tasks | Active Learning |
Published | 2016-11-01 |
URL | https://www.aclweb.org/anthology/W16-5602/ |
https://www.aclweb.org/anthology/W16-5602 | |
PWC | https://paperswithcode.com/paper/identifying-news-from-tweets |
Repo | |
Framework | |
Constructing an Annotated Corpus for Protest Event Mining
Title | Constructing an Annotated Corpus for Protest Event Mining |
Authors | Peter Makarov, Jasmine Lorenzini, Hanspeter Kriesi |
Abstract | |
Tasks | |
Published | 2016-11-01 |
URL | https://www.aclweb.org/anthology/W16-5613/ |
https://www.aclweb.org/anthology/W16-5613 | |
PWC | https://paperswithcode.com/paper/constructing-an-annotated-corpus-for-protest |
Repo | |
Framework | |
Universal Dependencies for Turkish
Title | Universal Dependencies for Turkish |
Authors | Umut Sulubacak, Memduh Gokirmak, Francis Tyers, {\c{C}}a{\u{g}}r{\i} {\c{C}}{"o}ltekin, Joakim Nivre, G{"u}l{\c{s}}en Eryi{\u{g}}it |
Abstract | The Universal Dependencies (UD) project was conceived after the substantial recent interest in unifying annotation schemes across languages. With its own annotation principles and abstract inventory for parts of speech, morphosyntactic features and dependency relations, UD aims to facilitate multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. This paper presents the Turkish IMST-UD Treebank, the first Turkish treebank to be in a UD release. The IMST-UD Treebank was automatically converted from the IMST Treebank, which was also recently released. We describe this conversion procedure in detail, complete with mapping tables. We also present our evaluation of the parsing performances of both versions of the IMST Treebank. Our findings suggest that the UD framework is at least as viable for Turkish as the original annotation framework of the IMST Treebank. |
Tasks | |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-1325/ |
https://www.aclweb.org/anthology/C16-1325 | |
PWC | https://paperswithcode.com/paper/universal-dependencies-for-turkish |
Repo | |
Framework | |
Automatic Syllabification for Manipuri language
Title | Automatic Syllabification for Manipuri language |
Authors | Loitongbam Gyanendro Singh, Lenin Laitonjam, Sanasam Ranbir Singh |
Abstract | Development of hand crafted rule for syllabifying words of a language is an expensive task. This paper proposes several data-driven methods for automatic syllabification of words written in Manipuri language. Manipuri is one of the scheduled Indian languages. First, we propose a language-independent rule-based approach formulated using entropy based phonotactic segmentation. Second, we project the syllabification problem as a sequence labeling problem and investigate its effect using various sequence labeling approaches. Third, we combine the effect of sequence labeling and rule-based method and investigate the performance of the hybrid approach. From various experimental observations, it is evident that the proposed methods outperform the baseline rule-based method. The entropy based phonotactic segmentation provides a word accuracy of 96{%}, CRF (sequence labeling approach) provides 97{%} and hybrid approach provides 98{%} word accuracy. |
Tasks | Speech Recognition, Speech Synthesis, Text-To-Speech Synthesis |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-1034/ |
https://www.aclweb.org/anthology/C16-1034 | |
PWC | https://paperswithcode.com/paper/automatic-syllabification-for-manipuri |
Repo | |
Framework | |
SRDF: Extracting Lexical Knowledge Graph for Preserving Sentence Meaning
Title | SRDF: Extracting Lexical Knowledge Graph for Preserving Sentence Meaning |
Authors | Sangha Nam, GyuHyeon Choi, Younggyun Hahm, Key-Sun Choi |
Abstract | In this paper, we present an open information extraction system so-called SRDF that generates lexical knowledge graphs from unstructured texts. In semantic web, knowledge is expressed in the RDF triple form but the natural language text consist of multiple relations between arguments. For this reason, we combine open information extraction with the reification for the full text extraction to preserve meaning of sentence in our knowledge graph. And also our knowledge graph is designed to adapt for many existing semantic web applications. At the end of this paper, we introduce the result of the experiment and a Korean template generation module developed using SRDF. |
Tasks | Dependency Parsing, Knowledge Graphs, Open Information Extraction, Question Answering, Reading Comprehension |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4411/ |
https://www.aclweb.org/anthology/W16-4411 | |
PWC | https://paperswithcode.com/paper/srdf-extracting-lexical-knowledge-graph-for |
Repo | |
Framework | |
Combining Heterogeneous User Generated Data to Sense Well-being
Title | Combining Heterogeneous User Generated Data to Sense Well-being |
Authors | Adam Tsakalidis, Maria Liakata, Theo Damoulas, Brigitte Jellinek, Weisi Guo, Alex Cristea, ra |
Abstract | In this paper we address a new problem of predicting affect and well-being scales in a real-world setting of heterogeneous, longitudinal and non-synchronous textual as well as non-linguistic data that can be harvested from on-line media and mobile phones. We describe the method for collecting the heterogeneous longitudinal data, how features are extracted to address missing information and differences in temporal alignment, and how the latter are combined to yield promising predictions of affect and well-being on the basis of widely used psychological scales. We achieve a coefficient of determination ($R^2$) of 0.71-0.76 and a correlation coefficient of 0.68-0.87 which is higher than the state-of-the art in equivalent multi-modal tasks for affect. |
Tasks | Emotion Recognition |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/C16-1283/ |
https://www.aclweb.org/anthology/C16-1283 | |
PWC | https://paperswithcode.com/paper/combining-heterogeneous-user-generated-data |
Repo | |
Framework | |
Neural Network Language Models for Candidate Scoring in Hybrid Multi-System Machine Translation
Title | Neural Network Language Models for Candidate Scoring in Hybrid Multi-System Machine Translation |
Authors | Mat{=\i}ss Rikters |
Abstract | This paper presents the comparison of how using different neural network based language modeling tools for selecting the best candidate fragments affects the final output translation quality in a hybrid multi-system machine translation setup. Experiments were conducted by comparing perplexity and BLEU scores on common test cases using the same training data set. A 12-gram statistical language model was selected as a baseline to oppose three neural network based models of different characteristics. The models were integrated in a hybrid system that depends on the perplexity score of a sentence fragment to produce the best fitting translations. The results show a correlation between language model perplexity and BLEU scores as well as overall improvements in BLEU. |
Tasks | Language Modelling, Machine Translation |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4502/ |
https://www.aclweb.org/anthology/W16-4502 | |
PWC | https://paperswithcode.com/paper/neural-network-language-models-for-candidate |
Repo | |
Framework | |
Image-Image Search for Comparable Corpora Construction
Title | Image-Image Search for Comparable Corpora Construction |
Authors | Yu Hong, Liang Yao, Mengyi Liu, Tongtao Zhang, Wenxuan Zhou, Jianmin Yao, Heng Ji |
Abstract | We present a novel method of comparable corpora construction. Unlike the traditional methods which heavily rely on linguistic features, our method only takes image similarity into consid-eration. We use an image-image search engine to obtain similar images, together with the cap-tions in source language and target language. On the basis, we utilize captions of similar imag-es to construct sentence-level bilingual corpora. Experiments on 10,371 target captions show that our method achieves a precision of 0.85 in the top search results. |
Tasks | Image Retrieval |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4503/ |
https://www.aclweb.org/anthology/W16-4503 | |
PWC | https://paperswithcode.com/paper/image-image-search-for-comparable-corpora |
Repo | |
Framework | |
Significance of an Accurate Sandhi-Splitter in Shallow Parsing of Dravidian Languages
Title | Significance of an Accurate Sandhi-Splitter in Shallow Parsing of Dravidian Languages |
Authors | Devadath V V, Dipti Misra Sharma |
Abstract | |
Tasks | |
Published | 2016-08-01 |
URL | https://www.aclweb.org/anthology/P16-3006/ |
https://www.aclweb.org/anthology/P16-3006 | |
PWC | https://paperswithcode.com/paper/significance-of-an-accurate-sandhi-splitter |
Repo | |
Framework | |
Predicting Translation Equivalents in Linked WordNets
Title | Predicting Translation Equivalents in Linked WordNets |
Authors | Krasimir Angelov, Gleb Lobanov |
Abstract | We present an algorithm for predicting translation equivalents between two languages, based on the corresponding WordNets. The assumption is that all synsets of one of the languages are linked to the corresponding synsets in the other language. In theory, given the exact sense of a word in a context it must be possible to translate it as any of the words in the linked synset. In practice, however, this does not work well since automatic and accurate sense disambiguation is difficult. Instead it is possible to define a more robust translation relation between the lexemes of the two languages. As far as we know the Finnish WordNet is the only one that includes that relation. Our algorithm can be used to predict the relation for other languages as well. This is useful for instance in hybrid machine translation systems which are usually more dependent on high-quality translation dictionaries. |
Tasks | Machine Translation |
Published | 2016-12-01 |
URL | https://www.aclweb.org/anthology/W16-4504/ |
https://www.aclweb.org/anthology/W16-4504 | |
PWC | https://paperswithcode.com/paper/predicting-translation-equivalents-in-linked |
Repo | |
Framework | |
PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors
Title | PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors |
Authors | G{"o}zde {"O}zbal, Carlo Strapparava, Serra Sinem Tekiro{\u{g}}lu |
Abstract | Proverbs are commonly metaphoric in nature and the mapping across domains is commonly established in proverbs. The abundance of proverbs in terms of metaphors makes them an extremely valuable linguistic resource since they can be utilized as a gold standard for various metaphor related linguistic tasks such as metaphor identification or interpretation. Besides, a collection of proverbs fromvarious languages annotated with metaphors would also be essential for social scientists to explore the cultural differences betweenthose languages. In this paper, we introduce PROMETHEUS, a dataset consisting of English proverbs and their equivalents in Italian.In addition to the word-level metaphor annotations for each proverb, PROMETHEUS contains other types of information such as the metaphoricity degree of the overall proverb, its meaning, the century that it was first recorded in and a pair of subjective questions responded by the annotators. To the best of our knowledge, this is the first multi-lingual and open-domain corpus of proverbs annotated with word-level metaphors. |
Tasks | |
Published | 2016-05-01 |
URL | https://www.aclweb.org/anthology/L16-1600/ |
https://www.aclweb.org/anthology/L16-1600 | |
PWC | https://paperswithcode.com/paper/prometheus-a-corpus-of-proverbs-annotated |
Repo | |
Framework | |