May 5, 2019

1994 words 10 mins read

Paper Group NANR 144

Paper Group NANR 144

DUTIR in BioNLP-ST 2016: Utilizing Convolutional Network and Distributed Representation to Extract Complicate Relations. Proceedings of the Open Knowledge Base and Question Answering Workshop (OKBQA 2016). Proceedings of the 5th International Workshop on Computational Terminology (Computerm2016). CATaLog Online: Porting a Post-editing Tool to the W …

DUTIR in BioNLP-ST 2016: Utilizing Convolutional Network and Distributed Representation to Extract Complicate Relations

Title DUTIR in BioNLP-ST 2016: Utilizing Convolutional Network and Distributed Representation to Extract Complicate Relations
Authors Honglei Li, Jianhai Zhang, Jian Wang, Hongfei Lin, Zhihao Yang
Abstract
Tasks Feature Engineering, Relation Extraction
Published 2016-08-01
URL https://www.aclweb.org/anthology/W16-3012/
PDF https://www.aclweb.org/anthology/W16-3012
PWC https://paperswithcode.com/paper/dutir-in-bionlp-st-2016-utilizing
Repo
Framework

Proceedings of the Open Knowledge Base and Question Answering Workshop (OKBQA 2016)

Title Proceedings of the Open Knowledge Base and Question Answering Workshop (OKBQA 2016)
Authors
Abstract
Tasks Question Answering
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4400/
PDF https://www.aclweb.org/anthology/W16-4400
PWC https://paperswithcode.com/paper/proceedings-of-the-open-knowledge-base-and
Repo
Framework

Proceedings of the 5th International Workshop on Computational Terminology (Computerm2016)

Title Proceedings of the 5th International Workshop on Computational Terminology (Computerm2016)
Authors
Abstract
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4700/
PDF https://www.aclweb.org/anthology/W16-4700
PWC https://paperswithcode.com/paper/proceedings-of-the-5th-international-workshop
Repo
Framework

CATaLog Online: Porting a Post-editing Tool to the Web

Title CATaLog Online: Porting a Post-editing Tool to the Web
Authors Santanu Pal, Marcos Zampieri, Sudip Kumar Naskar, Tapas Nayak, Mihaela Vela, Josef van Genabith
Abstract This paper presents CATaLog online, a new web-based MT and TM post-editing tool. CATaLog online is a freeware software that can be used through a web browser and it requires only a simple registration. The tool features a number of editing and log functions similar to the desktop version of CATaLog enhanced with several new features that we describe in detail in this paper. CATaLog online is designed to allow users to post-edit both translation memory segments as well as machine translation output. The tool provides a complete set of log information currently not available in most commercial CAT tools. Log information can be used both for project management purposes as well as for the study of the translation process and translator{'}s productivity.
Tasks Machine Translation
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1095/
PDF https://www.aclweb.org/anthology/L16-1095
PWC https://paperswithcode.com/paper/catalog-online-porting-a-post-editing-tool-to
Repo
Framework

Typology of Adjectives Benchmark for Compositional Distributional Models

Title Typology of Adjectives Benchmark for Compositional Distributional Models
Authors Daria Ryzhova, Maria Kyuseva, Denis Paperno
Abstract In this paper we present a novel application of compositional distributional semantic models (CDSMs): prediction of lexical typology. The paper introduces the notion of typological closeness, which is a novel rigorous formalization of semantic similarity based on comparison of multilingual data. Starting from the Moscow Database of Qualitative Features for adjective typology, we create four datasets of typological closeness, on which we test a range of distributional semantic models. We show that, on the one hand, vector representations of phrases based on data from one language can be used to predict how words within the phrase translate into different languages, and, on the other hand, that typological data can serve as a semantic benchmark for distributional models. We find that compositional distributional models, especially parametric ones, perform way above non-compositional alternatives on the task.
Tasks Semantic Similarity, Semantic Textual Similarity
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1199/
PDF https://www.aclweb.org/anthology/L16-1199
PWC https://paperswithcode.com/paper/typology-of-adjectives-benchmark-for
Repo
Framework

Deep Submodular Functions: Definitions and Learning

Title Deep Submodular Functions: Definitions and Learning
Authors Brian W. Dolhansky, Jeff A. Bilmes
Abstract We propose and study a new class of submodular functions called deep submodular functions (DSFs). We define DSFs and situate them within the broader context of classes of submodular functions in relationship both to various matroid ranks and sums of concave composed with modular functions (SCMs). Notably, we find that DSFs constitute a strictly broader class than SCMs, thus motivating their use, but that they do not comprise all submodular functions. Interestingly, some DSFs can be seen as special cases of certain deep neural networks (DNNs), hence the name. Finally, we provide a method to learn DSFs in a max-margin framework, and offer preliminary results applying this both to synthetic and real-world data instances.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6361-deep-submodular-functions-definitions-and-learning
PDF http://papers.nips.cc/paper/6361-deep-submodular-functions-definitions-and-learning.pdf
PWC https://paperswithcode.com/paper/deep-submodular-functions-definitions-and
Repo
Framework

A Proposition-Based Abstractive Summariser

Title A Proposition-Based Abstractive Summariser
Authors Yimai Fang, Haoyue Zhu, Ewa Muszy{'n}ska, Alex Kuhnle, er, Simone Teufel
Abstract Abstractive summarisation is not yet common amongst today{'}s deployed and research systems. Most existing systems either extract sentences or compress individual sentences. In this paper, we present a summariser that works by a different paradigm. It is a further development of an existing summariser that has an incremental, proposition-based content selection process but lacks a natural language (NL) generator for the final output. Using an NL generator, we can now produce the summary text to directly reflect the selected propositions. Our evaluation compares textual quality of our system to the earlier preliminary output method, and also uses ROUGE to compare to various summarisers that use the traditional method of sentence extraction, followed by compression. Our results suggest that cutting out the middle-man of sentence extraction can lead to better abstractive summaries.
Tasks Language Modelling, Sentence Compression
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1055/
PDF https://www.aclweb.org/anthology/C16-1055
PWC https://paperswithcode.com/paper/a-proposition-based-abstractive-summariser
Repo
Framework

Concepticon: A Resource for the Linking of Concept Lists

Title Concepticon: A Resource for the Linking of Concept Lists
Authors Johann-Mattis List, Michael Cysouw, Robert Forkel
Abstract We present an attempt to link the large amount of different concept lists which are used in the linguistic literature, ranging from Swadesh lists in historical linguistics to naming tests in clinical studies and psycholinguistics. This resource, our Concepticon, links 30 222 concept labels from 160 conceptlists to 2495 concept sets. Each concept set is given a unique identifier, a unique label, and a human-readable definition. Concept sets are further structured by defining different relations between the concepts. The resource can be used for various purposes. Serving as a rich reference for new and existing databases in diachronic and synchronic linguistics, it allows researchers a quick access to studies on semantic change, cross-linguistic polysemies, and semantic associations.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1379/
PDF https://www.aclweb.org/anthology/L16-1379
PWC https://paperswithcode.com/paper/concepticon-a-resource-for-the-linking-of
Repo
Framework

Punctuation Prediction for Unsegmented Transcript Based on Word Vector

Title Punctuation Prediction for Unsegmented Transcript Based on Word Vector
Authors Xiaoyin Che, Cheng Wang, Haojin Yang, Christoph Meinel
Abstract In this paper we propose an approach to predict punctuation marks for unsegmented speech transcript. The approach is purely lexical, with pre-trained Word Vectors as the only input. A training model of Deep Neural Network (DNN) or Convolutional Neural Network (CNN) is applied to classify whether a punctuation mark should be inserted after the third word of a 5-words sequence and which kind of punctuation mark the inserted one should be. TED talks within IWSLT dataset are used in both training and evaluation phases. The proposed approach shows its effectiveness by achieving better result than the state-of-the-art lexical solution which works with same type of data, especially when predicting puncuation position only.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1103/
PDF https://www.aclweb.org/anthology/L16-1103
PWC https://paperswithcode.com/paper/punctuation-prediction-for-unsegmented
Repo
Framework

Analysing Constraint Grammars with a SAT-solver

Title Analysing Constraint Grammars with a SAT-solver
Authors Inari Listenmaa, Koen Claessen
Abstract We describe a method for analysing Constraint Grammars (CG) that can detect internal conflicts and redundancies in a given grammar, without the need for a corpus. The aim is for grammar writers to be able to automatically diagnose, and then manually improve their grammars. Our method works by translating the given grammar into logical constraints that are analysed by a SAT-solver. We have evaluated our analysis on a number of non-trivial grammars and found inconsistencies.
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1111/
PDF https://www.aclweb.org/anthology/L16-1111
PWC https://paperswithcode.com/paper/analysing-constraint-grammars-with-a-sat
Repo
Framework

A Joint Sentiment-Target-Stance Model for Stance Classification in Tweets

Title A Joint Sentiment-Target-Stance Model for Stance Classification in Tweets
Authors Javid Ebrahimi, Dejing Dou, Daniel Lowd
Abstract Classifying the stance expressed in online microblogging social media is an emerging problem in opinion mining. We propose a probabilistic approach to stance classification in tweets, which models stance, target of stance, and sentiment of tweet, jointly. Instead of simply conjoining the sentiment or target variables as extra variables to the feature space, we use a novel formulation to incorporate three-way interactions among sentiment-stance-input variables and three-way interactions among target-stance-input variables. The proposed specification intuitively aims to discriminate sentiment features from target features for stance classification. In addition, regularizing a single stance classifier, which handles all targets, acts as a soft weight-sharing among them. We demonstrate that discriminative training of this model achieves the state-of-the-art results in supervised stance classification, and its generative training obtains competitive results in the weakly supervised setting.
Tasks Argument Mining, Opinion Mining, Sentiment Analysis, Stance Detection, Subjectivity Analysis
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1250/
PDF https://www.aclweb.org/anthology/C16-1250
PWC https://paperswithcode.com/paper/a-joint-sentiment-target-stance-model-for
Repo
Framework

A subtree-based factorization of dependency parsing

Title A subtree-based factorization of dependency parsing
Authors Qiuye Zhao, Qun Liu
Abstract We propose a dependency parsing pipeline, in which the parsing of long-distance projections and localized dependencies are explicitly decomposed at the input level. A chosen baseline dependency parsing model performs only on {`}carved{'} sequences at the second stage, which are transformed from coarse constituent parsing outputs at the first stage. When k-best constituent parsing outputs are kept, a third-stage is required to search for an optimal combination of the overlapped dependency subtrees. In this sense, our dependency model is subtree-factored. We explore alternative approaches for scoring subtrees, including feature-based models as well as continuous representations. The search for optimal subset to combine is formulated as an ILP problem. This framework especially benefits the models poor on long sentences, generally improving baselines by 0.75-1.28 (UAS) on English, achieving comparable performance with high-order models but faster. For Chinese, the most notable increase is as high as 3.63 (UAS) when the proposed framework is applied to first-order parsing models. |
Tasks Dependency Parsing
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1057/
PDF https://www.aclweb.org/anthology/C16-1057
PWC https://paperswithcode.com/paper/a-subtree-based-factorization-of-dependency
Repo
Framework

A Bayesian model for joint word alignment and part-of-speech transfer

Title A Bayesian model for joint word alignment and part-of-speech transfer
Authors Robert {"O}stling
Abstract Current methods for word alignment require considerable amounts of parallel text to deliver accurate results, a requirement which is met only for a small minority of the world{'}s approximately 7,000 languages. We show that by jointly performing word alignment and annotation transfer in a novel Bayesian model, alignment accuracy can be improved for language pairs where annotations are available for only one of the languages{—}a finding which could facilitate the study and processing of a vast number of low-resource languages. We also present an evaluation where our method is used to perform single-source and multi-source part-of-speech transfer with 22 translations of the same text in four different languages. This allows us to quantify the considerable variation in accuracy depending on the specific source text(s) used, even with different translations into the same language.
Tasks Machine Translation, Word Alignment, Word Sense Disambiguation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1060/
PDF https://www.aclweb.org/anthology/C16-1060
PWC https://paperswithcode.com/paper/a-bayesian-model-for-joint-word-alignment-and
Repo
Framework

An Alternate View on Strong Lexicalization in TAG

Title An Alternate View on Strong Lexicalization in TAG
Authors Aniello De Santo, Al{"e}na Aks{"e}nova, Thomas Graf
Abstract
Tasks
Published 2016-06-01
URL https://www.aclweb.org/anthology/W16-3310/
PDF https://www.aclweb.org/anthology/W16-3310
PWC https://paperswithcode.com/paper/an-alternate-view-on-strong-lexicalization-in
Repo
Framework

Probabilistic Prototype Model for Serendipitous Property Mining

Title Probabilistic Prototype Model for Serendipitous Property Mining
Authors Taesung Lee, Seung-won Hwang, Zhongyuan Wang
Abstract Besides providing the relevant information, amusing users has been an important role of the web. Many web sites provide serendipitous (unexpected but relevant) information to draw user traffic. In this paper, we study the representative scenario of mining an amusing quiz. An existing approach leverages a knowledge base to mine an unexpected property then find quiz questions on such property, based on prototype theory in cognitive science. However, existing deterministic model is vulnerable to noise in the knowledge base. Therefore, we instead propose to leverage probabilistic approach to build a prototype that can overcome noise. Our extensive empirical study shows that our approach not only significantly outperforms baselines by 0.06 in accuracy, and 0.11 in serendipity but also shows higher relevance than the traditional relevance-pursuing baseline using TF-IDF.
Tasks Question Generation
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1064/
PDF https://www.aclweb.org/anthology/C16-1064
PWC https://paperswithcode.com/paper/probabilistic-prototype-model-for
Repo
Framework
comments powered by Disqus