July 26, 2019


Paper Group NANR 189


Mapping the Paraphrase Database to WordNet


Title	Mapping the Paraphrase Database to WordNet
Authors	Anne Cocos, Marianna Apidianaki, Chris Callison-Burch
Abstract	WordNet has facilitated important research in natural language processing but its usefulness is somewhat limited by its relatively small lexical coverage. The Paraphrase Database (PPDB) covers 650 times more words, but lacks the semantic structure of WordNet that would make it more directly useful for downstream tasks. We present a method for mapping words from PPDB to WordNet synsets with 89% accuracy. The mapping also lays important groundwork for incorporating WordNet's relations into PPDB so as to increase its utility for semantic reasoning in applications.
Tasks
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-1009/
PDF	https://www.aclweb.org/anthology/S17-1009
PWC	https://paperswithcode.com/paper/mapping-the-paraphrase-database-to-wordnet
Repo
Framework

Semantic Frame Labeling with Target-based Neural Model


Title	Semantic Frame Labeling with Target-based Neural Model
Authors	Yukun Feng, Dong Yu, Jian Xu, Chunhua Liu
Abstract	This paper explores the automatic learning of distributed representations of the target's context for semantic frame labeling with a target-based neural model. We constrain the whole sentence as the model's input without feature extraction from the sentence. This is different from many previous works in which local feature extraction of the targets is widely used. This constraint makes the task harder, especially with long sentences, but also makes our model easily applicable to a range of resources and other similar tasks. We evaluate our model on several resources and get the state-of-the-art result on subtask 2 of SemEval 2015 task 15. Finally, we extend the task to the word-sense disambiguation task and we also achieve a strong result in comparison to state-of-the-art work.
Tasks	Feature Engineering, Word Embeddings, Word Sense Disambiguation
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-1010/
PDF	https://www.aclweb.org/anthology/S17-1010
PWC	https://paperswithcode.com/paper/semantic-frame-labeling-with-target-based
Repo
Framework

基於卷積類神經網路之廣播節目音訊事件偵測系統 (Automatic Audio Event Detection of Broadcast Radio Programs Based on Convolution Neural Networks) [In Chinese]


Title	基於卷積類神經網路之廣播節目音訊事件偵測系統 (Automatic Audio Event Detection of Broadcast Radio Programs Based on Convolution Neural Networks) [In Chinese]
Authors	Jhih-wei Chen, Wu-Hua Hsu, Yuan-Fu Liao
Abstract
Tasks
Published	2017-11-01
URL	https://www.aclweb.org/anthology/O17-1003/
PDF	https://www.aclweb.org/anthology/O17-1003
PWC	https://paperswithcode.com/paper/ao14aceccc2e-a1ac-ce3e-aoaa-c3c-automatic
Repo
Framework

Distributed Prediction of Relations for Entities: The Easy, The Difficult, and The Impossible


Title	Distributed Prediction of Relations for Entities: The Easy, The Difficult, and The Impossible
Authors	Abhijeet Gupta, Gemma Boleda, Sebastian Padó
Abstract	Word embeddings are supposed to provide easy access to semantic relations such as "male of" (man–woman). While this claim has been investigated for concepts, little is known about the distributional behavior of relations of (Named) Entities. We describe two word embedding-based models that predict values for relational attributes of entities, and analyse them. The task is challenging, with major performance differences between relations. Contrary to many NLP tasks, high difficulty for a relation does not result from low frequency, but from (a) one-to-many mappings; and (b) lack of context patterns expressing the relation that are easy to pick up by word embeddings.
Tasks	Knowledge Base Completion, Relation Extraction, Word Embeddings
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-1012/
PDF	https://www.aclweb.org/anthology/S17-1012
PWC	https://paperswithcode.com/paper/distributed-prediction-of-relations-for
Repo
Framework

Autobank: a semi-automatic annotation tool for developing deep Minimalist Grammar treebanks


Title	Autobank: a semi-automatic annotation tool for developing deep Minimalist Grammar treebanks
Authors	John Torr
Abstract	This paper presents Autobank, a prototype tool for constructing a wide-coverage Minimalist Grammar (MG) (Stabler 1997), and semi-automatically converting the Penn Treebank (PTB) into a deep Minimalist treebank. The front end of the tool is a graphical user interface which facilitates the rapid development of a seed set of MG trees via manual reannotation of PTB preterminals with MG lexical categories. The system then extracts various dependency mappings between the source and target trees, and uses these in concert with a non-statistical MG parser to automatically reannotate the rest of the corpus. Autobank thus enables deep treebank conversions (and subsequent modifications) without the need for complex transduction algorithms accompanied by cascades of ad hoc rules; instead, the locus of human effort falls directly on the task of grammar construction itself.
Tasks	Machine Translation
Published	2017-04-01
URL	https://www.aclweb.org/anthology/E17-3021/
PDF	https://www.aclweb.org/anthology/E17-3021
PWC	https://paperswithcode.com/paper/autobank-a-semi-automatic-annotation-tool-for
Repo
Framework

The (too Many) Problems of Analogical Reasoning with Word Vectors


Title	The (too Many) Problems of Analogical Reasoning with Word Vectors
Authors	Anna Rogers, Aleksandr Drozd, Bofang Li
Abstract	This paper explores the possibilities of analogical reasoning with vector space models. Given two pairs of words with the same relation (e.g. man:woman :: king:queen), it was proposed that the offset between one pair of the corresponding word vectors can be used to identify the unknown member of the other pair (king - man + woman = queen). We argue against such "linguistic regularities" as a model for linguistic relations in vector space models and as a benchmark, and we show that the vector offset (as well as two other, better-performing methods) suffers from dependence on vector similarity.
Tasks	Word Embeddings, Word Sense Disambiguation
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-1017/
PDF	https://www.aclweb.org/anthology/S17-1017
PWC	https://paperswithcode.com/paper/the-too-many-problems-of-analogical-reasoning
Repo
Framework
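
The vector offset idea described in this abstract (king - man + woman = queen) can be sketched in a few lines. This is only an illustrative sketch, not the authors' code: the three-dimensional vectors are invented toy values, the function name `offset_analogy` is hypothetical, and excluding the three source words from the candidate set is an assumed convention that the abstract does not spell out.

```python
import math

# Toy 3-dimensional vectors, invented for illustration (not taken from the paper).
VECTORS = {
    "man":   [1.0, 0.0, 0.2],
    "woman": [1.0, 1.0, 0.2],
    "king":  [1.0, 0.0, 1.0],
    "queen": [1.0, 1.0, 1.0],
    "apple": [0.0, 0.2, 0.1],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def offset_analogy(a, a_star, b, vectors):
    """Answer a : a_star :: b : ? using the offset b - a + a_star.

    Assumption: the three source words are excluded from the candidates.
    """
    target = [vb - va + vs
              for va, vs, vb in zip(vectors[a], vectors[a_star], vectors[b])]
    candidates = [w for w in vectors if w not in {a, a_star, b}]
    # Return the remaining word whose vector is most similar to the target.
    return max(candidates, key=lambda w: cosine(vectors[w], target))


answer = offset_analogy("man", "woman", "king", VECTORS)
print(answer)
```

Because the answer is chosen by cosine similarity to the offset point, the result depends on which vectors happen to lie near that point, which is the kind of similarity dependence the abstract refers to.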

Semantic Frames and Visual Scenes: Learning Semantic Role Inventories from Image and Video Descriptions


Title	Semantic Frames and Visual Scenes: Learning Semantic Role Inventories from Image and Video Descriptions
Authors	Ekaterina Shutova, Andreas Wundsam, Helen Yannakoudakis
Abstract	Frame-semantic parsing and semantic role labelling, that aim to automatically assign semantic roles to arguments of verbs in a sentence, have become an active strand of research in NLP. However, to date these methods have relied on a predefined inventory of semantic roles. In this paper, we present a method to automatically learn argument role inventories for verbs from large corpora of text, images and videos. We evaluate the method against manually constructed role inventories in FrameNet and show that the visual model outperforms the language-only model and operates with a high precision.
Tasks	Semantic Parsing
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-1018/
PDF	https://www.aclweb.org/anthology/S17-1018
PWC	https://paperswithcode.com/paper/semantic-frames-and-visual-scenes-learning
Repo
Framework

Logical Metonymy in a Distributional Model of Sentence Comprehension


Title	Logical Metonymy in a Distributional Model of Sentence Comprehension
Authors	Emmanuele Chersoni, Alessandro Lenci, Philippe Blache
Abstract	In theoretical linguistics, logical metonymy is defined as the combination of an event-subcategorizing verb with an entity-denoting direct object (e.g., The author began the book), so that the interpretation of the VP requires the retrieval of a covert event (e.g., writing). Psycholinguistic studies have revealed extra processing costs for logical metonymy, a phenomenon generally explained with the introduction of new semantic structure. In this paper, we present a general distributional model for sentence comprehension inspired by the Memory, Unification and Control model by Hagoort (2013, 2016). We show that our distributional framework can account for the extra processing costs of logical metonymy and can identify the covert event in a classification task.
Tasks
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-1021/
PDF	https://www.aclweb.org/anthology/S17-1021
PWC	https://paperswithcode.com/paper/logical-metonymy-in-a-distributional-model-of
Repo
Framework

Liner2 — a Generic Framework for Named Entity Recognition


Title	Liner2 — a Generic Framework for Named Entity Recognition
Authors	Michał Marcińczuk, Jan Kocoń, Marcin Oleksy
Abstract	In the paper we present an adaptation of the Liner2 framework to solve the BSNLP 2017 shared task on multilingual named entity recognition. The tool is tuned to recognize and lemmatize named entities for Polish.
Tasks	Morphological Analysis, Named Entity Recognition, Tokenization
Published	2017-04-01
URL	https://www.aclweb.org/anthology/W17-1413/
PDF	https://www.aclweb.org/anthology/W17-1413
PWC	https://paperswithcode.com/paper/liner2-a-a-generic-framework-for-named-entity
Repo
Framework

Identifying Semantically Deviating Outlier Documents


Title	Identifying Semantically Deviating Outlier Documents
Authors	Honglei Zhuang, Chi Wang, Fangbo Tao, Lance Kaplan, Jiawei Han
Abstract	A document outlier is a document that substantially deviates in semantics from the majority ones in a corpus. Automatic identification of document outliers can be valuable in many applications, such as screening health records for medical mistakes. In this paper, we study the problem of mining semantically deviating document outliers in a given corpus. We develop a generative model to identify frequent and characteristic semantic regions in the word embedding space to represent the given corpus, and a robust outlierness measure which is resistant to noisy content in documents. Experiments conducted on two real-world textual data sets show that our method can achieve an up to 135% improvement over baselines in terms of recall at top-1% of the outlier ranking.
Tasks	Outlier Detection
Published	2017-09-01
URL	https://www.aclweb.org/anthology/D17-1291/
PDF	https://www.aclweb.org/anthology/D17-1291
PWC	https://paperswithcode.com/paper/identifying-semantically-deviating-outlier
Repo
Framework
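
The evaluation measure named in this abstract, recall at the top 1% of the outlier ranking, can be illustrated as follows. This is a sketch of one common reading of that metric, not the paper's evaluation script: the function name `recall_at_top_percent`, the rounding of the cutoff (ceiling, at least one document), and the example scores are all assumptions made for illustration.

```python
import math


def recall_at_top_percent(scores, true_outliers, percent=1.0):
    """Fraction of true outliers that appear in the top `percent`% of the ranking.

    scores: dict mapping document id -> outlierness score (higher = more outlying).
    true_outliers: collection of document ids labelled as outliers.
    """
    if not true_outliers:
        raise ValueError("need at least one true outlier")
    # Rank documents from most to least outlying.
    ranked = sorted(scores, key=scores.get, reverse=True)
    # Assumed rounding rule: ceiling, and always inspect at least one document.
    cutoff = max(1, math.ceil(len(ranked) * percent / 100.0))
    top = set(ranked[:cutoff])
    return len(top & set(true_outliers)) / len(true_outliers)


# Made-up example: 200 documents whose score equals their id.
example_scores = {doc_id: float(doc_id) for doc_id in range(200)}
example_outliers = {199, 198, 5, 7}
print(recall_at_top_percent(example_scores, example_outliers, percent=1.0))
```

In the made-up example the top 1% contains two documents, both true outliers, so two of the four labelled outliers are recovered.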

以語音能量特性發展即時語速偵測裝置-前導型研究 (Real-time monitoring device of phonation speed and volume based on speech energy: A pilot study) [In Chinese]


Title	以語音能量特性發展即時語速偵測裝置-前導型研究 (Real-time monitoring device of phonation speed and volume based on speech energy: A pilot study) [In Chinese]
Authors	Chi-Te Wang, Feng-Chuan Lin, Wei-Zhung Zheng, Shih-Hau Fang, Yu Tsao, Ying-Hui Lai
Abstract
Tasks
Published	2017-11-01
URL	https://www.aclweb.org/anthology/O17-1027/
PDF	https://www.aclweb.org/anthology/O17-1027
PWC	https://paperswithcode.com/paper/aeae3e12ec1c14aa3eaea-ec12-aaac-c-real-time
Repo
Framework

SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity


Title	SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity
Authors	Jose Camacho-Collados, Mohammad Taher Pilehvar, Nigel Collier, Roberto Navigli
Abstract	This paper introduces a new task on Multilingual and Cross-lingual Semantic Word Similarity which measures the semantic similarity of word pairs within and across five languages: English, Farsi, German, Italian and Spanish. High quality datasets were manually curated for the five languages with high inter-annotator agreements (consistently in the 0.9 ballpark). These were used for semi-automatic construction of ten cross-lingual datasets. 17 teams participated in the task, submitting 24 systems in subtask 1 and 14 systems in subtask 2. Results show that systems that combine statistical knowledge from text corpora, in the form of word embeddings, and external knowledge from lexical resources are best performers in both subtasks. More information can be found on the task website: http://alt.qcri.org/semeval2017/task2/
Tasks	Information Retrieval, Machine Translation, Question Answering, Representation Learning, Semantic Similarity, Semantic Textual Similarity, Text Summarization, Word Embeddings, Word Sense Disambiguation
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-2002/
PDF	https://www.aclweb.org/anthology/S17-2002
PWC	https://paperswithcode.com/paper/semeval-2017-task-2-multilingual-and-cross
Repo
Framework


Identifying the Authors’ National Variety of English in Social Media Texts


Title	Identifying the Authors’ National Variety of English in Social Media Texts
Authors	Vasiliki Simaki, Panagiotis Simakis, Carita Paradis, Andreas Kerren
Abstract	In this paper, we present a study for the identification of authors' national variety of English in texts from social media. In data from Facebook and Twitter, information about the author's social profile is annotated, and the national English variety (US, UK, AUS, CAN, NNS) that each author uses is attributed. We tested four feature types: formal linguistic features, POS features, lexicon-based features related to the different varieties, and data-based features from each English variety. We used various machine learning algorithms for the classification experiments, and we implemented a feature selection process. The classification accuracy achieved, when the 31 highest ranked features were used, was up to 77.32%. The experimental results are evaluated, and the efficacy of the ranked features discussed.
Tasks	Feature Selection
Published	2017-09-01
URL	https://www.aclweb.org/anthology/R17-1086/
PDF	https://doi.org/10.26615/978-954-452-049-6_086
PWC	https://paperswithcode.com/paper/identifying-the-authors-national-variety-of
Repo
Framework

FCICU at SemEval-2017 Task 1: Sense-Based Language Independent Semantic Textual Similarity Approach


Title	FCICU at SemEval-2017 Task 1: Sense-Based Language Independent Semantic Textual Similarity Approach
Authors	Basma Hassan, Samir AbdelRahman, Reem Bahgat, Ibrahim Farag
Abstract	This paper describes FCICU team systems that participated in SemEval-2017 Semantic Textual Similarity task (Task1) for monolingual and cross-lingual sentence pairs. A sense-based language independent textual similarity approach is presented, in which a proposed alignment similarity method coupled with new usage of a semantic network (BabelNet) is used. Additionally, a previously proposed integration between sense-based and surface-based semantic textual similarity approach is applied together with our proposed approach. For all the tracks in Task1, Run1 is a string kernel with alignments metric and Run2 is a sense-based alignment similarity method. The first run is ranked 10th, and the second is ranked 12th in the primary track, with correlation 0.619 and 0.617 respectively.
Tasks	Machine Translation, Semantic Textual Similarity
Published	2017-08-01
URL	https://www.aclweb.org/anthology/S17-2015/
PDF	https://www.aclweb.org/anthology/S17-2015
PWC	https://paperswithcode.com/paper/fcicu-at-semeval-2017-task-1-sense-based
Repo
Framework

What is it? Disambiguating the different readings of the pronoun ‘it’


Title	What is it? Disambiguating the different readings of the pronoun ‘it’
Authors	Sharid Loáiciga, Liane Guillou, Christian Hardmeier
Abstract	In this paper, we address the problem of predicting one of three functions for the English pronoun ‘it’: anaphoric, event reference or pleonastic. This disambiguation is valuable in the context of machine translation and coreference resolution. We present experiments using a MAXENT classifier trained on gold-standard data and self-training experiments of an RNN trained on silver-standard data, annotated using the MAXENT classifier. Lastly, we report on an analysis of the strengths of these two models.
Tasks	Coreference Resolution, Machine Translation
Published	2017-09-01
URL	https://www.aclweb.org/anthology/D17-1137/
PDF	https://www.aclweb.org/anthology/D17-1137
PWC	https://paperswithcode.com/paper/what-is-it-disambiguating-the-different
Repo
Framework