May 4, 2019

1664 words 8 mins read

Paper Group NANR 157

Paper Group NANR 157

WOLVESAAR at SemEval-2016 Task 1: Replicating the Success of Monolingual Word Alignment and Neural Embeddings for Semantic Textual Similarity. SimiHawk at SemEval-2016 Task 1: A Deep Ensemble System for Semantic Textual Similarity. Korean FrameNet Expansion Based on Projection of Japanese FrameNet. Improved Word Embeddings with Implicit Structure I …

WOLVESAAR at SemEval-2016 Task 1: Replicating the Success of Monolingual Word Alignment and Neural Embeddings for Semantic Textual Similarity

Title WOLVESAAR at SemEval-2016 Task 1: Replicating the Success of Monolingual Word Alignment and Neural Embeddings for Semantic Textual Similarity
Authors Hannah Bechara, Rohit Gupta, Liling Tan, Constantin Or{\u{a}}san, Ruslan Mitkov, Josef van Genabith
Abstract
Tasks Information Retrieval, Machine Translation, Semantic Textual Similarity, Text Summarization, Word Alignment, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1096/
PDF https://www.aclweb.org/anthology/S16-1096
PWC https://paperswithcode.com/paper/wolvesaar-at-semeval-2016-task-1-replicating
Repo
Framework

SimiHawk at SemEval-2016 Task 1: A Deep Ensemble System for Semantic Textual Similarity

Title SimiHawk at SemEval-2016 Task 1: A Deep Ensemble System for Semantic Textual Similarity
Authors Peter Potash, William Boag, Alexey Romanov, Vasili Ramanishka, Anna Rumshisky
Abstract
Tasks Machine Translation, Natural Language Inference, Semantic Textual Similarity, Word Alignment, Word Embeddings
Published 2016-06-01
URL https://www.aclweb.org/anthology/S16-1115/
PDF https://www.aclweb.org/anthology/S16-1115
PWC https://paperswithcode.com/paper/simihawk-at-semeval-2016-task-1-a-deep
Repo
Framework

Korean FrameNet Expansion Based on Projection of Japanese FrameNet

Title Korean FrameNet Expansion Based on Projection of Japanese FrameNet
Authors Jeong-uk Kim, Younggyun Hahm, Key-Sun Choi
Abstract FrameNet project has begun from Berkeley in 1997, and is now supported in several countries reflecting characteristics of each language. The work for generating Korean FrameNet was already done by converting annotated English sentences into Korean with trained translators. However, high cost of frame-preservation and error revision was a huge burden on further expansion of FrameNet. This study makes use of linguistic similarity between Japanese and Korean to increase Korean FrameNet corpus with low cost. We also suggest adapting PubAnnotation and Korean-friendly valence patterns to FrameNet for increased accessibility.
Tasks Machine Translation, Reading Comprehension, Semantic Role Labeling
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2037/
PDF https://www.aclweb.org/anthology/C16-2037
PWC https://paperswithcode.com/paper/korean-framenet-expansion-based-on-projection
Repo
Framework

Improved Word Embeddings with Implicit Structure Information

Title Improved Word Embeddings with Implicit Structure Information
Authors Jie Shen, Cong Liu
Abstract Distributed word representation is an efficient method for capturing semantic and syntactic word relations. In this work, we introduce an extension to the continuous bag-of-words model for learning word representations efficiently by using implicit structure information. Instead of relying on a syntactic parser which might be noisy and slow to build, we compute weights representing probabilities of syntactic relations based on the Huffman softmax tree in an efficient heuristic. The constructed {``}implicit graphs{''} from these weights show that these weights contain useful implicit structure information. Extensive experiments performed on several word similarity and word analogy tasks show gains compared to the basic continuous bag-of-words model. |
Tasks Dependency Parsing, Language Modelling, Learning Word Embeddings, Machine Translation, Part-Of-Speech Tagging, Sentiment Analysis, Word Embeddings
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1227/
PDF https://www.aclweb.org/anthology/C16-1227
PWC https://paperswithcode.com/paper/improved-word-embeddings-with-implicit
Repo
Framework

Examples are not enough, learn to criticize! Criticism for Interpretability

Title Examples are not enough, learn to criticize! Criticism for Interpretability
Authors Been Kim, Rajiv Khanna, Oluwasanmi O. Koyejo
Abstract Example-based explanations are widely used in the effort to improve the interpretability of highly complex distributions. However, prototypes alone are rarely sufficient to represent the gist of the complexity. In order for users to construct better mental models and understand complex data distributions, we also need {\em criticism} to explain what are \textit{not} captured by prototypes. Motivated by the Bayesian model criticism framework, we develop \texttt{MMD-critic} which efficiently learns prototypes and criticism, designed to aid human interpretability. A human subject pilot study shows that the \texttt{MMD-critic} selects prototypes and criticism that are useful to facilitate human understanding and reasoning. We also evaluate the prototypes selected by \texttt{MMD-critic} via a nearest prototype classifier, showing competitive performance compared to baselines.
Tasks
Published 2016-12-01
URL http://papers.nips.cc/paper/6300-examples-are-not-enough-learn-to-criticize-criticism-for-interpretability
PDF http://papers.nips.cc/paper/6300-examples-are-not-enough-learn-to-criticize-criticism-for-interpretability.pdf
PWC https://paperswithcode.com/paper/examples-are-not-enough-learn-to-criticize
Repo
Framework

Arabizi Identification in Twitter Data

Title Arabizi Identification in Twitter Data
Authors Taha Tobaili
Abstract
Tasks Sentiment Analysis
Published 2016-08-01
URL https://www.aclweb.org/anthology/P16-3008/
PDF https://www.aclweb.org/anthology/P16-3008
PWC https://paperswithcode.com/paper/arabizi-identification-in-twitter-data
Repo
Framework

PAT workbench: Annotation and Evaluation of Text and Pictures in Multimodal Instructions

Title PAT workbench: Annotation and Evaluation of Text and Pictures in Multimodal Instructions
Authors Ielka van der Sluis, Lennart Kloppenburg, Gisela Redeker
Abstract This paper presents a tool to investigate the design of multimodal instructions (MIs), i.e., instructions that contain both text and pictures. The benefit of including pictures in information presentation has been established, but the characteristics of those pictures and of their textual counterparts and the rela-tion(s) between them have not been researched in a systematic manner. We present the PAT Work-bench, a tool to store, annotate and retrieve MIs based on a validated coding scheme with currently 42 categories that describe instructions in terms of textual features, pictorial elements, and relations be-tween text and pictures. We describe how the PAT Workbench facilitates collaborative annotation and inter-annotator agreement calculation. Future work on the tool includes expanding its functionality and usability by (i) making the MI annotation scheme dynamic for adding relevant features based on empirical evaluations of the MIs, (ii) implementing algorithms for automatic tagging of MI features, and (iii) implementing automatic MI evaluation algorithms based on results obtained via e.g. crowdsourced assessments of MIs.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-4018/
PDF https://www.aclweb.org/anthology/W16-4018
PWC https://paperswithcode.com/paper/pat-workbench-annotation-and-evaluation-of
Repo
Framework

Dictionaries as Networks: Identifying the graph structure of Ogden’s Basic English

Title Dictionaries as Networks: Identifying the graph structure of Ogden’s Basic English
Authors Camilo Garrido, Claudio Gutierrez
Abstract We study the network structure underlying dictionaries. We systematize the properties of such networks and show their relevance for linguistics. As case of study, we apply this technique to identify the graph structure of Ogden{'}s Basic English. We show that it constitutes a strong core of the English language network and that classic centrality measures fail to capture this set of words.
Tasks
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-1336/
PDF https://www.aclweb.org/anthology/C16-1336
PWC https://paperswithcode.com/paper/dictionaries-as-networks-identifying-the
Repo
Framework

Poet Admits // Mute Cypher: Beam Search to find Mutually Enciphering Poetic Texts

Title Poet Admits // Mute Cypher: Beam Search to find Mutually Enciphering Poetic Texts
Authors Cole Peterson, Alona Fyshe
Abstract
Tasks
Published 2016-11-01
URL https://www.aclweb.org/anthology/D16-1141/
PDF https://www.aclweb.org/anthology/D16-1141
PWC https://paperswithcode.com/paper/poet-admits-mute-cypher-beam-search-to-find
Repo
Framework

Shallow Discourse Parsing Using Convolutional Neural Network

Title Shallow Discourse Parsing Using Convolutional Neural Network
Authors Lianhui Qin, Zhisong Zhang, Hai Zhao
Abstract
Tasks Feature Engineering, Feature Selection, Language Modelling
Published 2016-08-01
URL https://www.aclweb.org/anthology/K16-2010/
PDF https://www.aclweb.org/anthology/K16-2010
PWC https://paperswithcode.com/paper/shallow-discourse-parsing-using-convolutional
Repo
Framework

What’s up on Twitter? Catch up with TWIST!

Title What’s up on Twitter? Catch up with TWIST!
Authors Marina Litvak, Natalia Vanetik, Efi Levi, Michael Roistacher
Abstract Event detection and analysis with respect to public opinions and sentiments in social media is a broad and well-addressed research topic. However, the characteristics and sheer volume of noisy Twitter messages make this a difficult task. This demonstration paper describes a TWItter event Summarizer and Trend detector (TWIST) system for event detection, visualization, textual description, and geo-sentiment analysis of real-life events reported in Twitter.
Tasks Sentiment Analysis
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2045/
PDF https://www.aclweb.org/anthology/C16-2045
PWC https://paperswithcode.com/paper/whatas-up-on-twitter-catch-up-with-twist
Repo
Framework

Twitter Named Entity Extraction and Linking Using Differential Evolution

Title Twitter Named Entity Extraction and Linking Using Differential Evolution
Authors Utpal Kumar Sikdar, Bj{"o}rn Gamb{"a}ck
Abstract
Tasks Entity Extraction, Entity Linking, Machine Translation, Question Answering
Published 2016-12-01
URL https://www.aclweb.org/anthology/W16-6326/
PDF https://www.aclweb.org/anthology/W16-6326
PWC https://paperswithcode.com/paper/twitter-named-entity-extraction-and-linking
Repo
Framework

WISDOM X, DISAANA and D-SUMM: Large-scale NLP Systems for Analyzing Textual Big Data

Title WISDOM X, DISAANA and D-SUMM: Large-scale NLP Systems for Analyzing Textual Big Data
Authors Junta Mizuno, Masahiro Tanaka, Kiyonori Ohtake, Jong-Hoon Oh, Julien Kloetzer, Chikara Hashimoto, Kentaro Torisawa
Abstract We demonstrate our large-scale NLP systems: WISDOM X, DISAANA, and D-SUMM. WISDOM X provides numerous possible answers including unpredictable ones to widely diverse natural language questions to provide deep insights about a broad range of issues. DISAANA and D-SUMM enable us to assess the damage caused by large-scale disasters in real time using Twitter as an information source.
Tasks Open-Domain Question Answering, Question Answering
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2055/
PDF https://www.aclweb.org/anthology/C16-2055
PWC https://paperswithcode.com/paper/wisdom-x-disaana-and-d-summ-large-scale-nlp
Repo
Framework

Illinois Cross-Lingual Wikifier: Grounding Entities in Many Languages to the English Wikipedia

Title Illinois Cross-Lingual Wikifier: Grounding Entities in Many Languages to the English Wikipedia
Authors Chen-Tse Tsai, Dan Roth
Abstract We release a cross-lingual wikification system for all languages in Wikipedia. Given a piece of text in any supported language, the system identifies names of people, locations, organizations, and grounds these names to the corresponding English Wikipedia entries. The system is based on two components: a cross-lingual named entity recognition (NER) model and a cross-lingual mention grounding model. The cross-lingual NER model is a language-independent model which can extract named entity mentions in the text of any language in Wikipedia. The extracted mentions are then grounded to the English Wikipedia using the cross-lingual mention grounding model. The only resources required to train the proposed system are the multilingual Wikipedia dump and existing training data for English NER. The system is online at \url{http://cogcomp.cs.illinois.edu/page/demo_view/xl_wikifier}
Tasks Entity Linking, Named Entity Recognition, Tokenization
Published 2016-12-01
URL https://www.aclweb.org/anthology/C16-2031/
PDF https://www.aclweb.org/anthology/C16-2031
PWC https://paperswithcode.com/paper/illinois-cross-lingual-wikifier-grounding
Repo
Framework

Deep Learning of Audio and Language Features for Humor Prediction

Title Deep Learning of Audio and Language Features for Humor Prediction
Authors Dario Bertero, Pascale Fung
Abstract We propose a comparison between various supervised machine learning methods to predict and detect humor in dialogues. We retrieve our humorous dialogues from a very popular TV sitcom: {``}The Big Bang Theory{''}. We build a corpus where punchlines are annotated using the canned laughter embedded in the audio track. Our comparative study involves a linear-chain Conditional Random Field over a Recurrent Neural Network and a Convolutional Neural Network. Using a combination of word-level and audio frame-level features, the CNN outperforms the other methods, obtaining the best F-score of 68.5{%} over 66.5{%} by CRF and 52.9{%} by RNN. Our work is a starting point to developing more effective machine learning and neural network models on the humor prediction task, as well as developing machines capable in understanding humor in general. |
Tasks
Published 2016-05-01
URL https://www.aclweb.org/anthology/L16-1079/
PDF https://www.aclweb.org/anthology/L16-1079
PWC https://paperswithcode.com/paper/deep-learning-of-audio-and-language-features
Repo
Framework
comments powered by Disqus