May 5, 2019

1073 words 6 mins read

Paper Group NANR 73

Supersense tagging with inter-annotator disagreement. Automatically Inferring Implicit Properties in Similes. Fast and Robust POS tagger for Arabic Tweets Using Agreement-based Bootstrapping. Part-of-speech Tagging of Code-Mixed Social Media Text. Using Word Embeddings for Improving Statistical Machine Translation of Phrasal Verbs. Inferring Morpho …

Supersense tagging with inter-annotator disagreement


Title	Supersense tagging with inter-annotator disagreement
Authors	H{'e}ctor Mart{'\i}nez Alonso, Anders Johannsen, Barbara Plank
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-1706/
PDF	https://www.aclweb.org/anthology/W16-1706
PWC	https://paperswithcode.com/paper/supersense-tagging-with-inter-annotator
Repo
Framework

Automatically Inferring Implicit Properties in Similes


Title	Automatically Inferring Implicit Properties in Similes
Authors	Ashequl Qadir, Ellen Riloff, Marilyn A. Walker
Abstract
Tasks	Sentiment Analysis, Word Embeddings
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-1146/
PDF	https://www.aclweb.org/anthology/N16-1146
PWC	https://paperswithcode.com/paper/automatically-inferring-implicit-properties
Repo
Framework

Fast and Robust POS tagger for Arabic Tweets Using Agreement-based Bootstrapping


Title	Fast and Robust POS tagger for Arabic Tweets Using Agreement-based Bootstrapping
Authors	Fahad Albogamy, Allan Ramsay
Abstract	Part-of-Speech(POS) tagging is a key step in many NLP algorithms. However, tweets are difficult to POS tag because they are short, are not always written maintaining formal grammar and proper spelling, and abbreviations are often used to overcome their restricted lengths. Arabic tweets also show a further range of linguistic phenomena such as usage of different dialects, romanised Arabic and borrowing foreign words. In this paper, we present an evaluation and a detailed error analysis of state-of-the-art POS taggers for Arabic when applied to Arabic tweets. On the basis of this analysis, we combine normalisation and external knowledge to handle the domain noisiness and exploit bootstrapping to construct extra training data in order to improve POS tagging for Arabic tweets. Our results show significant improvements over the performance of a number of well-known taggers for Arabic.
Tasks	Part-Of-Speech Tagging
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1238/
PDF	https://www.aclweb.org/anthology/L16-1238
PWC	https://paperswithcode.com/paper/fast-and-robust-pos-tagger-for-arabic-tweets
Repo
Framework


Title	Part-of-speech Tagging of Code-Mixed Social Media Text
Authors	Souvick Ghosh, Satanu Ghosh, Dipankar Das
Abstract
Tasks	Language Identification, Part-Of-Speech Tagging
Published	2016-11-01
URL	https://www.aclweb.org/anthology/W16-5811/
PDF	https://www.aclweb.org/anthology/W16-5811
PWC	https://paperswithcode.com/paper/part-of-speech-tagging-of-code-mixed-social-1
Repo
Framework

Using Word Embeddings for Improving Statistical Machine Translation of Phrasal Verbs


Title	Using Word Embeddings for Improving Statistical Machine Translation of Phrasal Verbs
Authors	Kostadin Cholakov, Valia Kordoni
Abstract
Tasks	Machine Translation, Semantic Textual Similarity, Word Embeddings
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-1808/
PDF	https://www.aclweb.org/anthology/W16-1808
PWC	https://paperswithcode.com/paper/using-word-embeddings-for-improving
Repo
Framework

Inferring Morphotactics from Interlinear Glossed Text: Combining Clustering and Precision Grammars


Title	Inferring Morphotactics from Interlinear Glossed Text: Combining Clustering and Precision Grammars
Authors	Olga Zamaraeva
Abstract
Tasks	Morphological Analysis
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2021/
PDF	https://www.aclweb.org/anthology/W16-2021
PWC	https://paperswithcode.com/paper/inferring-morphotactics-from-interlinear
Repo
Framework

Comparing Translator Acceptability of TM and SMT Outputs


Title	Comparing Translator Acceptability of TM and SMT Outputs
Authors	Joss Moorkens, Andy Way
Abstract
Tasks	Machine Translation
Published	2016-01-01
URL	https://www.aclweb.org/anthology/W16-3404/
PDF	https://www.aclweb.org/anthology/W16-3404
PWC	https://paperswithcode.com/paper/comparing-translator-acceptability-of-tm-and
Repo
Framework

Stand-off Annotation of Web Content as a Legally Safer Alternative to Crawling for Distribution


Title	Stand-off Annotation of Web Content as a Legally Safer Alternative to Crawling for Distribution
Authors	Mikel L. Forcada, Miquel Espl{`a}-Gomis, Juan Antonio P{'e}rez-Ortiz
Abstract
Tasks	Machine Translation
Published	2016-01-01
URL	https://www.aclweb.org/anthology/W16-3405/
PDF	https://www.aclweb.org/anthology/W16-3405
PWC	https://paperswithcode.com/paper/stand-off-annotation-of-web-content-as-a
Repo
Framework

User Embedding for Scholarly Microblog Recommendation


Title	User Embedding for Scholarly Microblog Recommendation
Authors	Yang Yu, Xiaojun Wan, Xinjie Zhou
Abstract
Tasks	Collaborative Ranking
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-2073/
PDF	https://www.aclweb.org/anthology/P16-2073
PWC	https://paperswithcode.com/paper/user-embedding-for-scholarly-microblog
Repo
Framework

A Comparative Study of Minimally Supervised Morphological Segmentation


Title	A Comparative Study of Minimally Supervised Morphological Segmentation
Authors	Teemu Ruokolainen, Oskar Kohonen, Kairit Sirts, Stig-Arne Gr{"o}nroos, Mikko Kurimo, Sami Virpioja
Abstract
Tasks	Boundary Detection
Published	2016-03-01
URL	https://www.aclweb.org/anthology/J16-1003/
PDF	https://www.aclweb.org/anthology/J16-1003
PWC	https://paperswithcode.com/paper/a-comparative-study-of-minimally-supervised
Repo
Framework

Fast and highly parallelizable phrase table for statistical machine translation


Title	Fast and highly parallelizable phrase table for statistical machine translation
Authors	Nikolay Bogoychev, Hieu Hoang
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2211/
PDF	https://www.aclweb.org/anthology/W16-2211
PWC	https://paperswithcode.com/paper/fast-and-highly-parallelizable-phrase-table
Repo
Framework

ParFDA for Instance Selection for Statistical Machine Translation


Title	ParFDA for Instance Selection for Statistical Machine Translation
Authors	Ergun Bi{\c{c}}ici
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2306/
PDF	https://www.aclweb.org/anthology/W16-2306
PWC	https://paperswithcode.com/paper/parfda-for-instance-selection-for-statistical-1
Repo
Framework

A Wizard-of-Oz Study on A Non-Task-Oriented Dialog Systems That Reacts to User Engagement


Title	A Wizard-of-Oz Study on A Non-Task-Oriented Dialog Systems That Reacts to User Engagement
Authors	Zhou Yu, Leah Nicolich-Henkin, Alan W Black, Alex Rudnicky, er
Abstract
Tasks	Machine Translation
Published	2016-09-01
URL	https://www.aclweb.org/anthology/W16-3608/
PDF	https://www.aclweb.org/anthology/W16-3608
PWC	https://paperswithcode.com/paper/a-wizard-of-oz-study-on-a-non-task-oriented
Repo
Framework

What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis


Title	What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis
Authors	Francesco Barbieri, Francesco Ronzano, Horacio Saggion
Abstract	Emojis allow us to describe objects, situations and even feelings with small images, providing a visual and quick way to communicate. In this paper, we analyse emojis used in Twitter with distributional semantic models. We retrieve 10 millions tweets posted by USA users, and we build several skip gram word embedding models by mapping in the same vectorial space both words and emojis. We test our models with semantic similarity experiments, comparing the output of our models with human assessment. We also carry out an exhaustive qualitative evaluation, showing interesting results.
Tasks	Semantic Similarity, Semantic Textual Similarity
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1626/
PDF	https://www.aclweb.org/anthology/L16-1626
PWC	https://paperswithcode.com/paper/what-does-this-emoji-mean-a-vector-space-skip
Repo
Framework

SPLIT: Smart Preprocessing (Quasi) Language Independent Tool


Title	SPLIT: Smart Preprocessing (Quasi) Language Independent Tool
Authors	Mohamed Al-Badrashiny, Arfath Pasha, Mona Diab, Nizar Habash, Owen Rambow, Wael Salloum, Esk, Ramy er
Abstract	Text preprocessing is an important and necessary task for all NLP applications. A simple variation in any preprocessing step may drastically affect the final results. Moreover replicability and comparability, as much as feasible, is one of the goals of our scientific enterprise, thus building systems that can ensure the consistency in our various pipelines would contribute significantly to our goals. The problem has become quite pronounced with the abundance of NLP tools becoming more and more available yet with different levels of specifications. In this paper, we present a dynamic unified preprocessing framework and tool, SPLIT, that is highly configurable based on user requirements which serves as a preprocessing tool for several tools at once. SPLIT aims to standardize the implementations of the most important preprocessing steps by allowing for a unified API that could be exchanged across different researchers to ensure complete transparency in replication. The user is able to select the required preprocessing tasks among a long list of preprocessing steps. The user is also able to specify the order of execution which in turn affects the final preprocessing output.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1640/
PDF	https://www.aclweb.org/anthology/L16-1640
PWC	https://paperswithcode.com/paper/split-smart-preprocessing-quasi-language
Repo
Framework