May 4, 2019

1704 words 8 mins read

Paper Group NANR 178

When Hyperparameters Help: Beneficial Parameter Combinations in Distributional Semantic Models. Using TEI for textbook research. DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents. Morphological reinflection with conditional random fields and unsupervised features. Deconvolving Feedback Loops in Recommender …

When Hyperparameters Help: Beneficial Parameter Combinations in Distributional Semantic Models


Title	When Hyperparameters Help: Beneficial Parameter Combinations in Distributional Semantic Models
Authors	Alicia Krebs, Denis Paperno
Abstract
Tasks	Hyperparameter Optimization, Word Embeddings
Published	2016-08-01
URL	https://www.aclweb.org/anthology/papers/S16-2011/s16-2011
PDF	https://www.aclweb.org/anthology/S16-2011v2
PWC	https://paperswithcode.com/paper/when-hyperparameters-help-beneficial
Repo
Framework

Using TEI for textbook research


Title	Using TEI for textbook research
Authors	Lena-Luise Stahn, Steffen Hennicke, Ernesto William De Luca
Abstract	The following paper describes the first steps in the development of an ontology for the textbook research discipline. The aim of the project WorldViews is to establish a digital edition focussing on views of the world depicted in textbooks. For this purpose an initial TEI profile has been formalised and tested as a use case to enable the semantical encoding of the resource {`}textbook{'}. This profile shall provide a basic data model describing major facets of the textbook{'}s structure relevant to historians. \|
Tasks	Information Retrieval
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-4024/
PDF	https://www.aclweb.org/anthology/W16-4024
PWC	https://paperswithcode.com/paper/using-tei-for-textbook-research
Repo
Framework

DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents


Title	DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents
Authors	Zhao Yan, Nan Duan, Junwei Bao, Peng Chen, Ming Zhou, Zhoujun Li, Jianshe Zhou
Abstract
Tasks	Chatbot, Community Question Answering, Information Retrieval, Learning-To-Rank, Machine Translation, Question Answering, Short-Text Conversation, Text Generation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1049/
PDF	https://www.aclweb.org/anthology/P16-1049
PWC	https://paperswithcode.com/paper/docchat-an-information-retrieval-approach-for
Repo
Framework

Morphological reinflection with conditional random fields and unsupervised features


Title	Morphological reinflection with conditional random fields and unsupervised features
Authors	Ling Liu, Lingshuang Jack Mao
Abstract
Tasks
Published	2016-08-01
URL	https://www.aclweb.org/anthology/W16-2006/
PDF	https://www.aclweb.org/anthology/W16-2006
PWC	https://paperswithcode.com/paper/morphological-reinflection-with-conditional
Repo
Framework

Deconvolving Feedback Loops in Recommender Systems


Title	Deconvolving Feedback Loops in Recommender Systems
Authors	Ayan Sinha, David F. Gleich, Karthik Ramani
Abstract	Collaborative filtering is a popular technique to infer users’ preferences on new content based on the collective information of all users preferences. Recommender systems then use this information to make personalized suggestions to users. When users accept these recommendations it creates a feedback loop in the recommender system, and these loops iteratively influence the collaborative filtering algorithm’s predictions over time. We investigate whether it is possible to identify items affected by these feedback loops. We state sufficient assumptions to deconvolve the feedback loops while keeping the inverse solution tractable. We furthermore develop a metric to unravel the recommender system’s influence on the entire user-item rating matrix. We use this metric on synthetic and real-world datasets to (1) identify the extent to which the recommender system affects the final rating matrix, (2) rank frequently recommended items, and (3) distinguish whether a user’s rated item was recommended or an intrinsic preference. Our results indicate that it is possible to recover the ratings matrix of intrinsic user preferences using a single snapshot of the ratings matrix without any temporal information.
Tasks	Recommendation Systems
Published	2016-12-01
URL	http://papers.nips.cc/paper/6283-deconvolving-feedback-loops-in-recommender-systems
PDF	http://papers.nips.cc/paper/6283-deconvolving-feedback-loops-in-recommender-systems.pdf
PWC	https://paperswithcode.com/paper/deconvolving-feedback-loops-in-recommender
Repo
Framework

Automatic Translation of English Text to Indian Sign Language Synthetic Animations


Title	Automatic Translation of English Text to Indian Sign Language Synthetic Animations
Authors	Lalit Goyal, Vishal Goyal
Abstract
Tasks	Lemmatization
Published	2016-12-01
URL	https://www.aclweb.org/anthology/W16-6319/
PDF	https://www.aclweb.org/anthology/W16-6319
PWC	https://paperswithcode.com/paper/automatic-translation-of-english-text-to
Repo
Framework

Event Nugget and Event Coreference Annotation


Title	Event Nugget and Event Coreference Annotation
Authors	Zhiyi Song, Ann Bies, Stephanie Strassel, Joe Ellis, Teruko Mitamura, Hoa Trang Dang, Yukari Yamakawa, Sue Holm
Abstract
Tasks	Knowledge Base Population
Published	2016-06-01
URL	https://www.aclweb.org/anthology/W16-1005/
PDF	https://www.aclweb.org/anthology/W16-1005
PWC	https://paperswithcode.com/paper/event-nugget-and-event-coreference-annotation
Repo
Framework

UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing


Title	UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing
Authors	Milan Straka, Jan Haji{\v{c}}, Jana Strakov{'a}
Abstract	Automatic natural language processing of large texts often presents recurring challenges in multiple languages: even for most advanced tasks, the texts are first processed by basic processing steps {–} from tokenization to parsing. We present an extremely simple-to-use tool consisting of one binary and one model (per language), which performs these tasks for multiple languages without the need for any other external data. UDPipe, a pipeline processing CoNLL-U-formatted files, performs tokenization, morphological analysis, part-of-speech tagging, lemmatization and dependency parsing for nearly all treebanks of Universal Dependencies 1.2 (namely, the whole pipeline is currently available for 32 out of 37 treebanks). In addition, the pipeline is easily trainable with training data in CoNLL-U format (and in some cases also with additional raw corpora) and requires minimal linguistic knowledge on the users{'} part. The training code is also released.
Tasks	Dependency Parsing, Lemmatization, Morphological Analysis, Part-Of-Speech Tagging, Tokenization
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1680/
PDF	https://www.aclweb.org/anthology/L16-1680
PWC	https://paperswithcode.com/paper/udpipe-trainable-pipeline-for-processing
Repo
Framework

The George Washington University System for the Code-Switching Workshop Shared Task 2016


Title	The George Washington University System for the Code-Switching Workshop Shared Task 2016
Authors	Mohamed Al-Badrashiny, Mona Diab
Abstract
Tasks
Published	2016-11-01
URL	https://www.aclweb.org/anthology/W16-5813/
PDF	https://www.aclweb.org/anthology/W16-5813
PWC	https://paperswithcode.com/paper/the-george-washington-university-system-for
Repo
Framework

DeQue: A Lexicon of Complex Prepositions and Conjunctions in French


Title	DeQue: A Lexicon of Complex Prepositions and Conjunctions in French
Authors	Carlos Ramisch, Alexis Nasr, Andr{'e} Valli, Jos{'e} Deulofeu
Abstract	We introduce DeQue, a lexicon covering French complex prepositions (CPRE) like {}{\`a} partir de{''} (from) and complex conjunctions (CCONJ) like {}bien que{''} (although). The lexicon includes fine-grained linguistic description based on empirical evidence. We describe the general characteristics of CPRE and CCONJ in French, with special focus on syntactic ambiguity. Then, we list the selection criteria used to build the lexicon and the corpus-based methodology employed to collect entries. Finally, we quantify the ambiguity of each construction by annotating around 100 sentences randomly taken from the FRWaC. In addition to its theoretical value, the resource has many potential practical applications. We intend to employ DeQue for treebank annotation and to train a dependency parser that can takes complex constructions into account.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1363/
PDF	https://www.aclweb.org/anthology/L16-1363
PWC	https://paperswithcode.com/paper/deque-a-lexicon-of-complex-prepositions-and
Repo
Framework

IMS HotCoref DE: A Data-driven Co-reference Resolver for German


Title	IMS HotCoref DE: A Data-driven Co-reference Resolver for German
Authors	Ina Roesiger, Jonas Kuhn
Abstract	This paper presents a data-driven co-reference resolution system for German that has been adapted from IMS HotCoref, a co-reference resolver for English. It describes the difficulties when resolving co-reference in German text, the adaptation process and the features designed to address linguistic challenges brought forth by German. We report performance on the reference dataset T{"u}Ba-D/Z and include a post-task SemEval 2010 evaluation, showing that the resolver achieves state-of-the-art performance. We also include ablation experiments that indicate that integrating linguistic features increases results. The paper also describes the steps and the format necessary to use the resolver on new texts. The tool is freely available for download.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1024/
PDF	https://www.aclweb.org/anthology/L16-1024
PWC	https://paperswithcode.com/paper/ims-hotcoref-de-a-data-driven-co-reference
Repo
Framework

Language Resource Citation: the ISLRN Dissemination and Further Developments


Title	Language Resource Citation: the ISLRN Dissemination and Further Developments
Authors	Val{'e}rie Mapelli, Vladimir Popescu, Lin Liu, Khalid Choukri
Abstract	This article presents the latest dissemination activities and technical developments that were carried out for the International Standard Language Resource Number (ISLRN) service. It also recalls the main principle and submission process for providers to obtain their 13-digit ISLRN identifier. Up to March 2016, 2100 Language Resources were allocated an ISLRN number, not only ELRA{'}s and LDC{'}s catalogued Language Resources, but also the ones from other important organisations like the Joint Research Centre (JRC) and the Resource Management Agency (RMA) who expressed their strong support to this initiative.In the research field, not only assigning a unique identification number is important, but also referring to a Language Resource as an object \textit{per se} (like publications) has now become an obvious requirement. The ISLRN could also become an important parameter to be considered to compute a Language Resource Impact Factor (LRIF) in order to recognize the merits of the producers of Language Resources. Integrating the ISLRN number into a LR-oriented bibliographical reference is thus part of the objective. The idea is to make use of a BibTeX entry that would take into account Language Resources items, including ISLRN.The ISLRN being a requested field within the LREC 2016 submission, we expect that several other LRs will be allocated an ISLRN number by the conference date. With this expansion, this number aims to be a spreadly-used LR citation instrument within works referring to LRs.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1254/
PDF	https://www.aclweb.org/anthology/L16-1254
PWC	https://paperswithcode.com/paper/language-resource-citation-the-islrn
Repo
Framework

Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene


Title	Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene
Authors	Izaskun Etxeberria, I{~n}aki Alegria, Larraitz Uria, Mans Hulden
Abstract	This paper presents a method for the normalization of historical texts using a combination of weighted finite-state transducers and language models. We have extended our previous work on the normalization of dialectal texts and tested the method against a 17th century literary work in Basque. This preprocessed corpus is made available in the LREC repository. The performance of this method for learning relations between historical and contemporary word forms is evaluated against resources in three languages. The method we present learns to map phonological changes using a noisy channel model. The model is based on techniques commonly used for phonological inference and producing Grapheme-to-Grapheme conversion systems encoded as weighted transducers and produces F-scores above 80{%} in the task for Basque. A wider evaluation shows that the approach performs equally well with all the languages in our evaluation suite: Basque, Spanish and Slovene. A comparison against other methods that address the same task is also provided.
Tasks
Published	2016-05-01
URL	https://www.aclweb.org/anthology/L16-1169/
PDF	https://www.aclweb.org/anthology/L16-1169
PWC	https://paperswithcode.com/paper/evaluating-the-noisy-channel-model-for-the
Repo
Framework

Continuous Profile Models in ASL Syntactic Facial Expression Synthesis


Title	Continuous Profile Models in ASL Syntactic Facial Expression Synthesis
Authors	Hernisa Kacorri, Matt Huenerfauth
Abstract
Tasks	Machine Translation
Published	2016-08-01
URL	https://www.aclweb.org/anthology/P16-1196/
PDF	https://www.aclweb.org/anthology/P16-1196
PWC	https://paperswithcode.com/paper/continuous-profile-models-in-asl-syntactic
Repo
Framework

A Corpus and Semantic Parser for Multilingual Natural Language Querying of OpenStreetMap


Title	A Corpus and Semantic Parser for Multilingual Natural Language Querying of OpenStreetMap
Authors	Carolin Haas, Stefan Riezler
Abstract
Tasks	Machine Translation, Semantic Parsing
Published	2016-06-01
URL	https://www.aclweb.org/anthology/N16-1088/
PDF	https://www.aclweb.org/anthology/N16-1088
PWC	https://paperswithcode.com/paper/a-corpus-and-semantic-parser-for-multilingual
Repo
Framework