Skip to main content

Showing 1–50 of 58 results for author: Stein, B

  1. arXiv:2405.07920  [pdf, other

    cs.IR

    A Systematic Investigation of Distilling Large Language Models into Cross-Encoders for Passage Re-ranking

    Authors: Ferdinand Schlatt, Maik Fröbe, Harrisen Scells, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Benno Stein, Martin Potthast, Matthias Hagen

    Abstract: Cross-encoders distilled from large language models (LLMs) are often more effective re-rankers than cross-encoders fine-tuned on manually labeled data. However, the distilled models usually do not reach their teacher LLM's effectiveness. To investigate whether best practices for fine-tuning cross-encoders on manually labeled data (e.g., hard-negative sampling, deep sampling, and listwise loss func… ▽ More

    Submitted 16 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2404.09696  [pdf, other

    cs.CL cs.AI cs.ET

    Are Large Language Models Reliable Argument Quality Annotators?

    Authors: Nailia Mirzakhmedova, Marcel Gohsen, Chia Hao Chang, Benno Stein

    Abstract: Evaluating the quality of arguments is a crucial aspect of any system leveraging argument mining. However, it is a challenge to obtain reliable and consistent annotations regarding argument quality, as this usually requires domain-specific expertise of the annotators. Even among experts, the assessment of argument quality is often inconsistent due to the inherent subjectivity of this task. In this… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 18 pages, 5 figures, 5 tables

  3. arXiv:2404.09615  [pdf, other

    cs.CL cs.CY

    If there's a Trigger Warning, then where's the Trigger? Investigating Trigger Warnings at the Passage Level

    Authors: Matti Wiegmann, Jennifer Rakete, Magdalena Wolska, Benno Stein, Martin Potthast

    Abstract: Trigger warnings are labels that preface documents with sensitive content if this content could be perceived as harmful by certain groups of readers. Since warnings about a document intuitively need to be shown before reading it, authors usually assign trigger warnings at the document level. What parts of their writing prompted them to assign a warning, however, remains unclear. We investigate for… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2404.06912  [pdf, other

    cs.IR

    Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders

    Authors: Ferdinand Schlatt, Maik Fröbe, Harrisen Scells, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Benno Stein, Martin Potthast, Matthias Hagen

    Abstract: Existing cross-encoder re-rankers can be categorized as pointwise, pairwise, or listwise models. Pair- and listwise models allow passage interactions, which usually makes them more effective than pointwise models but also less efficient and less robust to input order permutations. To enable efficient permutation-invariant passage interactions during re-ranking, we propose a new cross-encoder archi… ▽ More

    Submitted 16 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  5. arXiv:2403.17564  [pdf, other

    cs.CL

    Task-Oriented Paraphrase Analytics

    Authors: Marcel Gohsen, Matthias Hagen, Martin Potthast, Benno Stein

    Abstract: Since paraphrasing is an ill-defined task, the term "paraphrasing" covers text transformation tasks with different characteristics. Consequently, existing paraphrasing studies have applied quite different (explicit and implicit) criteria as to when a pair of texts is to be considered a paraphrase, all of which amount to postulating a certain level of semantic or lexical similarity. In this paper,… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  6. Detecting Generated Native Ads in Conversational Search

    Authors: Sebastian Schmidt, Ines Zelch, Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast

    Abstract: Conversational search engines such as YouChat and Microsoft Copilot use large language models (LLMs) to generate responses to queries. It is only a small step to also let the same technology insert ads within the generated responses - instead of separately placing ads next to a response. Inserted ads would be reminiscent of native advertising and product placement, both of which are very effective… ▽ More

    Submitted 30 April, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: WWW'24 Short Papers Track; 4 pages

  7. Assisted Knowledge Graph Authoring: Human-Supervised Knowledge Graph Construction from Natural Language

    Authors: Marcel Gohsen, Benno Stein

    Abstract: Encyclopedic knowledge graphs, such as Wikidata, host an extensive repository of millions of knowledge statements. However, domain-specific knowledge from fields such as history, physics, or medicine is significantly underrepresented in those graphs. Although few domain-specific knowledge graphs exist (e.g., Pubmed for medicine), developing specialized retrieval applications for many domains still… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: accepted at CHIIR 2024

  8. arXiv:2401.00366  [pdf

    cs.CL

    Argumentation in Waltz's "Emerging Structure of International Politics''

    Authors: Magdalena Wolska, Bernd Fröhlich, Katrin Girgensohn, Sassan Gholiagha, Dora Kiesel, Jürgen Neyer, Patrick Riehmann, Mitja Sienknecht, Benno Stein

    Abstract: We present an annotation scheme for argumentative and domain-specific aspects of scholarly articles on the theory of International Relations. At argumentation level we identify Claims and Support/Attack relations. At domain level we model discourse content in terms of Theory and Data-related statements. We annotate Waltz's 1993 text on structural realism and show that our scheme can be reliably ap… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: 9 pages

  9. Evaluating Generative Ad Hoc Information Retrieval

    Authors: Lukas Gienapp, Harrisen Scells, Niklas Deckers, Janek Bevendorff, Shuai Wang, Johannes Kiesel, Shahbaz Syed, Maik Fröbe, Guido Zuccon, Benno Stein, Matthias Hagen, Martin Potthast

    Abstract: Recent advances in large language models have enabled the development of viable generative retrieval systems. Instead of a traditional document ranking, generative retrieval systems often directly return a grounded generated text as a response to a query. Quantifying the utility of the textual responses is essential for appropriately evaluating such generative ad hoc retrieval. Yet, the establishe… ▽ More

    Submitted 22 May, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 14 pages, 6 figures, 1 table. Published at SIGIR'24 perspective paper track

  10. arXiv:2309.09742  [pdf, other

    cs.CV

    Drawing the Same Bounding Box Twice? Coping Noisy Annotations in Object Detection with Repeated Labels

    Authors: David Tschirschwitz, Christian Benz, Morris Florek, Henrik Norderhus, Benno Stein, Volker Rodehorst

    Abstract: The reliability of supervised machine learning systems depends on the accuracy and availability of ground truth labels. However, the process of human annotation, being prone to error, introduces the potential for noisy labels, which can impede the practicality of these systems. While training with noisy labels is a significant consideration, the reliability of test data is also crucial to ascertai… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  11. The Information Retrieval Experiment Platform

    Authors: Maik Fröbe, Jan Heinrich Reimer, Sean MacAvaney, Niklas Deckers, Simon Reich, Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast

    Abstract: We integrate ir_datasets, ir_measures, and PyTerrier with TIRA in the Information Retrieval Experiment Platform (TIREx) to promote more standardized, reproducible, scalable, and even blinded retrieval experiments. Standardization is achieved when a retrieval approach implements PyTerrier's interfaces and the input and output of an experiment are compatible with ir_datasets and ir_measures. However… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 11 pages. To be published in the proceedings of SIGIR 2023

  12. arXiv:2305.15629  [pdf, other

    cs.LG cs.AI

    Patient Outcome Predictions Improve Operations at a Large Hospital Network

    Authors: Liangyuan Na, Kimberly Villalobos Carballo, Jean Pauphilet, Ali Haddad-Sisakht, Daniel Kombert, Melissa Boisjoli-Langlois, Andrew Castiglione, Maram Khalifa, Pooja Hebbal, Barry Stein, Dimitris Bertsimas

    Abstract: Problem definition: Access to accurate predictions of patients' outcomes can enhance medical staff's decision-making, which ultimately benefits all stakeholders in the hospitals. A large hospital network in the US has been collaborating with academics and consultants to predict short-term and long-term outcomes for all inpatients across their seven hospitals. Methodology/results: We develop machin… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 41 pages, 13 figures

  13. Perspectives on Large Language Models for Relevance Judgment

    Authors: Guglielmo Faggioli, Laura Dietz, Charles Clarke, Gianluca Demartini, Matthias Hagen, Claudia Hauff, Noriko Kando, Evangelos Kanoulas, Martin Potthast, Benno Stein, Henning Wachsmuth

    Abstract: When asked, large language models (LLMs) like ChatGPT claim that they can assist with relevance judgments but it is not clear whether automated judgments can reliably be used in evaluations of retrieval systems. In this perspectives paper, we discuss possible ways for LLMs to support relevance judgments along with concerns and issues that arise. We devise a human--machine collaboration spectrum th… ▽ More

    Submitted 18 November, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    ACM Class: H.3.3

  14. arXiv:2304.01869  [pdf, other

    cs.NE cs.AI

    Deep-BIAS: Detecting Structural Bias using Explainable AI

    Authors: Bas van Stein, Diederick Vermetten, Fabio Caraffini, Anna V. Kononova

    Abstract: Evaluating the performance of heuristic optimisation algorithms is essential to determine how well they perform under various conditions. Recently, the BIAS toolbox was introduced as a behaviour benchmark to detect structural bias (SB) in search algorithms. The toolbox can be used to identify biases in existing algorithms, as well as to test for bias in newly developed algorithms. In this article,… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 9 pages

  15. arXiv:2304.01219  [pdf, other

    math.OC cs.AI cs.LG cs.NE

    DoE2Vec: Deep-learning Based Features for Exploratory Landscape Analysis

    Authors: Bas van Stein, Fu Xing Long, Moritz Frenzel, Peter Krause, Markus Gitterle, Thomas Bäck

    Abstract: We propose DoE2Vec, a variational autoencoder (VAE)-based methodology to learn optimization landscape characteristics for downstream meta-learning tasks, e.g., automated selection of optimization algorithms. Principally, using large training data sets generated with a random function generator, DoE2Vec self-learns an informative latent representation for any design of experiments (DoE). Unlike the… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

  16. The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives

    Authors: Jan Heinrich Reimer, Sebastian Schmidt, Maik Fröbe, Lukas Gienapp, Harrisen Scells, Benno Stein, Matthias Hagen, Martin Potthast

    Abstract: The Archive Query Log (AQL) is a previously unused, comprehensive query log collected at the Internet Archive over the last 25 years. Its first version includes 356 million queries, 166 million search result pages, and 1.7 billion search results across 550 search providers. Although many query logs have been studied in the literature, the search providers that own them generally do not publish the… ▽ More

    Submitted 31 July, 2023; v1 submitted 1 April, 2023; originally announced April 2023.

    Comments: SIGIR 2023 resource paper, 13 pages

  17. arXiv:2301.13771  [pdf, other

    cs.CL

    The Touché23-ValueEval Dataset for Identifying Human Values behind Arguments

    Authors: Nailia Mirzakhmedova, Johannes Kiesel, Milad Alshomary, Maximilian Heinrich, Nicolas Handke, Xiaoni Cai, Barriere Valentin, Doratossadat Dastgheib, Omid Ghahroodi, Mohammad Ali Sadraei, Ehsaneddin Asgari, Lea Kawaletz, Henning Wachsmuth, Benno Stein

    Abstract: We present the Touché23-ValueEval Dataset for Identifying Human Values behind Arguments. To investigate approaches for the automated detection of human values behind arguments, we collected 9324 arguments from 6 diverse sources, covering religious texts, political discussions, free-text arguments, newspaper editorials, and online democracy platforms. Each argument was annotated by 3 crowdworkers f… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  18. arXiv:2301.11030  [pdf, other

    cs.CL

    Paraphrase Acquisition from Image Captions

    Authors: Marcel Gohsen, Matthias Hagen, Martin Potthast, Benno Stein

    Abstract: We propose to use image captions from the Web as a previously underutilized resource for paraphrases (i.e., texts with the same "message") and to create and analyze a corresponding dataset. When an image is reused on the Web, an original caption is often assigned. We hypothesize that different captions for the same image naturally form a set of mutual paraphrases. To demonstrate the suitability of… ▽ More

    Submitted 15 February, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  19. arXiv:2301.09759  [pdf, other

    cs.CL

    Topic Ontologies for Arguments

    Authors: Yamen Ajjour, Johannes Kiesel, Benno Stein, Martin Potthast

    Abstract: Many computational argumentation tasks, like stance classification, are topic-dependent: the effectiveness of approaches to these tasks significantly depends on whether the approaches were trained on arguments from the same topics as those they are tested on. So, which are these topics that researchers train approaches on? This paper contributes the first comprehensive survey of topic coverage, as… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  20. arXiv:2212.07476  [pdf, other

    cs.IR cs.CL cs.CV

    The Infinite Index: Information Retrieval on Generative Text-To-Image Models

    Authors: Niklas Deckers, Maik Fröbe, Johannes Kiesel, Gianluca Pandolfo, Christopher Schröder, Benno Stein, Martin Potthast

    Abstract: Conditional generative models such as DALL-E and Stable Diffusion generate images based on a user-defined text, the prompt. Finding and refining prompts that produce a desired image has become the art of prompt engineering. Generative models do not provide a built-in retrieval model for a user's information need expressed through prompts. In light of an extensive literature review, we reframe prom… ▽ More

    Submitted 21 January, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: Final version for CHIIR 2023

  21. arXiv:2212.06438  [pdf, ps, other

    cs.NE

    Multi-surrogate Assisted Efficient Global Optimization for Discrete Problems

    Authors: Qi Huang, Roy de Winter, Bas van Stein, Thomas Bäck, Anna V. Kononova

    Abstract: Decades of progress in simulation-based surrogate-assisted optimization and unprecedented growth in computational power have enabled researchers and practitioners to optimize previously intractable complex engineering problems. This paper investigates the possible benefit of a concurrent utilization of multiple simulation-based surrogate models to solve complex discrete optimization problems. To f… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

  22. arXiv:2211.16318  [pdf, other

    cs.NE

    BBOB Instance Analysis: Landscape Properties and Algorithm Performance across Problem Instances

    Authors: Fu Xing Long, Diederick Vermetten, Bas van Stein, Anna V. Kononova

    Abstract: Benchmarking is a key aspect of research into optimization algorithms, and as such the way in which the most popular benchmark suites are designed implicitly guides some parts of algorithm design. One of these suites is the black-box optimization benchmarking (BBOB) suite of 24 single-objective noiseless functions, which has been a standard for over a decade. Within this problem suite, different i… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  23. arXiv:2211.02477  [pdf, other

    cs.CL cs.DL

    SMAuC -- The Scientific Multi-Authorship Corpus

    Authors: Janek Bevendorff, Philipp Sauer, Lukas Gienapp, Wolfgang Kircheis, Erik Körner, Benno Stein, Martin Potthast

    Abstract: The rapidly growing volume of scientific publications offers an interesting challenge for research on methods for analyzing the authorship of documents with one or more authors. However, most existing datasets lack scientific documents or the necessary metadata for constructing new experiments and test cases. We introduce SMAuC, a comprehensive, metadata-rich corpus tailored to scientific authorsh… ▽ More

    Submitted 10 May, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

  24. arXiv:2210.06970  [pdf, other

    cs.CL cs.IR

    Differential Bias: On the Perceptibility of Stance Imbalance in Argumentation

    Authors: Alonso Palomino, Martin Potthast, Khalid Al-Khatib, Benno Stein

    Abstract: Most research on natural language processing treats bias as an absolute concept: Based on a (probably complex) algorithmic analysis, a sentence, an article, or a text is classified as biased or not. Given the fact that for humans the question of whether a text is biased can be difficult to answer or is answered contradictory, we ask whether an "absolute bias classification" is a promising goal at… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted at AACL-IJCNLP 2022, Findings Volume

  25. arXiv:2209.10178  [pdf, other

    cs.LG

    Deep Learning based pipeline for anomaly detection and quality enhancement in industrial binder jetting processes

    Authors: Alexander Zeiser, Bas van Stein, Thomas Bäck

    Abstract: Anomaly detection describes methods of finding abnormal states, instances or data points that differ from a normal value space. Industrial processes are a domain where predicitve models are needed for finding anomalous data instances for quality enhancement. A main challenge, however, is absence of labels in this environment. This paper contributes to a data-centric way of approaching artificial i… ▽ More

    Submitted 23 September, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: Conference paper for: 17. Fachtagung "Entwurf komplexer Automatisierungssysteme (EKA)", Magdeburg/Germany, June 2022

  26. arXiv:2209.04409  [pdf, other

    cs.CL

    Trigger Warnings: Bootstrapping a Violence Detector for FanFiction

    Authors: Magdalena Wolska, Christopher Schröder, Ole Borchardt, Benno Stein, Martin Potthast

    Abstract: We present the first dataset and evaluation results on a newly defined computational task of trigger warning assignment. Labeled corpus data has been compiled from narrative works hosted on Archive of Our Own (AO3), a well-known fanfiction site. In this paper, we focus on the most frequently assigned trigger type--violence--and define a document-level binary classification task of whether or not t… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: 5 pages

  27. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  28. arXiv:2203.13108  [pdf, other

    cs.LG cs.AI cs.NE cs.SC

    Explainable Artificial Intelligence for Exhaust Gas Temperature of Turbofan Engines

    Authors: Marios Kefalas, Juan de Santiago Rojo Jr., Asteris Apostolidis, Dirk van den Herik, Bas van Stein, Thomas Bäck

    Abstract: Data-driven modeling is an imperative tool in various industrial applications, including many applications in the sectors of aeronautics and commercial aviation. These models are in charge of providing key insights, such as which parameters are important on a specific measured outcome or which parameter values we should expect to observe given a set of input parameters. At the same time, however,… ▽ More

    Submitted 25 March, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Main paper: 20 pages, 4 figures. Supplemental material: 18 pages, 30 figures. Published; Removed footnote on page 11 of the main article regarding a typo on formula (8) on the Journal version of this work. The Journal has corrected this typo since, therefore, there is no need for the footnote

  29. arXiv:2201.06594  [pdf, other

    cs.CV cs.LG

    Using Machine Learning to Detect Rotational Symmetries from Reflectional Symmetries in 2D Images

    Authors: Koen Ponse, Anna V. Kononova, Maria Loleyt, Bas van Stein

    Abstract: Automated symmetry detection is still a difficult task in 2021. However, it has applications in computer vision, and it also plays an important part in understanding art. This paper focuses on aiding the latter by comparing different state-of-the-art automated symmetry detection algorithms. For one of such algorithms aimed at reflectional symmetries, we propose post-processing improvements to find… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: 8 pages, 12 figures

  30. arXiv:2112.11800  [pdf, other

    cs.DL cs.CL cs.IR

    STEREO: Scientific Text Reuse in Open Access Publications

    Authors: Lukas Gienapp, Wolfgang Kircheis, Bjarne Sievers, Benno Stein, Martin Potthast

    Abstract: We present the Webis-STEREO-21 dataset, a massive collection of Scientific Text Reuse in Open-access publications. It contains more than 91 million cases of reused text passages found in 4.2 million unique open-access publications. Featuring a high coverage of scientific disciplines and varieties of reuse, as well as comprehensive metadata to contextualize each case, our dataset addresses the most… ▽ More

    Submitted 13 December, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

    Comments: 14 pages, 3 figures, 4 tables

  31. arXiv:2112.03103  [pdf, other

    cs.IR

    FastWARC: Optimizing Large-Scale Web Archive Analytics

    Authors: Janek Bevendorff, Martin Potthast, Benno Stein

    Abstract: Web search and other large-scale web data analytics rely on processing archives of web pages stored in a standardized and efficient format. Since its introduction in 2008, the IIPC's Web ARCive (WARC) format has become the standard format for this purpose. As a list of individually compressed records of HTTP requests and responses, it allows for constant-time random access to all kinds of web data… ▽ More

    Submitted 22 November, 2021; originally announced December 2021.

    Journal ref: OSSYM 2021 - 3rd International Open Search Symposium

  32. arXiv:2111.10864  [pdf, other

    cs.IR

    The Impact of Main Content Extraction on Near-Duplicate Detection

    Authors: Maik Fröbe, Matthias Hagen, Janek Bevendorff, Michael Völske, Benno Stein, Christopher Schröder, Robby Wagner, Lukas Gienapp, Martin Potthast

    Abstract: Commercial web search engines employ near-duplicate detection to ensure that users see each relevant result only once, albeit the underlying web crawls typically include (near-)duplicates of many web pages. We revisit the risks and potential of near-duplicates with an information retrieval focus, motivating that current efforts toward an open and independent European web search infrastructure shou… ▽ More

    Submitted 21 November, 2021; originally announced November 2021.

  33. arXiv:2109.04957  [pdf, other

    cs.CL

    Controlled Neural Sentence-Level Reframing of News Articles

    Authors: Wei-Fan Chen, Khalid Al-Khatib, Benno Stein, Henning Wachsmuth

    Abstract: Framing a news article means to portray the reported event from a specific perspective, e.g., from an economic or a health perspective. Reframing means to change this perspective. Depending on the audience or the submessage, reframing can become necessary to achieve the desired effect on the readers. Reframing is related to adapting style and sentiment, which can be tackled with neural text genera… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Journal ref: EMNLP 2021 Findings

  34. arXiv:2107.09615  [pdf

    cs.HC cs.CY

    Readability Research: An Interdisciplinary Approach

    Authors: Sofie Beier, Sam Berlow, Esat Boucaud, Zoya Bylinskii, Tianyuan Cai, Jenae Cohn, Kathy Crowley, Stephanie L. Day, Tilman Dingler, Jonathan Dobres, Jennifer Healey, Rajiv Jain, Marjorie Jordan, Bernard Kerr, Qisheng Li, Dave B. Miller, Susanne Nobles, Alexandra Papoutsaki, Jing Qian, Tina Rezvanian, Shelley Rodrigo, Ben D. Sawyer, Shannon M. Sheppard, Bram Stein, Rick Treitman , et al. (3 additional authors not shown)

    Abstract: Readability is on the cusp of a revolution. Fixed text is becoming fluid as a proliferation of digital reading devices rewrite what a document can do. As past constraints make way for more flexible opportunities, there is great need to understand how reading formats can be tuned to the situation and the individual. We aim to provide a firm foundation for readability research, a comprehensive frame… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: This paper was generated collaboratively over the course of a series of online workshops, the results of which were extensively edited by Dr. Zoya Bylinskii, Dr. Ben D. Sawyer, and Dr. Benjamin Wolfe. Original illustrations by Bernard Kerr. Corresponding Author: Dr. Ben D. Sawyer

  35. arXiv:2107.00893  [pdf, other

    cs.DL cs.NI cs.SI

    Web Archive Analytics

    Authors: Michael Völske, Janek Bevendorff, Johannes Kiesel, Benno Stein, Maik Fröbe, Matthias Hagen, Martin Potthast

    Abstract: Web archive analytics is the exploitation of publicly accessible web pages and their evolution for research purposes -- to the extent organizationally possible for researchers. In order to better understand the complexity of this task, the first part of this paper puts the entirety of the world's captured, created, and replicated data (the "Global Datasphere") in relation to other important data s… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 12 pages, 5 figures. Published in the proceedings of INFORMATIK 2020

    Journal ref: INFORMATIK 2020. Gesellschaft für Informatik, Bonn. (pp. 61-72)

  36. Towards Axiomatic Explanations for Neural Ranking Models

    Authors: Michael Völske, Alexander Bondarenko, Maik Fröbe, Matthias Hagen, Benno Stein, Jaspreet Singh, Avishek Anand

    Abstract: Recently, neural networks have been successfully employed to improve upon state-of-the-art performance in ad-hoc retrieval tasks via machine-learned ranking functions. While neural retrieval models grow in complexity and impact, little is understood about their correspondence with well-studied IR principles. Recent work on interpretability in machine learning has provided tools and techniques to u… ▽ More

    Submitted 11 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 10 pages, 2 figures. Published in the proceedings of ICTIR 2021

  37. Emergence of Structural Bias in Differential Evolution

    Authors: Bas van Stein, Fabio Caraffini, Anna V. Kononova

    Abstract: Heuristic optimisation algorithms are in high demand due to the overwhelming amount of complex optimisation problems that need to be solved. The complexity of these problems is well beyond the boundaries of applicability of exact optimisation algorithms and therefore require modern heuristics to find feasible solutions quickly. These heuristics and their effects are almost always evaluated and exp… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

  38. Demanded Abstract Interpretation (Extended Version)

    Authors: Benno Stein, Bor-Yuh Evan Chang, Manu Sridharan

    Abstract: We consider the problem of making expressive static analyzers interactive. Formal static analysis is seeing increasingly widespread adoption as a tool for verification and bug-finding, but even with powerful cloud infrastructure it can take minutes or hours to get batch analysis results after a code change. While existing techniques offer some demand-driven or incremental aspects for certain class… ▽ More

    Submitted 6 April, 2021; v1 submitted 2 April, 2021; originally announced April 2021.

    Comments: extended version of PLDI'21 paper (with appendices)

  39. arXiv:2011.00521  [pdf, other

    cs.LG cs.NE

    Neural Network Design: Learning from Neural Architecture Search

    Authors: Bas van Stein, Hao Wang, Thomas Bäck

    Abstract: Neural Architecture Search (NAS) aims to optimize deep neural networks' architecture for better accuracy or smaller computational cost and has recently gained more research interests. Despite various successful approaches proposed to solve the NAS task, the landscape of it, along with its properties, are rarely investigated. In this paper, we argue for the necessity of studying the landscape prope… ▽ More

    Submitted 1 November, 2020; originally announced November 2020.

  40. arXiv:2010.10652  [pdf, other

    cs.CL

    Analyzing Political Bias and Unfairness in News Articles at Different Levels of Granularity

    Authors: Wei-Fan Chen, Khalid Al-Khatib, Henning Wachsmuth, Benno Stein

    Abstract: Media organizations bear great reponsibility because of their considerable influence on shaping beliefs and positions of our society. Any form of media can contain overly biased content, e.g., by reporting on political events in a selective or incomplete manner. A relevant question hence is whether and how such form of imbalanced news coverage can be exposed. The research presented in this paper a… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Journal ref: NLP+CSS 2020

  41. arXiv:2010.10649  [pdf, other

    cs.CL

    Detecting Media Bias in News Articles using Gaussian Bias Distributions

    Authors: Wei-Fan Chen, Khalid Al-Khatib, Benno Stein, Henning Wachsmuth

    Abstract: Media plays an important role in shaping public opinion. Biased media can influence people in undesirable directions and hence should be unmasked as such. We observe that featurebased and neural text classification approaches which rely only on the distribution of low-level lexical information fail to detect media bias. This weakness becomes most noticeable for articles on new events, where words… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Journal ref: EMNLP 2020 Findings

  42. arXiv:2005.14714  [pdf, other

    cs.CL

    The Importance of Suppressing Domain Style in Authorship Analysis

    Authors: Sebastian Bischoff, Niklas Deckers, Marcel Schliebs, Ben Thies, Matthias Hagen, Efstathios Stamatatos, Benno Stein, Martin Potthast

    Abstract: The prerequisite of many approaches to authorship analysis is a representation of writing style. But despite decades of research, it still remains unclear to what extent commonly used and widely accepted representations like character trigram frequencies actually represent an author's writing style, in contrast to more domain-specific style components or even topic. We address this shortcoming for… ▽ More

    Submitted 29 May, 2020; originally announced May 2020.

  43. arXiv:2005.08658  [pdf, other

    cs.IR cs.CL cs.HC

    Conversational Search -- A Report from Dagstuhl Seminar 19461

    Authors: Avishek Anand, Lawrence Cavedon, Matthias Hagen, Hideo Joho, Mark Sanderson, Benno Stein

    Abstract: Dagstuhl Seminar 19461 "Conversational Search" was held on 10-15 November 2019. 44~researchers in Information Retrieval and Web Search, Natural Language Processing, Human Computer Interaction, and Dialogue Systems were invited to share the latest development in the area of Conversational Search and discuss its research agenda and future directions. A 5-day program of the seminar consisted of six i… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: contains arXiv:2001.06910, arXiv:2001.02912

  44. arXiv:2004.06564  [pdf, other

    cs.NE cs.AI

    A Tailored NSGA-III Instantiation for Flexible Job Shop Scheduling

    Authors: Yali Wang, Bas van Stein, Michael T. M. Emmerich, Thomas Bäck

    Abstract: A customized multi-objective evolutionary algorithm (MOEA) is proposed for the multi-objective flexible job shop scheduling problem (FJSP). It uses smart initialization approaches to enrich the first generated population, and proposes various crossover operators to create a better diversity of offspring. Especially, the MIP-EGO configurator, which can tune algorithm parameters, is adopted to autom… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

  45. Abstractive Snippet Generation

    Authors: Wei-Fan Chen, Shahbaz Syed, Benno Stein, Matthias Hagen, Martin Potthast

    Abstract: An abstractive snippet is an originally created piece of text to summarize a web page on a search engine results page. Compared to the conventional extractive snippets, which are generated by extracting phrases and sentences verbatim from a web page, abstractive snippets circumvent copyright issues; even more interesting is the fact that they open the door for personalization. Abstractive snippets… ▽ More

    Submitted 15 March, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Accepted by WWW 2020

  46. arXiv:1812.10847  [pdf, other

    cs.CL cs.IR

    The Clickbait Challenge 2017: Towards a Regression Model for Clickbait Strength

    Authors: Martin Potthast, Tim Gollub, Matthias Hagen, Benno Stein

    Abstract: Clickbait has grown to become a nuisance to social media users and social media operators alike. Malicious content publishers misuse social media to manipulate as many users as possible to visit their websites using clickbait messages. Machine learning technology may help to handle this problem, giving rise to automatic clickbait detection. To accelerate progress in this direction, we organized th… ▽ More

    Submitted 27 December, 2018; originally announced December 2018.

  47. arXiv:1812.09221  [pdf, other

    cs.IR

    Wikipedia Text Reuse: Within and Without

    Authors: Milad Alshomary, Michael Völske, Tristan Licht, Henning Wachsmuth, Benno Stein, Matthias Hagen, Martin Potthast

    Abstract: We study text reuse related to Wikipedia at scale by compiling the first corpus of text reuse cases within Wikipedia as well as without (i.e., reuse of Wikipedia text in a sample of the Common Crawl). To discover reuse beyond verbatim copy and paste, we employ state-of-the-art text reuse detection technology, scaling it for the first time to process the entire Wikipedia as part of a distributed re… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: accepted at ECIR 2019

  48. arXiv:1812.08562  [pdf, other

    cs.IT

    Efficient Error-Correcting Codes in the Short Blocklength Regime

    Authors: Mustafa Cemil Coşkun, Giuseppe Durisi, Thomas Jerkovits, Gianluigi Liva, William Ryan, Brian Stein, Fabian Steiner

    Abstract: The design of block codes for short information blocks (e.g., a thousand or less information bits) is an open research problem that is gaining relevance thanks to emerging applications in wireless communication networks. In this paper, we review some of the most promising code constructions targeting the short block regime, and we compare them with both finite-length performance bounds and classic… ▽ More

    Submitted 10 March, 2019; v1 submitted 20 December, 2018; originally announced December 2018.

    Comments: Preprint submitted to Physical Communication; corrected typos; added references; extended discussion on list decoding of polar codes

  49. arXiv:1810.05526  [pdf, other

    cs.LG cs.NE stat.ML

    Automatic Configuration of Deep Neural Networks with EGO

    Authors: Bas van Stein, Hao Wang, Thomas Bäck

    Abstract: Designing the architecture for an artificial neural network is a cumbersome task because of the numerous parameters to configure, including activation functions, layer types, and hyper-parameters. With the large number of parameters for most networks nowadays, it is intractable to find a good configuration for a given task by hand. In this paper an Efficient Global Optimization (EGO) algorithm is… ▽ More

    Submitted 10 October, 2018; originally announced October 2018.

  50. Safe Stream-Based Programming with Refinement Types

    Authors: Benno Stein, Lazaro Clapp, Manu Sridharan, Bor-Yuh Evan Chang

    Abstract: In stream-based programming, data sources are abstracted as a stream of values that can be manipulated via callback functions. Stream-based programming is exploding in popularity, as it provides a powerful and expressive paradigm for handling asynchronous data sources in interactive software. However, high-level stream abstractions can also make it difficult for developers to reason about control-… ▽ More

    Submitted 8 August, 2018; originally announced August 2018.

    Journal ref: Proceedings of the 2018 33rd ACM/IEEE International Conference on Automated Software Engineering