Showing 1–18 of 18 results for author: Markert, K

  1. arXiv:2309.08047  [pdf, other]

    cs.CL

    Bias in News Summarization: Measures, Pitfalls and Corpora

    Authors: Julius Steen, Katja Markert

    Abstract: Summarization is an important application of large language models (LLMs). Most previous evaluation of summarization models has focused on their content selection, faithfulness, grammaticality and coherence. However, it is well known that LLMs can reproduce and reinforce harmful social biases. This raises the question: Do biases affect model outputs in a constrained setting like summarization? To…

    Submitted 6 June, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Findings of ACL 24 Camera Ready

  2. arXiv:2306.04523  [pdf, ps, other]

    cs.CL

    Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairs

    Authors: Ines Reinig, Katja Markert

    Abstract: Compared to English, German word order is freer and therefore poses additional challenges for natural language inference (NLI). We create WOGLI (Word Order in German Language Inference), the first adversarial NLI dataset for German word order that has the following properties: (i) each premise has an entailed and a non-entailed hypothesis; (ii) premise and hypotheses differ only in word order and…

    Submitted 7 June, 2023; originally announced June 2023.

  3. arXiv:2305.16819  [pdf, other]

    cs.CL

    With a Little Push, NLI Models can Robustly and Efficiently Predict Faithfulness

    Authors: Julius Steen, Juri Opitz, Anette Frank, Katja Markert

    Abstract: Conditional language models still generate unfaithful output that is not supported by their input. These unfaithful generations jeopardize trust in real-world applications such as summarization or human-machine interaction, motivating a need for automatic faithfulness metrics. To implement such metrics, NLI models seem attractive, since they solve a strongly related task that comes with a wealth o…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL 2023 (short paper)

  4. arXiv:2304.01621  [pdf, other]

    cs.CL

    SimCSum: Joint Learning of Simplification and Cross-lingual Summarization for Cross-lingual Science Journalism

    Authors: Mehwish Fatima, Tim Kolber, Katja Markert, Michael Strube

    Abstract: Cross-lingual science journalism generates popular science stories of scientific articles different from the source language for a non-expert audience. Hence, a cross-lingual popular summary must contain the salient content of the input document, and the content should be coherent, comprehensible, and in a local language for the targeted audience. We improve these aspects of cross-lingual summary…

    Submitted 4 April, 2023; originally announced April 2023.

  5. arXiv:2209.06517  [pdf, other]

    cs.CL

    How to Find Strong Summary Coherence Measures? A Toolbox and a Comparative Study for Summary Coherence Measure Evaluation

    Authors: Julius Steen, Katja Markert

    Abstract: Automatically evaluating the coherence of summaries is of great significance both to enable cost-efficient summarizer evaluation and as a tool for improving coherence by selecting high-scoring candidate summaries. While many different approaches have been suggested to model summary coherence, they are often evaluated using disparate datasets and metrics. This makes it difficult to understand their…

    Submitted 15 September, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted at COLING2022. Edited to correct differences to COLING version caused by arxiv package versions

  6. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  7. arXiv:2202.00673  [pdf, other]

    cs.LG cs.AI cs.CL cs.HC cs.SD eess.AS

    Visualizing Automatic Speech Recognition -- Means for a Better Understanding?

    Authors: Karla Markert, Romain Parracone, Mykhailo Kulakov, Philip Sperl, Ching-Yu Kao, Konstantin Böttinger

    Abstract: Automatic speech recognition (ASR) is improving ever more at mimicking human speech processing. The functioning of ASR, however, remains to a large extent obfuscated by the complex structure of the deep neural networks (DNNs) they are based on. In this paper, we show how so-called attribution methods, that we import from image recognition and suitably adapt to handle audio data, can help to clarif…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Proc. 2021 ISCA Symposium on Security and Privacy in Speech Communication

  8. arXiv:2202.00399  [pdf, ps, other]

    cs.CL cs.CR cs.SD eess.AS

    Language Dependencies in Adversarial Attacks on Speech Recognition Systems

    Authors: Karla Markert, Donika Mirdita, Konstantin Böttinger

    Abstract: Automatic speech recognition (ASR) systems are ubiquitously present in our daily devices. They are vulnerable to adversarial attacks, where manipulated input samples fool the ASR system's recognition. While adversarial examples for various English ASR systems have already been analyzed, there exists no inter-language comparative vulnerability analysis. We compare the attackability of a German and…

    Submitted 2 February, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Journal ref: Proc. 2021 ISCA Symposium on Security and Privacy in Speech Communication

  9. arXiv:2101.11298  [pdf, other]

    cs.CL

    How to Evaluate a Summarizer: Study Design and Statistical Analysis for Manual Linguistic Quality Evaluation

    Authors: Julius Steen, Katja Markert

    Abstract: Manual evaluation is essential to judge progress on automatic text summarization. However, we conduct a survey on recent summarization system papers that reveals little agreement on how to perform such evaluation studies. We conduct two evaluation experiments on two aspects of summaries' linguistic quality (coherence and repetitiveness) to compare Likert-type and ranking annotations and show that…

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted at EACL 2021

  10. arXiv:2012.02015  [pdf, other]

    cs.CL

    Context in Informational Bias Detection

    Authors: Esther van den Berg, Katja Markert

    Abstract: Informational bias is bias conveyed through sentences or clauses that provide tangential, speculative or background information that can sway readers' opinions towards entities. By nature, informational bias is context-dependent, but previous work on informational bias detection has not explored the role of context beyond the sentence. In this paper, we explore four kinds of context for informatio…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted to COLING'2020

  11. arXiv:2010.07190  [pdf, other]

    cs.SD cs.CR cs.LG eess.AS

    Towards Resistant Audio Adversarial Examples

    Authors: Tom Dörr, Karla Markert, Nicolas M. Müller, Konstantin Böttinger

    Abstract: Adversarial examples tremendously threaten the availability and integrity of machine learning-based systems. While the feasibility of such attacks has been observed first in the domain of image processing, recent research shows that speech recognition is also susceptible to adversarial attacks. However, reliably bridging the air gap (i.e., making the adversarial examples work when recorded via a m…

    Submitted 14 October, 2020; originally announced October 2020.

    ACM Class: I.2

    Journal ref: SPAI 20: Proceedings of the 1st ACM Workshop on Security and Privacy on Artificial Intelligence, October 2020, Pages 3-10

  12. arXiv:2005.01791  [pdf, other]

    cs.CL

    Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction

    Authors: Raphael Schumann, Lili Mou, Yao Lu, Olga Vechtomova, Katja Markert

    Abstract: Automatic sentence summarization produces a shorter version of a sentence, while preserving its most important information. A good summary is characterized by language fluency and high information overlap with the source sentence. We model these two aspects in an unsupervised objective function, consisting of language modeling and semantic similarity metrics. We search for a high-scoring summary b…

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: Accepted at ACL 2020

  13. Identifying Mislabeled Instances in Classification Datasets

    Authors: Nicolas Michael Müller, Karla Markert

    Abstract: A key requirement for supervised machine learning is labeled training data, which is created by annotating unlabeled data with the appropriate class. Because this process can in many cases not be done by machines, labeling needs to be performed by human domain experts. This process tends to be expensive both in time and money, and is prone to errors. Additionally, reviewing an entire labeled datas…

    Submitted 11 December, 2019; originally announced December 2019.

    Journal ref: 2019 International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 2019

  14. arXiv:1810.07949  [pdf, ps, other]

    cs.CL

    A Temporally Sensitive Submodularity Framework for Timeline Summarization

    Authors: Sebastian Martschat, Katja Markert

    Abstract: Timeline summarization (TLS) creates an overview of long-running events via dated daily summaries for the most important dates. TLS differs from standard multi-document summarization (MDS) in the importance of date selection, interdependencies between summaries of different dates and by having very short summaries compared to the number of corpus documents. However, we show that MDS optimization m…

    Submitted 18 October, 2018; originally announced October 2018.

    Comments: To appear at CoNLL 2018

  15. arXiv:1707.07278  [pdf, other]

    cs.CL

    Fine Grained Citation Span for References in Wikipedia

    Authors: Besnik Fetahu, Katja Markert, Avishek Anand

    Abstract: Verifiability is one of the core editing principles in Wikipedia, editors being encouraged to provide citations for the added content. For a Wikipedia article, determining the citation span of a citation, i.e. what content is covered by a citation, is important as it helps decide for which content citations are still missing. We are the first to address the problem of determining t…

    Submitted 23 July, 2017; originally announced July 2017.

  16. arXiv:1703.10344  [pdf, other]

    cs.IR cs.CL cs.SI

    Automated News Suggestions for Populating Wikipedia Entity Pages

    Authors: Besnik Fetahu, Katja Markert, Avishek Anand

    Abstract: Wikipedia entity pages are a valuable source of information for direct consumption and for knowledge-base construction, update and maintenance. Facts in these entity pages are typically supported by references. Recent studies show that as much as 20% of the references are from online news sources. However, many entity pages are incomplete even if relevant information is already available in exist…

    Submitted 30 March, 2017; originally announced March 2017.

  17. arXiv:1703.10339  [pdf, other]

    cs.IR cs.CL cs.SI

    Finding News Citations for Wikipedia

    Authors: Besnik Fetahu, Katja Markert, Wolfgang Nejdl, Avishek Anand

    Abstract: An important editing policy in Wikipedia is to provide citations for added statements in Wikipedia pages, where statements can be arbitrary pieces of text, ranging from a sentence to a paragraph. In many cases citations are either outdated or missing altogether. In this work we address the problem of finding and updating news citations for statements in entity pages. We propose a two-stage super…

    Submitted 24 April, 2017; v1 submitted 30 March, 2017; originally announced March 2017.

  18. arXiv:cmp-lg/9605025  [pdf, ps]

    cs.CL

    A Conceptual Reasoning Approach to Textual Ellipsis

    Authors: Udo Hahn, Katja Markert, Michael Strube

    Abstract: We present a hybrid text understanding methodology for the resolution of textual ellipsis. It integrates conceptual criteria (based on the well-formedness and conceptual strength of role chains in a terminological knowledge base) and functional constraints reflecting the utterances' information structure (based on the distinction between context-bound and unbound discourse elements). The methodo…

    Submitted 15 May, 1996; originally announced May 1996.

    Comments: 5 pages, uuencoded gzipped PS file (see also Technical Report at: http://www.coling.uni-freiburg.de/public/papers/ecai96.ps.gz)

    Journal ref: ECAI '96: Proc. of 12th European Conference on Artificial Intelligence. Budapest, Aug 12-16 1996, pp.572-576