Showing 1–30 of 30 results for author: Kiritchenko, S

  1. arXiv:2406.15583  [pdf, other]

    cs.CL cs.CY

    Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

    Authors: Kathleen C. Fraser, Hillary Dawkins, Svetlana Kiritchenko

    Abstract: Large language models (LLMs) have advanced to a point that even humans have difficulty discerning whether a text was generated by another human, or by a computer. However, knowing whether a text was produced by human or artificial intelligence (AI) is important to determining its trustworthiness, and has applications in many domains including detecting fraud and academic dishonesty, as well as com…

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.20152  [pdf, other]

    cs.CV

    Uncovering Bias in Large Vision-Language Models at Scale with Counterfactuals

    Authors: Phillip Howard, Kathleen C. Fraser, Anahita Bhiwandiwalla, Svetlana Kiritchenko

    Abstract: With the advent of Large Language Models (LLMs) possessing increasingly impressive capabilities, a number of Large Vision-Language Models (LVLMs) have been proposed to augment LLMs with visual inputs. Such models condition generated text on both an input image and a text prompt, enabling a variety of use cases such as visual question answering and multimodal chat. While prior studies have examined…

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2404.11845  [pdf, other]

    cs.CL cs.CY

    Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes

    Authors: Isar Nejadgholi, Kathleen C. Fraser, Anna Kerkhof, Svetlana Kiritchenko

    Abstract: Gender stereotypes are pervasive beliefs about individuals based on their gender that play a significant role in shaping societal attitudes, behaviours, and even opportunities. Recognizing the negative implications of gender stereotypes, particularly in online communications, this study investigates eleven strategies to automatically counter-act and challenge these views. We present AI-generated g…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: LREC-COLING2024

  4. arXiv:2404.00166  [pdf, other]

    cs.CV cs.AI

    Uncovering Bias in Large Vision-Language Models with Counterfactuals

    Authors: Phillip Howard, Anahita Bhiwandiwalla, Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: With the advent of Large Language Models (LLMs) possessing increasingly impressive capabilities, a number of Large Vision-Language Models (LVLMs) have been proposed to augment LLMs with visual inputs. Such models condition generated text on both an input image and a text prompt, enabling a variety of use cases such as visual question answering and multimodal chat. While prior studies have examined…

    Submitted 7 June, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: Accepted to the CVPR 2024 Responsible Generative AI (ReGenAI) Workshop

  5. arXiv:2402.05779  [pdf, other]

    cs.CY cs.CL cs.CV

    Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images

    Authors: Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: Following on recent advances in large language models (LLMs) and subsequent chat models, a new wave of large vision-language models (LVLMs) has emerged. Such models can incorporate images as input in addition to text, and perform tasks such as visual question answering, image captioning, story generation, etc. Here, we examine potential gender and racial biases in such systems, based on the percei…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: To appear at EACL 2024

  6. arXiv:2307.01900  [pdf, other]

    cs.CL cs.AI

    Concept-Based Explanations to Test for False Causal Relationships Learned by Abusive Language Classifiers

    Authors: Isar Nejadgholi, Svetlana Kiritchenko, Kathleen C. Fraser, Esma Balkır

    Abstract: Classifiers tend to learn a false causal relationship between an over-represented concept and a label, which can result in over-reliance on the concept and compromised classification accuracy. It is imperative to have methods in place that can compare different models and identify over-reliances on specific concepts. We consider three well-known abusive language classifiers trained on large Englis…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Published at WOAH2023 co-located with ACL2023

  7. arXiv:2303.14128  [pdf, other]

    cs.CL

    The crime of being poor

    Authors: Georgina Curto, Svetlana Kiritchenko, Isar Nejadgholi, Kathleen C. Fraser

    Abstract: The criminalization of poverty has been widely denounced as a collective bias against the most vulnerable. NGOs and international organizations claim that the poor are blamed for their situation, are more often associated with criminal offenses than the wealthy strata of society and even incur criminal offenses simply as a result of being poor. While no evidence has been found in the literature th…

    Submitted 24 March, 2023; originally announced March 2023.

  8. arXiv:2302.07159  [pdf, other]

    cs.CY cs.CL

    A Friendly Face: Do Text-to-Image Systems Rely on Stereotypes when the Input is Under-Specified?

    Authors: Kathleen C. Fraser, Svetlana Kiritchenko, Isar Nejadgholi

    Abstract: As text-to-image systems continue to grow in popularity with the general public, questions have arisen about bias and diversity in the generated images. Here, we investigate properties of images generated in response to prompts which are visually under-specified, but contain salient social attributes (e.g., 'a portrait of a threatening person' versus 'a portrait of a friendly person'). Grounding o…

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: Appearing in the AAAI 2023 Workshop on Creative AI Across Modalities

  9. arXiv:2210.10689  [pdf, other]

    cs.CL

    Towards Procedural Fairness: Uncovering Biases in How a Toxic Language Classifier Uses Sentiment Information

    Authors: Isar Nejadgholi, Esma Balkır, Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: Previous works on the fairness of toxic language classifiers compare the output of models with different identity terms as input features but do not consider the impact of other important concepts present in the context. Here, besides identity terms, we take into account high-level latent features learned by the classifier and investigate the interaction between these features and identity terms.…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 13 pages, 2 figures, accepted at the fifth edition of BlackBoxNLP collocated with EMNLP2022

  10. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  11. arXiv:2206.03945  [pdf, other]

    cs.CL

    Challenges in Applying Explainability Methods to Improve the Fairness of NLP Models

    Authors: Esma Balkir, Svetlana Kiritchenko, Isar Nejadgholi, Kathleen C. Fraser

    Abstract: Motivations for methods in explainable artificial intelligence (XAI) often include detecting, quantifying and mitigating bias, and contributing to making machine learning models fairer. However, exactly how an XAI method can help in combating biases is often left unspecified. In this paper, we briefly review trends in explainability and fairness in NLP research, identify the current practices in w…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: TrustNLP Workshop at NAACL 2022

  12. arXiv:2205.12771  [pdf, other]

    cs.CY cs.CL

    Does Moral Code Have a Moral Code? Probing Delphi's Moral Philosophy

    Authors: Kathleen C. Fraser, Svetlana Kiritchenko, Esma Balkir

    Abstract: In an effort to guarantee that machine learning model outputs conform with human moral values, recent work has begun exploring the possibility of explicitly training models to learn the difference between right and wrong. This is typically done in a bottom-up fashion, by exposing the model to different scenarios, annotated with human moral judgements. One question, however, is whether the trained…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: To appear at TrustNLP Workshop @ NAACL 2022

  13. arXiv:2205.03302  [pdf, other]

    cs.CL

    Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection

    Authors: Esma Balkir, Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: We present a novel feature attribution method for explaining text classifiers, and analyze it in the context of hate speech detection. Although feature attribution models usually provide a single importance score for each token, we instead provide two complementary and theoretically-grounded scores -- necessity and sufficiency -- resulting in more informative explanations. We propose a transparent…

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  14. arXiv:2204.02261  [pdf, other]

    cs.CL cs.LG

    Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors

    Authors: Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko

    Abstract: Robustness of machine learning models on ever-changing real-world data is critical, especially for applications affecting human well-being such as content moderation. New kinds of abusive language continually emerge in online discussions in response to current events (e.g., COVID-19), and the deployed abuse detection systems should be updated regularly to remain accurate. In this paper, we show th…

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: accepted to be published at ACL2022

  15. arXiv:2106.02596  [pdf, other]

    cs.CY cs.AI cs.CL

    Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model

    Authors: Kathleen C. Fraser, Isar Nejadgholi, Svetlana Kiritchenko

    Abstract: Stereotypical language expresses widely-held beliefs about different social categories. Many stereotypes are overtly negative, while others may appear positive on the surface, but still lead to negative consequences. In this work, we present a computational approach to interpreting stereotypes in text through the Stereotype Content Model (SCM), a comprehensive causal theory from social psychology.…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: In Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)

  16. arXiv:2012.12305  [pdf, other]

    cs.CL cs.AI cs.CY

    Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective

    Authors: Svetlana Kiritchenko, Isar Nejadgholi, Kathleen C. Fraser

    Abstract: The pervasiveness of abusive content on the internet can lead to severe psychological and physical harm. Significant effort in Natural Language Processing (NLP) research has been devoted to addressing this problem through abusive content detection and related sub-areas, such as the detection of hate speech, toxicity, cyberbullying, etc. Although current technologies achieve high classification per…

    Submitted 22 July, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: published in Journal of Artificial Intelligence Research, 71: 431-478, July 2021

  17. arXiv:2010.14952  [pdf, other]

    cs.CL

    Towards Ethics by Design in Online Abusive Content Detection

    Authors: Svetlana Kiritchenko, Isar Nejadgholi

    Abstract: To support safety and inclusion in online communications, significant efforts in NLP research have been put towards addressing the problem of abusive content detection, commonly defined as a supervised classification task. The research effort has spread out across several closely related sub-areas, such as detection of hate speech, toxicity, cyberbullying, etc. There is a pressing need to consolid…

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: 14 pages, 2 figures

  18. arXiv:2010.07414  [pdf, other]

    cs.CL cs.AI

    On Cross-Dataset Generalization in Automatic Detection of Online Abuse

    Authors: Isar Nejadgholi, Svetlana Kiritchenko

    Abstract: NLP research has attained high performances in abusive language detection as a supervised classification task. While in research settings, training and test datasets are usually obtained from similar data samples, in practice systems are often applied on data that are different from the training set in topic and class distributions. Also, the ambiguity in class definitions inherited in this task a…

    Submitted 19 May, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: 13 pages, 3 figures, published at WOAH-2020 (The 4th Workshop on Online Abuse and Harms)

  19. arXiv:2006.03096  [pdf, other]

    cs.CL cs.CY

    SOLO: A Corpus of Tweets for Examining the State of Being Alone

    Authors: Svetlana Kiritchenko, Will E. Hipson, Robert J. Coplan, Saif M. Mohammad

    Abstract: The state of being alone can have a substantial impact on our lives, though experiences with time alone diverge significantly among individuals. Psychologists distinguish between the concept of solitude, a positive state of voluntary aloneness, and the concept of loneliness, a negative state of dissatisfaction with the quality of one's social interactions. Here, for the first time, we conduct a la…

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: In Proceedings of the 12th edition of the Language Resources and Evaluation Conference (LREC), May 2020

  20. arXiv:1912.02387  [pdf, other]

    cs.CL cs.IR cs.LG

    SemEval-2015 Task 10: Sentiment Analysis in Twitter

    Authors: Sara Rosenthal, Saif M Mohammad, Preslav Nakov, Alan Ritter, Svetlana Kiritchenko, Veselin Stoyanov

    Abstract: In this paper, we describe the 2015 iteration of the SemEval shared task on Sentiment Analysis in Twitter. This was the most popular sentiment analysis shared task to date with more than 40 teams participating in each of the last three years. This year's shared task competition consisted of five sentiment prediction subtasks. Two were reruns from previous years: (A) sentiment expressed by a phrase…

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: Sentiment analysis, sentiment towards a topic, quantification, microblog sentiment analysis; Twitter opinion mining

    MSC Class: 68T50; ACM Class: I.2.7

    Journal ref: SemEval-2015

  21. arXiv:1805.04558  [pdf, ps, other]

    cs.CL

    NRC-Canada at SMM4H Shared Task: Classifying Tweets Mentioning Adverse Drug Reactions and Medication Intake

    Authors: Svetlana Kiritchenko, Saif M. Mohammad, Jason Morin, Berry de Bruijn

    Abstract: Our team, NRC-Canada, participated in two shared tasks at the AMIA-2017 Workshop on Social Media Mining for Health Applications (SMM4H): Task 1 - classification of tweets mentioning adverse drug reactions, and Task 2 - classification of tweets describing personal medication intake. For both tasks, we trained Support Vector Machine classifiers using a variety of surface-form, sentiment, and domain-…

    Submitted 11 May, 2018; originally announced May 2018.

    Comments: In Proceedings of the Social Media Mining for Health Applications Workshop at AMIA-2017, Washington, DC, USA, 2017

  22. arXiv:1805.04542  [pdf, ps, other]

    cs.CL

    Sentiment Composition of Words with Opposing Polarities

    Authors: Svetlana Kiritchenko, Saif M. Mohammad

    Abstract: In this paper, we explore sentiment composition in phrases that have at least one positive and at least one negative word---phrases like 'happy accident' and 'best winter break'. We compiled a dataset of such opposing polarity phrases and manually annotated them with real-valued scores of sentiment association. Using this dataset, we analyze the linguistic patterns present in opposing polarity phr…

    Submitted 11 May, 2018; originally announced May 2018.

    Comments: In Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), San Diego, California, 2016

  23. arXiv:1805.04508  [pdf, other]

    cs.CL

    Examining Gender and Race Bias in Two Hundred Sentiment Analysis Systems

    Authors: Svetlana Kiritchenko, Saif M. Mohammad

    Abstract: Automatic machine learning systems can inadvertently accentuate and perpetuate inappropriate human biases. Past work on examining inappropriate biases has largely focused on just individual systems. Further, there is no benchmark dataset for examining inappropriate biases in systems. Here for the first time, we present the Equity Evaluation Corpus (EEC), which consists of 8,640 English sentences c…

    Submitted 11 May, 2018; originally announced May 2018.

    Comments: In Proceedings of the 7th Joint Conference on Lexical and Computational Semantics (*SEM), New Orleans, USA, 2018

  24. arXiv:1712.01794  [pdf, other]

    cs.CL

    The Effect of Negators, Modals, and Degree Adverbs on Sentiment Composition

    Authors: Svetlana Kiritchenko, Saif M. Mohammad

    Abstract: Negators, modals, and degree adverbs can significantly affect the sentiment of the words they modify. Often, their impact is modeled with simple heuristics; although, recent work has shown that such heuristics do not capture the true sentiment of multi-word phrases. We created a dataset of phrases that include various negators, modals, and degree adverbs, as well as their combinations. Both the ph…

    Submitted 5 December, 2017; originally announced December 2017.

    Comments: In Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), San Diego, California, 2016

  25. arXiv:1712.01765  [pdf, other]

    cs.CL

    Best-Worst Scaling More Reliable than Rating Scales: A Case Study on Sentiment Intensity Annotation

    Authors: Svetlana Kiritchenko, Saif M. Mohammad

    Abstract: Rating scales are a widely used method for data annotation; however, they present several challenges, such as difficulty in maintaining inter- and intra-annotator consistency. Best-worst scaling (BWS) is an alternative method of annotation that is claimed to produce high-quality annotations while keeping the required number of annotations similar to that of rating scales. However, the veracity of…

    Submitted 5 December, 2017; originally announced December 2017.

    Comments: In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), Vancouver, Canada, 2017

  26. arXiv:1712.01741  [pdf, other]

    cs.CL

    Capturing Reliable Fine-Grained Sentiment Associations by Crowdsourcing and Best-Worst Scaling

    Authors: Svetlana Kiritchenko, Saif M. Mohammad

    Abstract: Access to word-sentiment associations is useful for many applications, including sentiment analysis, stance detection, and linguistic analysis. However, manually assigning fine-grained sentiment association scores to words has many challenges with respect to keeping annotations consistent. We apply the annotation technique of Best-Worst Scaling to obtain real-valued sentiment association scores fo…

    Submitted 5 December, 2017; originally announced December 2017.

    Comments: In Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), San Diego, California, 2016

  27. arXiv:1605.01655  [pdf, other]

    cs.CL

    Stance and Sentiment in Tweets

    Authors: Saif M. Mohammad, Parinaz Sobhani, Svetlana Kiritchenko

    Abstract: We can often detect from a person's utterances whether he/she is in favor of or against a given target entity -- their stance towards the target. However, a person may express the same stance towards a target by using negative or positive language. Here for the first time we present a dataset of tweet--target pairs annotated for both stance and sentiment. The targets may or may not be referred to…

    Submitted 5 May, 2016; originally announced May 2016.

    Comments: 22 pages

  28. arXiv:1311.1194  [pdf, ps, other]

    cs.CL

    Identifying Purpose Behind Electoral Tweets

    Authors: Saif M. Mohammad, Svetlana Kiritchenko, Joel Martin

    Abstract: Tweets pertaining to a single event, such as a national election, can number in the hundreds of millions. Automatically analyzing them is beneficial in many downstream natural language applications such as question answering and summarization. In this paper, we propose a new task: identifying the purpose behind electoral tweets--why do people post election-oriented tweets? We show that identifying…

    Submitted 5 November, 2013; originally announced November 2013.

    Journal ref: In Proceedings of the KDD Workshop on Issues of Sentiment Discovery and Opinion Mining (WISDOM-2013), August 2013, Chicago, USA

  29. arXiv:1309.6352  [pdf, ps, other]

    cs.CL

    Using Nuances of Emotion to Identify Personality

    Authors: Saif M. Mohammad, Svetlana Kiritchenko

    Abstract: Past work on personality detection has shown that frequency of lexical categories such as first person pronouns, past tense verbs, and sentiment words have significant correlations with personality traits. In this paper, for the first time, we show that fine affect (emotion) categories such as that of excitement, guilt, yearning, and admiration are significant indicators of personality. Additional…

    Submitted 24 September, 2013; originally announced September 2013.

    Comments: In Proceedings of the ICWSM Workshop on Computational Personality Recognition, July 2013, Boston, USA

  30. arXiv:1308.6242  [pdf, ps, other]

    cs.CL

    NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets

    Authors: Saif M. Mohammad, Svetlana Kiritchenko, Xiaodan Zhu

    Abstract: In this paper, we describe how we created two state-of-the-art SVM classifiers, one to detect the sentiment of messages such as tweets and SMS (message-level task) and one to detect the sentiment of a term within a message (term-level task). Our submissions stood first in both tasks on tweets, obtaining an F-score of 69.02 in the message-level task and 88.93 in the term-level task. We implemented a variety of surface-form, sema…

    Submitted 28 August, 2013; originally announced August 2013.

    Journal ref: In Proceedings of the seventh international workshop on Semantic Evaluation Exercises (SemEval-2013), June 2013, Atlanta, USA