Skip to main content

Showing 1–4 of 4 results for author: Shkaruta, K

  1. arXiv:2209.05286  [pdf, other

    cs.CL

    DECK: Behavioral Tests to Improve Interpretability and Generalizability of BERT Models Detecting Depression from Text

    Authors: Jekaterina Novikova, Ksenia Shkaruta

    Abstract: Models that accurately detect depression from text are important tools for addressing the post-pandemic mental health crisis. BERT-based classifiers' promising performance and the off-the-shelf availability make them great candidates for this task. However, these models are known to suffer from performance inconsistencies and poor generalization. In this paper, we introduce the DECK (DEpression Ch… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

  2. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  3. arXiv:1910.00065  [pdf, other

    cs.CL

    Lexical Features Are More Vulnerable, Syntactic Features Have More Predictive Power

    Authors: Jekaterina Novikova, Aparna Balagopalan, Ksenia Shkaruta, Frank Rudzicz

    Abstract: Understanding the vulnerability of linguistic features extracted from noisy text is important for both developing better health text classification models and for interpreting vulnerabilities of natural language models. In this paper, we investigate how generic language characteristics, such as syntax or the lexicon, are impacted by artificial text alterations. The vulnerability of features is ana… ▽ More

    Submitted 30 September, 2019; originally announced October 2019.

    Comments: EMNLP Workshop on Noisy User-generated Text (W-NUT 2019)

  4. arXiv:1904.01684  [pdf, other

    cs.CL

    Impact of ASR on Alzheimer's Disease Detection: All Errors are Equal, but Deletions are More Equal than Others

    Authors: Aparna Balagopalan, Ksenia Shkaruta, Jekaterina Novikova

    Abstract: Automatic Speech Recognition (ASR) is a critical component of any fully-automated speech-based dementia detection model. However, despite years of speech recognition research, little is known about the impact of ASR accuracy on dementia detection. In this paper, we experiment with controlled amounts of artificially generated ASR errors and investigate their influence on dementia detection. We find… ▽ More

    Submitted 13 October, 2020; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: EMNLP Workshop on Noisy User-generated Text (W-NUT 2020)