Skip to main content

Showing 1–5 of 5 results for author: Stinson, C

  1. The State of Documentation Practices of Third-party Machine Learning Models and Datasets

    Authors: Ernesto Lang Oreamuno, Rohan Faiyaz Khan, Abdul Ali Bangash, Catherine Stinson, Bram Adams

    Abstract: Model stores offer third-party ML models and datasets for easy project integration, minimizing coding efforts. One might hope to find detailed specifications of these models and datasets in the documentation, leveraging documentation standards such as model and dataset cards. In this study, we use statistical analysis and hybrid card sorting to assess the state of the practice of documenting model… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 7 pages, 4 figures, IEEESoftware format

    Journal ref: IEEE Software 2024

  2. arXiv:2312.03912  [pdf, other

    cs.CL

    Collaboration or Corporate Capture? Quantifying NLP's Reliance on Industry Artifacts and Contributions

    Authors: Will Aitken, Mohamed Abdalla, Karen Rudie, Catherine Stinson

    Abstract: Impressive performance of pre-trained models has garnered public attention and made news headlines in recent years. Almost always, these models are produced by or in collaboration with industry. Using them is critical for competing on natural language processing (NLP) benchmarks and correspondingly to stay relevant in NLP research. We surveyed 100 papers published at EMNLP 2022 to determine the de… ▽ More

    Submitted 22 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: ACL 2024 Main Conference

  3. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  4. arXiv:2110.08353  [pdf, other

    cs.IR cs.AI cs.LG

    Revisiting Popularity and Demographic Biases in Recommender Evaluation and Effectiveness

    Authors: Nicola Neophytou, Bhaskar Mitra, Catherine Stinson

    Abstract: Recommendation algorithms are susceptible to popularity bias: a tendency to recommend popular items even when they fail to meet user needs. A related issue is that the recommendation quality can vary by demographic groups. Marginalized groups or groups that are under-represented in the training data may receive less relevant recommendations from these algorithms compared to others. In a recent stu… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

  5. arXiv:2105.01031  [pdf

    cs.CY cs.LG

    Algorithms are not neutral: Bias in collaborative filtering

    Authors: Catherine Stinson

    Abstract: Discussions of algorithmic bias tend to focus on examples where either the data or the people building the algorithms are biased. This gives the impression that clean data and good intentions could eliminate bias. The neutrality of the algorithms themselves is defended by prominent Artificial Intelligence researchers. However, algorithms are not neutral. In addition to biased data and biased algor… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.