Showing 1–8 of 8 results for author: Reynolds, L

  1. arXiv:2305.16367  [pdf, other]

    cs.CL cs.AI cs.LG

    Role-Play with Large Language Models

    Authors: Murray Shanahan, Kyle McDonell, Laria Reynolds

    Abstract: As dialogue agents become increasingly human-like in their performance, it is imperative that we develop effective ways to describe their behaviour in high-level terms without falling into the trap of anthropomorphism. In this paper, we foreground the concept of role-play. Casting dialogue agent behaviour in terms of role-play allows us to draw on familiar folk psychological terms, without ascribi…

    Submitted 25 May, 2023; originally announced May 2023.

  2. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  3. arXiv:2206.02841  [pdf, other]

    cs.CY cs.AI

    Researching Alignment Research: Unsupervised Analysis

    Authors: Jan H. Kirchner, Logan Smith, Jacques Thibodeau, Kyle McDonell, Laria Reynolds

    Abstract: AI alignment research is the field of study dedicated to ensuring that artificial intelligence (AI) benefits humans. As machine intelligence gets more advanced, this research is becoming increasingly important. Researchers in the field share ideas across different media to speed up the exchange of information. However, this focus on speed means that the research landscape is opaque, making it diff…

    Submitted 6 June, 2022; originally announced June 2022.

  4. arXiv:2204.06745  [pdf, other]

    cs.CL

    GPT-NeoX-20B: An Open-Source Autoregressive Language Model

    Authors: Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle McDonell, Jason Phang, Michael Pieler, USVSN Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach

    Abstract: We introduce GPT-NeoX-20B, a 20 billion parameter autoregressive language model trained on the Pile, whose weights will be made freely and openly available to the public through a permissive license. It is, to the best of our knowledge, the largest dense autoregressive model that has publicly available weights at the time of submission. In this work, we describe GPT-NeoX-20B's architecture and trainin…

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: To appear in the Proceedings of the ACL Workshop on Challenges & Perspectives in Creating Large Language Models

  5. arXiv:2102.07350  [pdf, ps, other]

    cs.CL cs.AI

    Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm

    Authors: Laria Reynolds, Kyle McDonell

    Abstract: Prevailing methods for mapping large generative language models to supervised tasks may fail to sufficiently probe models' novel capabilities. Using GPT-3 as a case study, we show that 0-shot prompts can significantly outperform few-shot prompts. We suggest that the function of few-shot examples in these cases is better described as locating an already learned task rather than meta-learning. This…

    Submitted 15 February, 2021; originally announced February 2021.

  6. arXiv:2102.06391  [pdf, other]

    cs.HC cs.CL

    Multiversal views on language models

    Authors: Laria Reynolds, Kyle McDonell

    Abstract: The virtuosity of language models like GPT-3 opens a new world of possibility for human-AI collaboration in writing. In this paper, we present a framework in which generative language models are conceptualized as multiverse generators. This framework also applies to human imagination and is core to how we read and write fiction. We call for exploration into this commonality through new forms of in…

    Submitted 15 February, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: 10 pages, 7 figures

  7. arXiv:2009.02998  [pdf, other]

    cs.HC

    A Visualization Interface to Improve the Transparency of Collected Personal Data on the Internet

    Authors: Marija Schufrin, Steven Lamarr Reynolds, Arjan Kuijper, Jörn Kohlhammer

    Abstract: Online services are used for all kinds of activities, like news, entertainment, publishing content or connecting with others. But information technology enables new threats to privacy by means of global mass surveillance, vast databases and fast distribution networks. Current news are full of misuses and data leakages. In most cases, users are powerless in such situations and develop an attitude o…

    Submitted 8 September, 2022; v1 submitted 7 September, 2020; originally announced September 2020.

  8. The Effect of Computer-Generated Descriptions on Photo-Sharing Experiences of People with Visual Impairments

    Authors: Yuhang Zhao, Shaomei Wu, Lindsay Reynolds, Shiri Azenkot

    Abstract: Like sighted people, visually impaired people want to share photographs on social networking services, but find it difficult to identify and select photos from their albums. We aimed to address this problem by incorporating state-of-the-art computer-generated descriptions into Facebook's photo-sharing feature. We interviewed 12 visually impaired participants to understand their photo-sharing exper…

    Submitted 3 May, 2018; originally announced May 2018.

    Comments: CSCW 2018

    ACM Class: H.5.1; K.4.2

    Journal ref: Proc. ACM Hum.-Comput. Interact. 1, CSCW, 121 (November 2017), 22 pages