Showing 1–5 of 5 results for author: Waites, C

  1. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  2. arXiv:2106.04378  [pdf, other]

    cs.LG stat.ML

    Adaptive Machine Unlearning

    Authors: Varun Gupta, Christopher Jung, Seth Neel, Aaron Roth, Saeed Sharifi-Malvajerdi, Chris Waites

    Abstract: Data deletion algorithms aim to remove the influence of deleted data points from trained models at a cheaper computational cost than fully retraining those models. However, for sequences of deletions, most prior work in the non-convex setting gives valid guarantees only for sequences that are chosen independently of the models that are published. If people choose to delete their data as a function…

    Submitted 8 June, 2021; originally announced June 2021.

  3. arXiv:2104.00722  [pdf, other]

    cs.LG cs.AI

    GABO: Graph Augmentations with Bi-level Optimization

    Authors: Heejung W. Chung, Avoy Datta, Chris Waites

    Abstract: Data augmentation refers to a wide range of techniques for improving model generalization by augmenting training examples. Oftentimes such methods require domain knowledge about the dataset at hand, spawning a plethora of recent literature surrounding automated techniques for data augmentation. In this work we apply one such method, bilevel optimization, to tackle the problem of graph classificati…

    Submitted 1 April, 2021; originally announced April 2021.

  4. arXiv:2103.14068  [pdf, other]

    cs.LG cs.AI cs.CR cs.DS stat.ML

    Differentially Private Normalizing Flows for Privacy-Preserving Density Estimation

    Authors: Chris Waites, Rachel Cummings

    Abstract: Normalizing flow models have risen as a popular solution to the problem of density estimation, enabling high-quality synthetic data generation as well as exact probability density evaluation. However, in contexts where individuals are directly associated with the training data, releasing such a model raises privacy concerns. In this work, we propose the use of normalizing flow models that provide…

    Submitted 25 March, 2021; originally announced March 2021.

  5. arXiv:1912.03250  [pdf, other]

    cs.LG cs.CR stat.ML

    Differentially Private Synthetic Mixed-Type Data Generation For Unsupervised Learning

    Authors: Uthaipon Tantipongpipat, Chris Waites, Digvijay Boob, Amaresh Ankit Siva, Rachel Cummings

    Abstract: We introduce the DP-auto-GAN framework for synthetic data generation, which combines the low dimensional representation of autoencoders with the flexibility of Generative Adversarial Networks (GANs). This framework can be used to take in raw sensitive data and privately train a model for generating synthetic data that will satisfy similar statistical properties as the original data. This learned m…

    Submitted 9 December, 2020; v1 submitted 6 December, 2019; originally announced December 2019.