Showing 1–5 of 5 results for author: Dillavou, S

  1. arXiv:2405.21042  [pdf, other]

    cs.LG

    Comparing information content of representation spaces for disentanglement with VAE ensembles

    Authors: Kieran A. Murphy, Sam Dillavou, Dani S. Bassett

    Abstract: Disentanglement is the endeavour to use machine learning to divide information about a dataset into meaningful fragments. In practice these fragments are representation (sub)spaces, often the set of channels in the latent space of a variational autoencoder (VAE). Assessments of disentanglement predominantly employ metrics that are coarse-grained at the model level, but this approach can obscure mu…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Code: https://github.com/murphyka/representation-space-info-comparison

  2. arXiv:2311.00537  [pdf, other]

    cond-mat.soft cs.ET cs.LG

    Machine Learning Without a Processor: Emergent Learning in a Nonlinear Electronic Metamaterial

    Authors: Sam Dillavou, Benjamin D Beyer, Menachem Stern, Andrea J Liu, Marc Z Miskin, Douglas J Durian

    Abstract: Standard deep learning algorithms require differentiating large nonlinear networks, a process that is slow and power-hungry. Electronic learning metamaterials offer potentially fast, efficient, and fault-tolerant hardware for analog machine learning, but existing implementations are linear, severely limiting their capabilities. These systems differ significantly from artificial neural networks as…

    Submitted 5 April, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 11 pages 8 figures

    Journal ref: Proc. Nat. Acad. Sci. 121 (28), e2319718121 (2024)

  3. arXiv:2309.00058  [pdf, other]

    cs.CV cond-mat.soft

    Bellybutton: Accessible and Customizable Deep-Learning Image Segmentation

    Authors: Sam Dillavou, Jesse M. Hanlan, Anthony T. Chieco, Hongyi Xiao, Sage Fulco, Kevin T. Turner, Douglas J. Durian

    Abstract: The conversion of raw images into quantifiable data can be a major hurdle in experimental research, and typically involves identifying region(s) of interest, a process known as segmentation. Machine learning tools for image segmentation are often specific to a set of tasks, such as tracking cells, or require substantial compute or coding knowledge to train and use. Here we introduce an easy-to-use…

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: 6 Pages 3 Figures

  4. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  5. arXiv:2201.04626  [pdf, other]

    cond-mat.soft cs.LG cs.NE

    Desynchronous Learning in a Physics-Driven Learning Network

    Authors: Jacob F Wycoff, Sam Dillavou, Menachem Stern, Andrea J Liu, Douglas J Durian

    Abstract: In a network of neurons, synapses update individually using local information, allowing for entirely decentralized learning. In contrast, elements in an artificial neural network (ANN) are typically updated simultaneously using a central processor. Here we investigate the feasibility and effect of desynchronous learning in a recently introduced decentralized, physics-driven learning network. We show t…

    Submitted 1 December, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: 6 pages 4 figures