Skip to main content

Showing 1–15 of 15 results for author: Moschella, L

  1. arXiv:2406.15057  [pdf, other

    cs.LG

    Latent Space Translation via Inverse Relative Projection

    Authors: Valentino Maiorca, Luca Moschella, Marco Fumero, Francesco Locatello, Emanuele Rodolà

    Abstract: The emergence of similar representations between independently trained neural models has sparked significant interest in the representation learning community, leading to the development of various methods to obtain communication between latent spaces. "Latent space communication" can be achieved in two ways: i) by independently mapping the original spaces to a shared or relative one; ii) by direc… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.00664, arXiv:2406.11014

  2. arXiv:2406.11014  [pdf, other

    cs.LG cs.AI

    Latent Communication in Artificial Neural Networks

    Authors: Luca Moschella

    Abstract: As NNs permeate various scientific and industrial domains, understanding the universality and reusability of their representations becomes crucial. At their core, these networks create intermediate neural representations, indicated as latent spaces, of the input data and subsequently leverage them to perform specific downstream tasks. This dissertation focuses on the universality and reusability o… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Doctoral Thesis: https://iris.uniroma1.it/handle/11573/1711827

  3. arXiv:2404.12917  [pdf, other

    cs.LG cs.AI cs.CV

    Zero-Shot Stitching in Reinforcement Learning using Relative Representations

    Authors: Antonio Pio Ricciardi, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà

    Abstract: Visual Reinforcement Learning is a popular and powerful framework that takes full advantage of the Deep Learning breakthrough. However, it is also known that variations in the input (e.g., different colors of the panorama due to the season of the year) or the task (e.g., changing the speed limit for a car to respect) could require complete retraining of the agents. In this work, we leverage recent… ▽ More

    Submitted 7 May, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 13 pages, 10 figures, 4 tables

    MSC Class: 68T07 ACM Class: I.2.6

  4. arXiv:2311.06547  [pdf, other

    cs.LG

    From Charts to Atlas: Merging Latent Spaces into One

    Authors: Donato Crisostomi, Irene Cannistraci, Luca Moschella, Pietro Barbiero, Marco Ciccone, Pietro Liò, Emanuele Rodolà

    Abstract: Models trained on semantically related datasets and tasks exhibit comparable inter-sample relations within their latent spaces. We investigate in this study the aggregation of such latent spaces to create a unified space encompassing the combined information. To this end, we introduce Relative Latent Space Aggregation, a two-step approach that first renders the spaces comparable using relative rep… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: To appear in the NeurReps workshop @ NeurIPS 2023

  5. arXiv:2311.00664  [pdf, other

    cs.LG

    Latent Space Translation via Semantic Alignment

    Authors: Valentino Maiorca, Luca Moschella, Antonio Norelli, Marco Fumero, Francesco Locatello, Emanuele Rodolà

    Abstract: While different neural models often exhibit latent spaces that are alike when exposed to semantically related data, this intrinsic similarity is not always immediately discernible. Towards a better understanding of this phenomenon, our work shows how representations learned from these neural modules can be translated between different pre-trained networks via simpler transformations than previousl… ▽ More

    Submitted 11 February, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023. 21 pages, 13 figures, 8 tables

  6. arXiv:2310.01211  [pdf, other

    cs.LG

    From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

    Authors: Irene Cannistraci, Luca Moschella, Marco Fumero, Valentino Maiorca, Emanuele Rodolà

    Abstract: It has been observed that representations learned by distinct neural networks conceal structural similarities when the models are trained under similar inductive biases. From a geometric perspective, identifying the classes of transformations and the related invariances that connect these representations is fundamental to unlocking applications, such as merging, stitching, and reusing different ne… ▽ More

    Submitted 20 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 41 pages, 14 figures and 31 tables

  7. arXiv:2303.00721  [pdf, other

    cs.LG cs.AI

    Bootstrapping Parallel Anchors for Relative Representations

    Authors: Irene Cannistraci, Luca Moschella, Valentino Maiorca, Marco Fumero, Antonio Norelli, Emanuele Rodolà

    Abstract: The use of relative representations for latent embeddings has shown potential in enabling latent space communication and zero-shot model stitching across a wide range of applications. Nevertheless, relative representations rely on a certain amount of parallel anchors to be given as input, which can be impractical to obtain in certain scenarios. To overcome this limitation, we propose an optimizati… ▽ More

    Submitted 1 June, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 9 pages, 7 tables

    MSC Class: 68T07 ACM Class: I.2.6

  8. Latent Spectral Regularization for Continual Learning

    Authors: Emanuele Frascaroli, Riccardo Benaglia, Matteo Boschini, Luca Moschella, Cosimo Fiorini, Emanuele Rodolà, Simone Calderara

    Abstract: While biological intelligence grows organically as new knowledge is gathered throughout life, Artificial Neural Networks forget catastrophically whenever they face a changing training data distribution. Rehearsal-based Continual Learning (CL) approaches have been established as a versatile and reliable solution to overcome this limitation; however, sudden input disruptions and memory constraints a… ▽ More

    Submitted 16 July, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: 14 pages, 4 figures, , to appear in Pattern Recognition Letters, Volume 184, August 2024, Pages 119-125

    Journal ref: Pattern Recognition Letters, Volume 184, August 2024, Pages 119-125, ISSN 0167-8655

  9. arXiv:2210.01738  [pdf, other

    cs.LG cs.AI cs.CV

    ASIF: Coupled Data Turns Unimodal Models to Multimodal Without Training

    Authors: Antonio Norelli, Marco Fumero, Valentino Maiorca, Luca Moschella, Emanuele Rodolà, Francesco Locatello

    Abstract: CLIP proved that aligning visual and language spaces is key to solving many vision tasks without explicit training, but required to train image and text encoders from scratch on a huge dataset. LiT improved this by only training the text encoder and using a pre-trained vision network. In this paper, we show that a common space can be created without any training at all, using single-domain encoder… ▽ More

    Submitted 10 November, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 17 pages

  10. arXiv:2209.15430  [pdf, other

    cs.LG cs.AI

    Relative representations enable zero-shot latent space communication

    Authors: Luca Moschella, Valentino Maiorca, Marco Fumero, Antonio Norelli, Francesco Locatello, Emanuele Rodolà

    Abstract: Neural networks embed the geometric structure of a data manifold lying in a high-dimensional space into latent representations. Ideally, the distribution of the data points in the latent space should depend only on the task, the data, the loss, and other architecture-specific constraints. However, factors such as the random weights initialization, training hyperparameters, or other sources of rand… ▽ More

    Submitted 7 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 notable top 5%, 26 pages, 11 figures, 18 tables

    MSC Class: 68T07 ACM Class: I.2.6

  11. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  12. arXiv:2206.03695  [pdf, other

    cs.LG cs.AI

    Metric Based Few-Shot Graph Classification

    Authors: Donato Crisostomi, Simone Antonelli, Valentino Maiorca, Luca Moschella, Riccardo Marin, Emanuele Rodolà

    Abstract: Many modern deep-learning techniques do not work without enormous datasets. At the same time, several fields demand methods working in scarcity of data. This problem is even more complex when the samples have varying structures, as in the case of graphs. Graph representation learning techniques have recently proven successful in a variety of domains. Nevertheless, the employed architectures perfor… ▽ More

    Submitted 4 January, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: To appear in Learning on Graphs (LoG) 2022

  13. arXiv:2201.10222  [pdf, other

    cs.LG cs.AI cs.CL physics.hist-ph

    Explanatory Learning: Beyond Empiricism in Neural Networks

    Authors: Antonio Norelli, Giorgio Mariani, Luca Moschella, Andrea Santilli, Giambattista Parascandolo, Simone Melzi, Emanuele Rodolà

    Abstract: We introduce Explanatory Learning (EL), a framework to let machines use existing knowledge buried in symbolic sequences -- e.g. explanations written in hieroglyphic -- by autonomously learning to interpret them. In EL, the burden of interpreting symbols is not left to humans or rigid human-coded compilers, as done in Program Synthesis. Rather, EL calls for a learned interpreter, built upon a limit… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: Main paper: 10 pages, References: 3 pages, Appendix: 7 pages

  14. arXiv:2106.13679  [pdf, other

    cs.CV cs.GR cs.LG

    Shape registration in the time of transformers

    Authors: Giovanni Trappolini, Luca Cosmo, Luca Moschella, Riccardo Marin, Simone Melzi, Emanuele Rodolà

    Abstract: In this paper, we propose a transformer-based procedure for the efficient registration of non-rigid 3D point clouds. The proposed approach is data-driven and adopts for the first time the transformer architecture in the registration task. Our method is general and applies to different settings. Given a fixed template with some desired properties (e.g. skinning weights or other animation cues), we… ▽ More

    Submitted 28 June, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

  15. arXiv:2104.00514  [pdf, other

    cs.GR cs.CG cs.LG

    Learning Spectral Unions of Partial Deformable 3D Shapes

    Authors: Luca Moschella, Simone Melzi, Luca Cosmo, Filippo Maggioli, Or Litany, Maks Ovsjanikov, Leonidas Guibas, Emanuele Rodolà

    Abstract: Spectral geometric methods have brought revolutionary changes to the field of geometry processing. Of particular interest is the study of the Laplacian spectrum as a compact, isometry and permutation-invariant representation of a shape. Some recent works show how the intrinsic geometry of a full shape can be recovered from its spectrum, but there are approaches that consider the more challenging p… ▽ More

    Submitted 21 December, 2022; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: 18 pages, 20 figures