NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language ModelsAmit DhurandharTejaswini Pedapatiet al.2024ACL 2024
What Would Gauss Say About Representations? Probing Pretrained Image Models using Synthetic Gaussian BenchmarksIrene KoPin-Yu Chenet al.2024ICML 2024
Larimar: Large Language Models with Episodic Memory ControlPayel DasSUBHAJIT CHAUDHURYet al.2024ICML 2024
Pre-Training Protein Encoder via Siamese Sequence-Structure Diffusion Trajectory PredictionZuobai ZhangMinghao Xuet al.2023NeurIPS 2023
The Impact of Positional Encoding on Length Generalization in TransformersAmirhossein KazemnejadInkit Padhiet al.2023NeurIPS 2023
Efficient Equivariant Transfer Learning from Pretrained ModelsSourya BasuPulkit Katdareet al.2023NeurIPS 2023
Characterizing pre-trained and task-adapted molecular representationsCelia CintasPayel Daset al.2023NeurIPS 2023