Slicing Mutual Information Generalization Bounds for Neural Networks. Kimia Nadjahi, Kristjan Greenewald, et al. 2024. ICML 2024.
Trust Regions for Explanations via Black-Box Probabilistic Certification. Amit Dhurandhar, Swagatam Haldar, et al. 2024. ICML 2024.
Learning Optimal Projection for Forecast Reconciliation of Hierarchical Time Series. Asterios Tsiourvas, Wei Sun, et al. 2024. ICML 2024.
Asymmetry in Low-Rank Adapters of Foundation Models. Jiacheng Zhu, Kristjan Greenewald, et al. 2024. ICML 2024.
Thermometer: Towards Universal Calibration for Large Language Models. Maohao Shen, Subhro Das, et al. 2024. ICML 2024.
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning? Hongkang Li, Meng Weng, et al. 2024. ICML 2024.
Representing Molecules as Random Walks Over Interpretable Grammars. Michael Sun, Minghao Guo, et al. 2024. ICML 2024.
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts. Mohammed Nowaz Rabbani Chowdhury, Meng Wang, et al. 2024. ICML 2024.