Future Workload and Cloud Resource Usage: Insights from an Interpretable Forecasting ModelAmadou Ba2024Big Data 2024
Trans-LoRA: towards data-free Transferable Parameter Efficient FinetuningRunqian WangSoumya Ghoshet al.2024NeurIPS 2024
Predicting LLM Inference Latency: A Roofline-Driven ML MethodSaki ImaiRina Nakazawaet al.2024NeurIPS 2024
Consistency-based Black-box Uncertainty Quantification for Text-to-SQLDebarun BhattacharjyaBalaji Ganesanet al.2024NeurIPS 2024
TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data LakesAamod KhatiwadaHarsha Kokelet al.2024NeurIPS 2024
AIHWKIT-Lightning: A Scalable HW-Aware Training Toolkit for Analog In-Memory ComputingJulian BüchelWilliam Simonet al.2024NeurIPS 2024
TensorLakeHouse: A High-Performance, Open-Source Platform for Accelerated Geospatial Data Management with Hierarchical Statistical IndicesRomeo KienzlerLeonardo P. Tizzeiet al.2024AGU 2024
Fully subtractive Ru Topvia interconnects with minimum 9 nm-space airgap for RC performance and reliability enhancement as post-Cu interconnectsKoichi MotoyamaJaemyung Choiet al.2024IEDM 2024