Publications

8 results for Hilde Kuehne

Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?
- - Andrew Rouditchenko
  - Saurabhchand Bhati
  - et al.
- 2025
- ASRU 2025
Poster
Teaching VLMs to Localize Specific Objects from In-context Examples
- - Sivan Doveh
  - Nimrod Shabtay
  - et al.
- 2025
- ICCV 2025
Conference paper
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment
- - Edson Araujo
  - Andrew Rouditchenko
  - et al.
- 2025
- CVPR 2025
Conference paper
New Frontiers in Associative Memories
- - Julia Kempe
  - Dmitry Krotov
  - et al.
- 2025
- ICLR 2025
Workshop
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs
- - Irene Huang
  - Wei Lin
  - et al.
- 2024
- NeurIPS 2024
Conference paper
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
- - Andrew Rouditchenko
  - Yuan Gong
  - et al.
- 2024
- INTERSPEECH 2024
Conference paper
What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation
- - Benedikt Blumenstiel
  - Johannes Jakubik
  - et al.
- 2023
- NeurIPS 2023
Conference paper
4th Workshop on Self-Supervised Learning: Theory and Practice
- - Tengda Han
  - Ishan Misra
  - et al.
- 2023
- NeurIPS 2023
Workshop