Learning interpretable positional encodings in transformers depends on initializationTaku ItoLuca Cocchiet al.2025ICML 2025
Geometry of naturalistic object representations in recurrent neural network models of working memoryXiaoxuan LeiTaku Itoet al.2024NeurIPS 2024
On the generalization capacity of neural networks during generic multimodal reasoningTaku ItoSoham Danet al.2024ICLR 2024