1 Centre for Digital Music, Queen Mary University of London, U.K. 2 Music & Audio Machine Learning Lab, Universal Music Group, London, U.K. Multimodal contrastive models have achieved strong ...