Abstract: Contrastive Language-Image Pre-training (CLIP) has shown strong performance in zero-shot image classification. However, it requires large datasets and high computational costs. In this paper ...