2023 . 05 . 16

Fine-Grained Visual Textual Alignment for Cross-Modal Retrieval Using Transformer Encoders