nlpconnect/vit-gpt2-image-captioning vs google/owlvit-base-patch16 | What are the differences? | StackShare