google/vit-base-patch16-224 vs nlpconnect/vit-gpt2-image-captioning | What are the differences? | StackShare