nlpconnect/vit-gpt2-image-captioning vs microsoft/beit-base-patch16-224-pt22k-ft22k | What are the differences? | StackShare