microsoft/resnet-50 vs nlpconnect/vit-gpt2-image-captioning | What are the differences? | StackShare