nlpconnect/vit-gpt2-image-captioning vs facebook/detr-resnet-50 | What are the differences? | StackShare