nlpconnect/vit-gpt2-image-captioning vs CIDAS/clipseg-rd64-refined | What are the differences? | StackShare