GPT-4 by OpenAI, Grok 4, ultralyticsplus/yolov8s, LLaVA, and openai/clip-vit-large-patch14 are the most popular tools in the category “Multimodal Models”.
GPT-4 by OpenAI
66 stacks
Grok 4
4 stacks
ultralyticsplus/yolov8s
2 stacks
LLaVA
1 stacks
openai/clip-vit-large-patch14
patrickjohncyh/fashion-clip
nlpconnect/vit-gpt2-image-captioning
immich-app/ViT-H-14-378-quickgelu__dfn5b
image describer
Musid.ai
A large multimodal model that can solve difficult problems with greater accuracy