GPT-4 by OpenAI, Grok 4, ultralyticsplus/yolov8s, patrickjohncyh/fashion-clip, and LLaVA are the most popular tools in the category “Multimodal Models”.
GPT-4 by OpenAI
66 stacks
Grok 4
4 stacks
ultralyticsplus/yolov8s
2 stacks
patrickjohncyh/fashion-clip
1 stack
LLaVA
openai/clip-vit-large-patch14
nlpconnect/vit-gpt2-image-captioning
immich-app/ViT-H-14-378-quickgelu__dfn5b
image describer
Musid.ai
A large multimodal model that can solve difficult problems with greater accuracy