GPT-4 by OpenAI, ultralyticsplus/yolov8s, LLaVA, patrickjohncyh/fashion-clip, and Grok 4 are the most popular tools in the category “Multimodal Models”.
GPT-4 by OpenAI
64 stacks
ultralyticsplus/yolov8s
2 stacks
LLaVA
1 stacks
patrickjohncyh/fashion-clip
Grok 4
nlpconnect/vit-gpt2-image-captioning
openai/clip-vit-large-patch14
Image to Prompt AI
Dreamega
Gemini 3 Pro Preview
A large multimodal model that can solve difficult problems with greater accuracy