StackShareStackShare
Follow on
StackShare

Discover and share technology stacks from companies around the world.

Product

  • Stacks
  • Tools
  • Companies
  • Feed

Company

  • About
  • Blog
  • Contact

Legal

  • Privacy Policy
  • Terms of Service

© 2025 StackShare. All rights reserved.

API StatusChangelog
VideoPoet

VideoPoet

#34in Voice & Audio Models
Stacks0Discussions0
Followers1
OverviewDiscussions

What is VideoPoet?

It is a large language model (LLM) that is capable of a wide variety of video generation tasks, including text-to-video, image-to-video, video stylization, video inpainting and outpainting, and video-to-audio.

VideoPoet is a tool in the Voice & Audio Models category of a tech stack.

Key Features

Use generative models to tell visual storiesLong(er) video generationControllable video editingInteractive video editing

VideoPoet Pros & Cons

Pros of VideoPoet

No pros listed yet.

Cons of VideoPoet

No cons listed yet.

VideoPoet Alternatives & Comparisons

What are some alternatives to VideoPoet?

OpenAI

OpenAI

Creating safe artificial general intelligence that benefits all of humanity. Our work to create safe and beneficial AI requires a deep understanding of the potential risks and benefits, as well as careful consideration of the impact.

Claude

Claude

It is a next-generation AI assistant. It is accessible through chat interface and API. It is capable of a wide variety of conversational and text-processing tasks while maintaining a high degree of reliability and predictability.

Google Gemini

Google Gemini

It is Google’s largest and most capable AI model. It is built to be multimodal, it can generalize, understand, operate across, and combine different types of info — like text, images, audio, video, and code.

LLaMA

LLaMA

It is a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI.

GPT-4 by OpenAI

GPT-4 by OpenAI

It is a large multimodal model (accepting text inputs and emitting text outputs today, with image inputs coming in the future) that can solve difficult problems with greater accuracy than any of our previous models, thanks to its broader general knowledge and advanced reasoning capabilities.

Whisper

Whisper

It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

Try It

Visit Website

Adoption

On StackShare

Companies
0
Developers
0