Wan: Open and advanced large-scale video generative models.In this repository, we present Wan2.1, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation. Wan2.1 offers these key features:SOTA Performance: Wan2.1 consistently outperforms existing open-source models and state-of-the-art commercial solutions across multiple benchmarks.Supports Consumer-grade GPUs: The T2V-1.3B model requires only 8.19 GB VRAM, making it compatible with almost all consumer-grade GPUs. It can generate a 5-second 480P video on an RTX 4090 in about 4 minutes (without optimization techniques like quantization). Its performance is even comparable to some closed-source models.Multiple Tasks: Wan2.1 excels in Text-to-Video, Image-to-Video, Video Editing, Text-to-Image, and Video-to-Audio, advancing the field of video generation.Visual Text Generation: Wan2.1 is the first video model capable of generating both Chinese and English text, featuring robust text generation that enhances its practical applications.Powerful Video VAE: Wan-VAE delivers exceptional efficiency and performance, encoding and decoding 1080P videos of any length while preserving temporal information, making it an ideal foundation for video and image generation.
Advanced video model with state-of-the-art performance, generating 480/720p video on consumer GPUs, supports tasks including Text-to-Video, Image conversion, editing, and...
Location:
China
Collection time:
2025-09-29
Harness advanced AI to transform images and text into videos. Explore unique features like Spotify Canvas for musicians, inspiring artists, and empowering creators, all without the need for any payments. Perfect for those keen to push the boundaries of creative AI.
AI agent bridges thoughts and actions, excelling in work and life tasks like personalized travel, stock analysis, insurance comparisons, and supplier sourcing, autonomously completing tasks and providing insights while users rest.
HumanPal is a software for creating animated videos with AI-generated human characters. The characters can speak any text in various languages. The software has a vast library of characters with different clothing styles, ethnicities, and professions.
Hugging Face Generative AI Services (HUGS) are optimized, zero-configuration inference microservices designed to simplify and accelerate the development of AI applications with open models. Built on open-source Hugging Face technologies such as Text Generation Inference or...
A video foundation model that empowers users to design and animate expressive, realistic characters. It supports script-to-video conversion and offers an ad-free, web-based experience with AI-powered features.
Introducing Layla, the groundbreaking personal AI that resides directly on your phone or device. No internet connection required, no censorship, complete privacy. No information leaves your device.
An LLM-based autonomous agent controlling real-world applications via RESTful APIs.
InstaClip AI is an AI platform that automates the creation of viral short-form content, offering top-tier AI-generated videos, images, and shorts optimized for all platforms.
