Make videos programmatically with React
Inference script for Oasis 500M
A Customizable Image-to-Video Model based on HunyuanVideo
Official Python inference and LoRA trainer package
Multimodal Diffusion with Representation Alignment
NVR with realtime local object detection for IP cameras
Implementation of a U-net complete with efficient attention
Lets make video diffusion practical
Behavior tree AI for Godot Engine
Text mining using tidy tools
Official repository for LTX-Video
Open-source multi-speaker long-form text-to-speech model
Convert AI papers to GUI
ESP32 Camera motion capture application to record JPEGs to SD card
A reactive notebook for Python
AI-assisted storyboard and video generation tool
Code for running inference with the SAM 3D Body Model 3DB
Qwen2.5-VL is the multimodal large language model series
Implementation of Make-A-Video, new SOTA text to video generator
Video understanding codebase from FAIR for reproducing video models
MCP server enabling AI coding tools to access Figma design data
Doom-based AI research platform for reinforcement learning
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Tooling for the Common Objects In 3D dataset
Large Multimodal Models for Video Understanding and Editing