Private chat with local GPT with documents, images, video, etc.
A simple screen-parsing tool towards a pure vision-based GUI agent
Scrape job websites into a single spreadsheet with no duplicates.
Multi-modal large language model designed for audio understanding
Rules engine for cloud security, cost optimization, and governance
The Airweave CLI for developers and AI agents
Harmonized and Coherent Human Image Animation
Analyzing Hacker News discussions from a decade ago in hindsight
Making RAG Simpler with Small and Open-Sourced Language Models
Let agents classify your bank transactions
A pretty sweet vulnerability scanner
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
PraisonAI application combines AutoGen and CrewAI or similar frameworks
Fast image augmentation library and an easy-to-use wrapper
Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms
Outcome-driven agent development framework that evolves
From Paper to Presentation in One Click
Bash is all you need: write a Claude Code with only 16 lines of code
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Automate Google Hacking Database scraping and searching
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open-weight, large-scale hybrid-attention reasoning model
Bailing is a voice dialogue robot similar to GPT-4o
A Personalized LLM-powered Agent Framework
Chat with any codebase in under two minutes | Fully local