GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
UI-TARS-desktop version that can operate on your local personal device
A security scanner for custom LLM applications
Tools for merging pretrained large language models
GPT4V-level open-source multi-modal model based on Llama3-8B
A library for scientific machine learning & physics-informed learning
Distill your ex into an AI Skill
Hunyuan Translation Model Version 1.5
Agent framework and applications built upon Qwen>=3.0
Reading book source
Fast, powerful, git-native ticket tracking in a single bash script
Open-source choice to scale, assess and maintain natural language data
The Unified Machine Learning Framework
Making Enterprise Data Intelligent and Responsive for AI
Multi-lingual large voice generation model, providing inference
Tooling for the Common Objects In 3D dataset
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Data Lake for Deep Learning. Build, manage, and query datasets
Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, etc
Chat & pretrained large audio language model proposed by Alibaba Cloud
AIConfig is a config-based framework to build generative AI apps
A Conversational Speech Generation Model
human detection using yolov8
The unofficial python package that returns response of Google Bard
High-Resolution Image Synthesis with Latent Diffusion Models