Official inference library for Mistral models
Port of OpenAI's Whisper model in C/C++
A Pythonic framework to simplify AI service building
GPU environment management and cluster orchestration
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Simplifies the local serving of AI models from any source
LLM training code for MosaicML foundation models
Private Open AI on Kubernetes
A general-purpose probabilistic programming system
Protect and discover secrets using Gitleaks
Bring the notion of Model-as-a-Service to life
A RWKV management and startup tool, full automation, only 8MB
A library for accelerating Transformer models on NVIDIA GPUs
Open-Source AI Camera. Empower any camera/CCTV
A unified framework for scalable computing
PyTorch library of curated Transformer models and their components
State-of-the-art diffusion models for image and audio generation
Fast inference engine for Transformer models
A high-performance ML model serving framework, offers dynamic batching
Powering Amazon custom machine learning chips
Serving system for machine learning models
Replace OpenAI GPT with another LLM in your app
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Library for serving Transformers models on Amazon SageMaker
Unified Model Serving Framework