Data processing for and with foundation models
Links to everything you'd ever want to learn about data engineering
Data Infrastructure providing an approach to multimodal AI workloads
An AI-powered data science team of agents
CLI tool to extract (meta)data from PDF and manipulate PDF files
A modular graph-based Retrieval-Augmented Generation (RAG) system
Cloud-native open source data warehouse for analytics and AI queries
AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)
Helps data scientists define testable self-documenting dataflows
Instill Core is a full-stack AI infrastructure tool for data
Concurrent Python made simple
3D plotting and mesh analysis through a streamlined interface
AI-powered Jupyter spreadsheet that converts workflows into Python
Dataset Management Framework, a Python library and a CLI tool to build
Production-ready data processing made easy and shareable
Data manipulation and transformation for audio signal processing
Medical imaging toolkit for deep learning
A modular, primitive-first, python-first PyTorch library
Document content and metadata extraction microservice
E2M converts various file types (doc, docx, epub, html, htm, url
Multi-Agent daTa geneRation Infra and eXperimentation framework
Anthropic's Interactive Prompt Engineering Tutorial
Machine Learning Pipelines for Kubeflow
Open Source Differentiable Computer Vision Library
Unsupervised Learning for Image Registration