Open source libraries and APIs to build custom preprocessing pipelines
Instill Core is a full-stack AI infrastructure tool for data
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Extract schema, statistics and entities from datasets
Superlinked is a Python framework for AI Engineers
Parse files for optimal RAG
Central interface to connect your LLM's with external data
Vector database for scalable similarity search and AI applications
A fast, helpful, and open-source document parser
Context database designed specifically for AI Agents
Autonomous LLM agent for end-to-end data science workflows
A system for agentic LLM-powered data processing and ETL
CrateDB is a distributed and scalable SQL database
A modular graph-based Retrieval-Augmented Generation (RAG) system
Web framework designed for speed, security, and SEO
The open source mesh processing system
Lightweight library for scraping web-sites with LLMs
Clean network diagrams, One-time setup, zero upkeep
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Claude Code skill for generating production-quality SVG+PNG technical
Python module for parsing semi-structured text into python tables
Synthetic data generators for structured and unstructured text
AI-data warehouse to enrich, transform and analyze unstructured data
Fast and efficient unstructured data extraction
An extensible framework for Personal Data Management