Apache InLong - a one-stop integration framework for massive data
Apache DevLake is an open-source dev data platform
Upserts, Deletes And Incremental Processing on Big Data
A free, open-source, and cross-platform big data analytics framework
Centralize, transform and stash your data
Ridiculously fast, fully asynchronous, sharded hashmap for Rust
A multi-cloud framework for big data analytics
A data visualization framework combining React & D3
Python Stream Processing
A framework for real-life data science
A graph database that supports more than 100+ billion data
Probabilistic Circuits from the Juice library
Scalable and Flexible Gradient Boosting
ETL framework to index data for AI, such as RAG
Docker image used to run data processing workloads
Production-ready data processing made easy and shareable
Distributed scheduled job framework
Analytics for developers, setup Analytics in 30 seconds
A tool to help improve data quality standards in data science
Library providing end-to-end GPU-accelerated recommender systems
A web interface to create custom vector-based visualizations
.NET Standard bindings for Google's TensorFlow for developing models
NBi is a testing framework (add-on to NUnit)
Contains various Apache Flink connectors to connect to AWS data
StreamAlert is a serverless, realtime data analysis framework