Machine learning in Python
A curated list of data mining papers about fraud detection
A framework for real-life data science
Training data (data labeling, annotation, workflow) for all data types
Uncover insights, surface problems, monitor, and fine-tune your LLM
airda (Air Data Agent)
A reactive notebook for Python
Python Stream Processing
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
AI-data warehouse to enrich, transform and analyze unstructured data
High-level, high-performance dynamic language for technical computing
Create HTML profiling reports from pandas DataFrame objects
The open-source tool for building high-quality datasets
Benchmarking synthetic data generation methods
Beta Machine Learning Toolkit
Data science spreadsheet with Python & SQL
AutoGluon: AutoML for Image, Text, and Tabular Data
Detecting silent model failure. NannyML estimates performance
Streamline your ML workflow
Data science on data without acquiring a copy
The standard data-centric AI package for data quality and ML
ETL framework to index data for AI, such as RAG
Train machine learning models within Docker containers
Best practices on recommendation systems
Simple and distributed Machine Learning