Machine learning in Python
A curated list of data mining papers about fraud detection
A framework for real-life data science
Training data (data labeling, annotation, workflow) for all data types
Uncover insights, surface problems, monitor, and fine tune your LLM
airda(Air Data Agent
A reactive notebook for Python
Python Stream Processing
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
AI-data warehouse to enrich, transform and analyze unstructured data
High-level, high-performance dynamic language for technical computing
Create HTML profiling reports from pandas DataFrame objects
The open-source tool for building high-quality datasets
Benchmarking synthetic data generation methods
Beta Machine Learning Toolkit
AutoGluon: AutoML for Image, Text, and Tabular Data
Data science spreadsheet with Python & SQL
Detecting silent model failure. NannyML estimates performance
Streamline your ML workflow
Data science on data without acquiring a copy
The standard data-centric AI package for data quality and ML
ETL framework to index data for AI, such as RAG
Train machine learning models within Docker containers
Best practices on recommendation systems
A reinforcement learning package for Julia