Showing 432 open source projects for "python (scikit-learn)"

View related business solutions
  • Create a personalized AI chatbot for each team in minutes Icon
    Create a personalized AI chatbot for each team in minutes

    Get better, faster answers for your whole team with an AI chatbot trained on your company documents.

    QueryPal is the lifeline your team needs. Our AI chatbot integrates seamlessly with your communication channels, using advanced language understanding to identify and auto-answer repetitive questions — in seconds.
    Learn More
  • Apify is a full-stack web scraping and automation platform helping anyone get value from the web. Icon
    Apify is a full-stack web scraping and automation platform helping anyone get value from the web.

    Get web data. Build automations.

    Actors are serverless cloud programs that extract data, automate web tasks, and run AI agents. Developers build them using JavaScript, Python, or Crawlee, Apify's open-source library. Build once, publish to Store, and earn when others use it. Thousands of developers do this - Apify handles infrastructure, billing, and monthly payouts.
    Learn More
  • 1
    scikit-learn

    scikit-learn

    Machine learning in Python

    scikit-learn is an open source Python module for machine learning built on NumPy, SciPy and matplotlib. It offers simple and efficient tools for predictive data analysis and is reusable in various contexts.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 2
    scikit-learn-videos

    scikit-learn-videos

    Jupyter notebooks from the scikit-learn video series

    scikit-learn-videos repository accompanies a video tutorial series designed to teach machine learning using Python’s scikit-learn library. It provides the Jupyter notebooks used in each lesson so learners can reproduce the demonstrations and experiment with the code themselves. The series introduces fundamental machine learning concepts such as classification, regression, model evaluation, feature engineering, and cross-validation using clear examples and real datasets. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    gplearn

    gplearn

    Genetic Programming in Python, with a scikit-learn inspired API

    gplearn implements Genetic Programming in Python, with a scikit-learn-inspired and compatible API. While Genetic Programming (GP) can be used to perform a very wide variety of tasks, gplearn is purposefully constrained to solving symbolic regression problems. This is motivated by the scikit-learn ethos, of having powerful estimators that are straightforward to implement. Symbolic regression is a machine learning technique that aims to identify an underlying mathematical expression that best describes a relationship. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Scikit-LLM

    Scikit-LLM

    Seamlessly integrate LLMs into scikit-learn

    Seamlessly integrate powerful language models like ChatGPT into sci-kit-learn for enhanced text analysis tasks. At the moment the majority of the Scikit-LLM estimators are only compatible with some of the OpenAI models. Hence, a user-provided OpenAI API key is required. Additionally, Scikit-LLM will ensure that the obtained response contains a valid label. If this is not the case, a label will be selected randomly (label probabilities are proportional to label occurrences in the training...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Securden Windows Privilege Manager Icon
    Securden Windows Privilege Manager

    For IT security teams

    Remove local administrator rights on Windows servers and endpoints. Seamlessly elevate applications for standard users. Grant time-limited rights on-demand. Control application usage by remote employees through whitelisting and blacklisting.
    Learn More
  • 5
    skfolio

    skfolio

    Python library for portfolio optimization built on top of scikit-learn

    skfolio is a Python library designed for portfolio optimization and financial risk management that integrates closely with the scikit-learn ecosystem. The project provides a unified machine learning-style framework for building, validating, and comparing portfolio allocation strategies using financial data. By following the familiar scikit-learn API design, the library allows quantitative researchers and developers to apply techniques such as model selection, cross-validation, and hyperparameter tuning to portfolio construction workflows. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    SKORCH

    SKORCH

    A scikit-learn compatible neural network library that wraps PyTorch

    A scikit-learn compatible neural network library that wraps PyTorch.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Python Outlier Detection

    Python Outlier Detection

    A Python toolbox for scalable outlier detection

    ...PyOD has multiple neural network-based models, e.g., AutoEncoders, which are implemented in both PyTorch and Tensorflow. PyOD contains multiple models that also exist in scikit-learn. It is possible to train and predict with a large number of detection models in PyOD by leveraging SUOD framework. A benchmark is supplied for select algorithms to provide an overview of the implemented models. In total, 17 benchmark datasets are used for comparison, which can be downloaded at ODDS.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 8
    HyperTools

    HyperTools

    A Python toolbox for gaining geometric insights

    HyperTools is a library for visualizing and manipulating high-dimensional data in Python. It is built on top of matplotlib (for plotting), seaborn (for plot styling), and scikit-learn (for data manipulation). Functions for plotting high-dimensional datasets in 2/3D. Static and animated plots. Simple API for customizing plot styles. Set of powerful data manipulation tools including hyperalignment, k-means clustering, normalizing and more.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 9
    Dask

    Dask

    Parallel computing with task scheduling

    Dask is a Python library for parallel and distributed computing, designed to scale analytics workloads from single machines to large clusters. It integrates with familiar tools like NumPy, Pandas, and scikit-learn while enabling execution across cores or nodes with minimal code changes. Dask excels at handling large datasets that don’t fit into memory and is widely used in data science, machine learning, and big data pipelines.
    Downloads: 5 This Week
    Last Update:
    See Project
  • World class QA, 100% done-for-you Icon
    World class QA, 100% done-for-you

    For engineering teams in search of a solution to design, manage and maintain E2E tests for their apps

    MuukTest is a test automation service that combines our own proprietary, AI-powered software with expert QA services to help you achieve world class test automation at a fraction of the in-house costs.
    Learn More
  • 10
    imbalanced-learn

    imbalanced-learn

    A Python Package to Tackle the Curse of Imbalanced Datasets in ML

    Imbalanced-learn (imported as imblearn) is an open source, MIT-licensed library relying on scikit-learn (imported as sklearn) and provides tools when dealing with classification with imbalanced classes.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Ai-Learn

    Ai-Learn

    The artificial intelligence learning roadmap compiles 200 cases

    Ai-Learn is an open-source artificial intelligence learning roadmap that aggregates educational materials, tutorials, and practical projects designed to help beginners study AI and machine learning systematically. The repository was created to help learners start self-study programs in artificial intelligence without getting overwhelmed by the large number of available resources.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    FastAPI Python

    FastAPI Python

    FastAPI framework, high performance, easy to learn, fast to code

    FastAPI framework, high performance, easy to learn, fast to code, ready for production. FastAPI is a modern, fast (high-performance), web framework for building APIs with Python based on standard Python type hints.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13
    TPOT

    TPOT

    A Python Automated Machine Learning tool that optimizes ML

    Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming. TPOT stands for Tree-based Pipeline Optimization Tool. Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    Machine Learning Study

    Machine Learning Study

    This repository is for helping those interested in machine learning

    ...It often demonstrates how to implement algorithms using widely used libraries such as NumPy, pandas, scikit-learn, and TensorFlow. Many examples include dataset preparation, visualization of results, and experimentation with different modeling approaches.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Zero to Mastery Machine Learning

    Zero to Mastery Machine Learning

    All course materials for the Zero to Mastery Machine Learning

    Zero to Mastery Machine Learning is an open-source repository that contains the complete course materials for the Zero to Mastery Machine Learning and Data Science bootcamp. The project provides a structured curriculum designed to teach machine learning and data science using Python through hands-on projects and interactive notebooks. The repository includes datasets, Jupyter notebooks, documentation, and example code that walk learners through the entire machine learning workflow from problem definition to model deployment. The course introduces essential tools such as NumPy, pandas, Matplotlib, and scikit-learn before moving on to deep learning with frameworks like TensorFlow and Keras. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Learn Claude Code

    Learn Claude Code

    Bash is all you need, write a claude code with only 16 line code

    Learn Claude Code is an educational repository that teaches how modern AI coding agents work by walking learners through a sequence of progressively more complex agent implementations, starting with a minimal Bash-based agent and culminating in agents with explicit planning, subagents, and skills. It emphasizes a hands-on learning path where each version (from v0 to v4) adds conceptual building blocks like the core agent loop, todo planning, task decomposition, and domain knowledge skills,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    NGBoost

    NGBoost

    Natural Gradient Boosting for Probabilistic Prediction

    ngboost is a Python library that implements Natural Gradient Boosting, as described in "NGBoost: Natural Gradient Boosting for Probabilistic Prediction". It is built on top of Scikit-Learn and is designed to be scalable and modular with respect to the choice of proper scoring rule, distribution, and base learner. A didactic introduction to the methodology underlying NGBoost is available in this slide deck.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    Python Web

    Python Web

    Course to learn frontend web development

    This repository is a beginner-friendly template for creating Python web applications using Flask. Designed by @mouredev for learning and practice, it provides a simple, minimalistic structure for serving HTML pages and static content. Ideal for educational purposes and small-scale web projects, it also includes preconfigured files to simplify deployment and local development.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    machine learning tutorials

    machine learning tutorials

    machine learning tutorials (mainly in Python3)

    machine-learning is a continuously updated repository documenting the author’s learning journey through data science and machine learning topics using practical tutorials and experiments. The project presents educational notebooks that combine mathematical explanations with code implementations using Python’s scientific computing ecosystem. Topics covered include classical machine learning algorithms, deep learning models, reinforcement learning, model deployment, and time-series analysis....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    The Arcade Learning Environment

    The Arcade Learning Environment

    The Arcade Learning Environment (ALE) -- a platform for AI research

    Arcade Learning Environment (ALE) is a widely used open-source framework that wraps hundreds of Atari 2600 games via an emulator and presents them as RL environments for AI agents. It decouples the game/emulation aspects from the agent interface, providing a clean API (C++, Python, Gymnasium) so researchers can focus on agent design rather than game plumbing. This environment suite has been central to many RL breakthroughs, including value-based agents, deep Q-nets, and general-agent...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 21
    Learn AI Engineering

    Learn AI Engineering

    Learn AI and LLMs from scratch using free resources

    Learn AI Engineering is a learning path for AI engineering that consolidates high-quality, free resources across the full stack: math, Python foundations, machine learning, deep learning, LLMs, agents, tooling, and deployment. Rather than a loose bookmark list, it organizes topics into a progression so learners can start from fundamentals and move toward practical, production-oriented skills.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Pfl Research

    Pfl Research

    Simulation framework for accelerating research

    A fast, modular Python framework released by Apple for privacy-preserving federated learning (PFL) simulation. Integrates with TensorFlow, PyTorch, and classical ML, and offers high-speed distributed simulation (7–72× faster than alternatives).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    UMAP

    UMAP

    Uniform Manifold Approximation and Projection

    Uniform Manifold Approximation and Projection (UMAP) is a dimension reduction technique that can be used for visualization similarly to t-SNE, but also for general non-linear dimension reduction. It is possible to model the manifold with a fuzzy topological structure. The embedding is found by searching for a low-dimensional projection of the data that has the closest possible equivalent fuzzy topological structure. First of all UMAP is fast. It can handle large datasets and high dimensional...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 24
    Metaflow

    Metaflow

    A framework for real-life data science

    Metaflow is a human-friendly Python library that helps scientists and engineers build and manage real-life data science projects. Metaflow was originally developed at Netflix to boost productivity of data scientists who work on a wide variety of projects from classical statistics to state-of-the-art deep learning.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 25
    Practical Machine Learning with Python

    Practical Machine Learning with Python

    Master the essential skills needed to recognize and solve problems

    ...The repository emphasizes end-to-end workflows rather than isolated code snippets, showing how to handle common challenges like class imbalance, overfitting, hyperparameter optimization, and interpretability. By leveraging popular Python libraries such as pandas, scikit-learn, XGBoost, and visualization tools, it illustrates how to build reproducible and robust solutions that scale beyond small demos.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB