scikit-learn

Open-source projects categorized as scikit-learn

Top 23 scikit-learn Open-Source Projects

  • ML-For-Beginners

    12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

  • Project mention: Good coding groups for black women? | news.ycombinator.com | 2024-01-13

    - https://github.com/microsoft/ML-For-Beginners

    Also check out this list Pitt puts out every year:

  • 100-Days-Of-ML-Code

    100 Days of ML Coding

  • Project mention: Top 10 GitHub Repositories for Python and Java Developers | dev.to | 2024-05-03

    5. Avik-Jain/100-Days-Of-ML-Code - As the name implies, this repository offers a structured approach to learning machine learning with Python. It covers core ML principles and algorithms through real-world applications. https://github.com/Avik-Jain/100-Days-Of-ML-Code

  • PythonDataScienceHandbook

    Python Data Science Handbook: full text in Jupyter Notebooks

  • Project mention: About Data analyst, data scientist and data engineer, resources and experiences | dev.to | 2024-03-26

    Python Data Science Handbook

  • data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • handson-ml

    ⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

  • best-of-ml-python

    🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

  • onnxruntime

    ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

  • Project mention: New exponent functions that make SiLU and SoftMax 2x faster, at full acc | news.ycombinator.com | 2024-05-15
  • python-machine-learning-book

    The "Python Machine Learning (1st edition)" book code repository and info resource

  • Dask

    Parallel computing with task scheduling

  • Project mention: The Distributed Tensor Algebra Compiler (2022) | news.ycombinator.com | 2023-06-15
  • mlcourse.ai

    Open Machine Learning Course

  • Project mention: Open Machine Learning Course | news.ycombinator.com | 2023-10-22
  • sktime

    A unified framework for machine learning with time series

  • auto-sklearn

    Automated Machine Learning with scikit-learn

  • autogluon

    Fast and Accurate ML in 3 Lines of Code

  • featuretools

    An open source python library for automated feature engineering

  • Project mention: Featuretools – A Python Library for Automated Feature Engineering | news.ycombinator.com | 2023-09-20
  • interpret

    Fit interpretable models. Explain blackbox machine learning.

  • skorch

    A scikit-learn compatible neural network library that wraps PyTorch

  • orange

    🍊 📊 💡 Orange: Interactive data analysis

  • Project mention: Hierarchical Clustering | news.ycombinator.com | 2024-04-20

    I know I've tooted its horn before, but Orange3 is a pretty neat Python-based GUI platform that makes this and a metric buttload of other statistical/ML techniques available to non-programmer types.

    Just watch out for the null character `\x00` in the corpus. That always seems to kill it stone dead.

    https://orangedatamining.com/

    https://orange3.readthedocs.io/projects/orange-visual-progra...
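
The null-character caveat in the comment above can be handled before a corpus is loaded. This is a minimal sketch, assuming the corpus is available as a list of Python strings; `strip_null_chars` is a hypothetical helper for illustration and is not part of Orange.

```python
def strip_null_chars(documents):
    """Return a copy of the documents with every NUL character removed."""
    # "\x00" is the null character the comment warns about.
    return [doc.replace("\x00", "") for doc in documents]


# Hypothetical sample corpus; real data would come from your own files.
corpus = ["clean text", "bad\x00text", "\x00"]
cleaned = strip_null_chars(corpus)
```

Cleaning the text up front, rather than inside the GUI, keeps the original files untouched and makes the step easy to repeat.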

  • machine_learning_complete

    A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

  • superduperdb

    🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.

  • Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12
  • python-machine-learning-book-3rd-edition

    The "Python Machine Learning (3rd edition)" book code repository

  • flower

    Flower: A Friendly Federated Learning Framework (by adap)

  • Project mention: Pyenv – lets you easily switch between multiple versions of Python | news.ycombinator.com | 2024-03-25

    We use Pyenv successfully for developing the Flower open-source project. We use a few simple Bash scripts to manage virtual environments with different Python versions via pyenv and the pyenv-virtualenv plugin.

    The main scripts are `venv-create.sh`, `venv-delete.sh` and `bootstrap.sh`. `venv-reset.sh` pulls these three scripts together to make reinstalling your venv a single command.

    Here's the link if anyone is interested: https://github.com/adap/flower/tree/main/dev

  • yellowbrick

    Visual analysis and diagnostic tools to facilitate machine learning model selection.

  • FLAML

    A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

  • Project mention: AutoGen: Enabling Next-Gen GPT-X Applications | news.ycombinator.com | 2023-08-22

    I really like the simplicity of this framework, and they hit on a lot of common problems found in other agent-based frameworks. Most intrigued by the RAG improvements.

    Seems like Microsoft was frustrated with the pace of movement in this space and the shitty results of agents (which admittedly kept my interest turned away from agents for the last few months). I'm interested again because it makes practical sense, and from looking at the example notebooks, seems fairly easy to integrate into existing applications.

    Maybe this is the 'low code' approach that might actually work, and bridge together engineering and non-engineering resources.

    This example was what caught my eye: https://github.com/microsoft/FLAML/blob/main/notebook/autoge...

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

scikit-learn related posts

  • About Data analyst, data scientist and data engineer, resources and experiences

    5 projects | dev.to | 26 Mar 2024
  • Show HN: Logistic Regression Training on Encrypted Data with FHE

    1 project | news.ycombinator.com | 6 Feb 2024
  • Implementing a ChatGPT-like LLM from scratch, step by step

    3 projects | news.ycombinator.com | 27 Jan 2024
  • Training ML Models on Encrypted Data with Homomorphic Encryption (FHE)

    1 project | news.ycombinator.com | 25 Jan 2024
  • AlphaPy: machine learning framework built on sklearn and pandas. Supports pyfolio/xgboost/lightgbm/catboost (gradient boosting on decision trees) etc. Examples include financial market prediction/sports prediction/kaggle. Configurations are set though

    1 project | /r/algoprojects | 10 Dec 2023
  • Tradero: A tool for achieving self-funding via trading

    1 project | news.ycombinator.com | 12 Sep 2023
  • Scikit-learn Stock Prediction: using fundamental and pricing data to predict future stock returns. Sklearn's random forest classifier is trained and the author claimed positive live trading results. Not actively maintained. Other Models - star count: 1520.0

    1 project | /r/algoprojects | 28 Aug 2023

Index

What are some of the best open-source scikit-learn projects? This list will help you:

 #  Project                                    Stars
 1  ML-For-Beginners                           67,267
 2  100-Days-Of-ML-Code                        43,599
 3  PythonDataScienceHandbook                  41,635
 4  data-science-ipython-notebooks             26,532
 5  handson-ml                                 25,099
 6  best-of-ml-python                          15,672
 7  onnxruntime                                12,894
 8  python-machine-learning-book               12,076
 9  Dask                                       12,055
10  mlcourse.ai                                 9,454
11  sktime                                      7,454
12  auto-sklearn                                7,422
13  autogluon                                   7,181
14  featuretools                                7,064
15  interpret                                   6,022
16  skorch                                      5,648
17  orange                                      4,626
18  machine_learning_complete                   4,520
19  superduperdb                                4,415
20  python-machine-learning-book-3rd-edition    4,386
21  flower                                      4,251
22  yellowbrick                                 4,206
23  FLAML                                       3,701
