imbalanced-learn vs scikit-learn

imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning (by scikit-learn-contrib)

Source Code

imbalanced-learn.org

Suggest alternative

Edit details

scikit-learn

scikit-learn: machine learning in Python (by scikit-learn)

Machine Learning Python Statistics Data Science Data Analysis

Source Code

scikit-learn.org

Suggest alternative

Edit details

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.

Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

getstream.io

featured

InfluxDB – Built for High-Performance Time Series Workloads

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

www.influxdata.com

featured

imbalanced-learn		scikit-learn
	Project
1	Mentions	94
7,070	Stars	64,346
0.3%	Growth	0.8%
6.9	Activity	9.9
4 months ago	Latest Commit	2 days ago
Python	Language	Python
MIT License	License	BSD 3-clause "New" or "Revised" License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

imbalanced-learn

Posts with mentions or reviews of imbalanced-learn. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-26.

What’s your approach to highly imbalanced data sets?
5 projects | /r/datascience | 26 May 2023

There's a pletora of undersampling and oversampling models you can try out. To avoid removing information form the dataset, you can focus on oversampling techniques. You can try imbalanced-learn or smote-variants. Given enough data, using fully synthetic data is also an option, you can check ydata-synthetic for it. Let us know how it turned out!

scikit-learn

Posts with mentions or reviews of scikit-learn. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-12-14.

The Gorman Paradox: Where Are All the AI-Generated Apps?
6 projects | news.ycombinator.com | 14 Dec 2025

Another conspicuous thing is the lack of vibe-coded PRs on mature open source projects. Maybe it's because these projects have erected policies limiting AI contributions, but given the high scores on SWEBench, you'd expect _something_ to come of it?
And yet in real world use you get stuff like https://github.com/scikit-learn/scikit-learn/pull/32101
Open Source Journey
4 projects | dev.to | 1 Nov 2025

Start Simple, Build Confidence Project: Scikit-learn After the intense first experience with BEHAVIOR-1K, I needed something more approachable. I went straight to Scikit-learn's good first issue label and found a task that seemed manageable: changing relative imports to absolute imports in Cython files. From this
Top 5 GitHub Repositories for Data Science in 2026
8 projects | dev.to | 20 Sep 2025

The book introduces the core libraries essential for working with data in Python: particularly IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and related packages Familiarity with Python as a language is assumed; if you need a quick introduction to the language itself, see the free companion project, A…
What is the Most Effective AI Tool for App Development Today?
23 projects | dev.to | 17 Aug 2025

For apps demanding robust machine learning capabilities, frameworks like TensorFlow provide the scalability and flexibility needed to handle large-scale data and models. These tools are essential for developers building features like recommendation engines or predictive analytics.
Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers
12 projects | dev.to | 6 Aug 2025

Machine learning (ML) teaches computers to learn from data, like predicting user clicks. Start with simple models like regression (predicting numbers) and clustering (grouping data). Deep learning uses neural networks for complex tasks, like image recognition in a Vue.js gallery. Tools like Scikit-learn and PyTorch make it easier.
Predicting Tomorrow's Tremors: A Machine Learning Approach to Earthquake Nowcasting in California
3 projects | dev.to | 3 Jul 2025

Scikit-learn Documentation: https://scikit-learn.org/
10 Useful Tools and Libraries for Python Developers
8 projects | dev.to | 29 Mar 2025

7. Scikit-learn - Machine Learning
Must-Know 2025 Developer’s Roadmap and Key Programming Trends
6 projects | dev.to | 5 Feb 2025

Python’s Growth in Data Work and AI: Python continues to lead because of its easy-to-read style and the huge number of libraries available for tasks from data work to artificial intelligence. Tools like TensorFlow and PyTorch make it a must-have. Whether you’re experienced or just starting, Python’s clear style makes it a good choice for diving into machine learning. Actionable Tip: If you’re new to Python, try projects that combine data with everyday problems. For example, build a simple recommendation system using Pandas and scikit-learn.
🚀 Launching a High-Performance DistilBERT-Based Sentiment Analysis Model for Steam Reviews 🎮🤖
6 projects | dev.to | 16 Dec 2024

scikit-learn (optional): Useful for additional training or evaluation tasks.
State of Python 3.13 Performance: Free-Threading
5 projects | news.ycombinator.com | 5 Nov 2024

The race condition bugs are typically hidden by different software layers. For instance, we found one that involves OpenBLAS's pthreads-based thread pool management and maybe its scipy bindings:
- https://github.com/scipy/scipy/issues/21479
it might be the same as this one that further involves OpenMP code generated by Cython:
- https://github.com/scikit-learn/scikit-learn/issues/30151
We haven't managed to write minimal reproducers for either of those but as you can observe, those race conditions can only be triggered when composing many independently developed components.

What are some alternatives?

When comparing imbalanced-learn and scikit-learn you can also consider the following projects:

deodel - A mixed attributes predictive algorithm implemented in Python.

Prophet - Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

general_class_balancer - Data matching algorithm for categorical and continuous variables

tensorflow - An Open Source Machine Learning Framework for Everyone

confidenceinterval - The long missing library for python confidence intervals

Surprise - A Python scikit for building and analyzing recommender systems

imbalanced-learn vs deodel scikit-learn vs Prophet imbalanced-learn vs general_class_balancer scikit-learn vs tensorflow imbalanced-learn vs confidenceinterval scikit-learn vs Surprise

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.

getstream.io

featured

InfluxDB – Built for High-Performance Time Series Workloads

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

www.influxdata.com

featured

Compare imbalanced-learn vs scikit-learn and see what are their differences.

imbalanced-learn

scikit-learn

imbalanced-learn

scikit-learn

What are some alternatives?

Did you know that Python is
the 2nd most popular programming language
based on number of references?

imbalanced-learn VS scikit-learn

Compare imbalanced-learn vs scikit-learn and see what are their differences.

imbalanced-learn

scikit-learn

imbalanced-learn

scikit-learn

What are some alternatives?

Did you know that Python is the 2nd most popular programming language based on number of references?

Did you know that Python is
the 2nd most popular programming language
based on number of references?