NumPy
scikit-learn
| NumPy | scikit-learn | |
|---|---|---|
| 310 | 94 | |
| 31,038 | 64,346 | |
| 1.0% | 0.8% | |
| 10.0 | 9.9 | |
| 4 days ago | 2 days ago | |
| Python | Python | |
| GNU General Public License v3.0 or later | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
NumPy
- Python is not a great language for data science. Part 1: The experience
- Choosing Tech Stack in 2025: A Practical Guide
Unmatched integration with ML/AI ecosystems through NumPy, TensorFlow, and PyTorch
- What Dynamic Typing Is For
- Bringing NumPy's type-completeness score to nearly 90% – Pyrefly
> Let’s take a pause here for a second - the ‘CanIndex’ and ‘SupportsIndex’ from the looks are just “int”.
The PR for the change is https://github.com/numpy/numpy/pull/28913 - The details of files changed[0] shows the change was made in 'numpy/__init__.pyi'. Looking at the whole file[1] shows SupportsIndex is being imported from the standard library's typing module[2].
Where are you seeing SupportsIndex being defined as an int?
> I have a hard time dealing with these custom types because they are so obscure.
SupportsIndex is obscure, I agree, but it's not a custom type. It's defined in stdlib's typing module[2], and was added in Python 3.8.
[0]: https://github.com/numpy/numpy/pull/28913/files
[1]: https://github.com/charris/numpy/blob/c906f847f8ebfe0adec896...
[2]: https://docs.python.org/3/library/typing.html#typing.Support...
- Don’t Let Cyber Risk Kill Your GenAI Vibe: A Developer’s Guide
Know (or check) tells of older versions, such as the python sdk of OpenAI changing from a client with global state in v0.x.x, to a declared instance in v1.x.x, or numpy's change in how random generators are declared.
- Top 5 GitHub Repositories for Data Science in 2026
The book introduces the core libraries essential for working with data in Python: particularly IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and related packages Familiarity with Python as a language is assumed; if you need a quick introduction to the language itself, see the free companion project, A…
- Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers
AI starts with math and coding. You don’t need a PhD—just high school math like algebra and some geometry. Linear algebra (think matrices) and calculus (like slopes) help understand how AI models work. Python is the main language for AI, thanks to tools like TensorFlow and NumPy. If you know JavaScript from Vue.js, Python’s syntax is straightforward.
- Top 17 Tools for Scientific Simulation & Modeling
- Release v2.3.0 (June 7, 2025) · NumPy/NumPy
- How to Get Started with Scikit-Learn: A Beginner-Friendly Guide to Machine Learning in Python
As is the case with most Python libraries, it is open-source and free-to-use, making it easily accessible by anyone willing to learn machine learning, and it is built upon other open-source libraries within Python, like SciPy for advanced scientific operations, NumPy for efficient numerical computations, Matplotlib for data visualization, and Cython for increased efficiency and speed, similar to that of C/C++.
scikit-learn
- The Gorman Paradox: Where Are All the AI-Generated Apps?
Another conspicuous thing is the lack of vibe-coded PRs on mature open source projects. Maybe it's because these projects have erected policies limiting AI contributions, but given the high scores on SWEBench, you'd expect _something_ to come of it?
And yet in real world use you get stuff like https://github.com/scikit-learn/scikit-learn/pull/32101
- Open Source Journey
Start Simple, Build Confidence Project: Scikit-learn After the intense first experience with BEHAVIOR-1K, I needed something more approachable. I went straight to Scikit-learn's good first issue label and found a task that seemed manageable: changing relative imports to absolute imports in Cython files. From this
- Top 5 GitHub Repositories for Data Science in 2026
The book introduces the core libraries essential for working with data in Python: particularly IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and related packages Familiarity with Python as a language is assumed; if you need a quick introduction to the language itself, see the free companion project, A…
- What is the Most Effective AI Tool for App Development Today?
For apps demanding robust machine learning capabilities, frameworks like TensorFlow provide the scalability and flexibility needed to handle large-scale data and models. These tools are essential for developers building features like recommendation engines or predictive analytics.
- Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers
Machine learning (ML) teaches computers to learn from data, like predicting user clicks. Start with simple models like regression (predicting numbers) and clustering (grouping data). Deep learning uses neural networks for complex tasks, like image recognition in a Vue.js gallery. Tools like Scikit-learn and PyTorch make it easier.
- Predicting Tomorrow's Tremors: A Machine Learning Approach to Earthquake Nowcasting in California
Scikit-learn Documentation: https://scikit-learn.org/
- 10 Useful Tools and Libraries for Python Developers
7. Scikit-learn - Machine Learning
- Must-Know 2025 Developer’s Roadmap and Key Programming Trends
Python’s Growth in Data Work and AI: Python continues to lead because of its easy-to-read style and the huge number of libraries available for tasks from data work to artificial intelligence. Tools like TensorFlow and PyTorch make it a must-have. Whether you’re experienced or just starting, Python’s clear style makes it a good choice for diving into machine learning. Actionable Tip: If you’re new to Python, try projects that combine data with everyday problems. For example, build a simple recommendation system using Pandas and scikit-learn.
- 🚀 Launching a High-Performance DistilBERT-Based Sentiment Analysis Model for Steam Reviews 🎮🤖
scikit-learn (optional): Useful for additional training or evaluation tasks.
- State of Python 3.13 Performance: Free-Threading
The race condition bugs are typically hidden by different software layers. For instance, we found one that involves OpenBLAS's pthreads-based thread pool management and maybe its scipy bindings:
- https://github.com/scipy/scipy/issues/21479
it might be the same as this one that further involves OpenMP code generated by Cython:
- https://github.com/scikit-learn/scikit-learn/issues/30151
We haven't managed to write minimal reproducers for either of those but as you can observe, those race conditions can only be triggered when composing many independently developed components.
What are some alternatives?
mitmproxy - An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
Prophet - Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
SymPy - A computer algebra system written in pure Python
tensorflow - An Open Source Machine Learning Framework for Everyone
Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Surprise - A Python scikit for building and analyzing recommender systems