Skip to content

Commit c0ddded

Browse files
Update README.md
1 parent 43f6cc6 commit c0ddded

File tree

1 file changed

+17
-2
lines changed

1 file changed

+17
-2
lines changed

README.md

Lines changed: 17 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@
1919
## Contents
2020
* [Machine Learning](#machine-learning)
2121
* [Deep Learning](#deep-learning)
22+
* [Web Scraping](#web-scraping)
2223
* [Data Manipulation](#data-manipulation)
2324
* [Feature Engineering](#feature-engineering)
2425
* [Visualization](#visualization)
@@ -186,6 +187,12 @@
186187
* [Caffe2](https://github.com/pytorch/pytorch/tree/master/caffe2) - A lightweight, modular, and scalable deep learning framework (now a part of PyTorch).
187188
* [hipCaffe](https://github.com/ROCmSoftwarePlatform/hipCaffe) - The HIP port of Caffe. <img height="20" src="img/amd_big.png" alt="Possible to run on AMD GPU">
188189

190+
## Web Scraping
191+
* [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/bs4/doc/): The easiest library to scrape static websites for beginners
192+
* [Scrapy](https://scrapy.org/): Fast and extensible scraping library. Can write rules and create customized scraper without touching the coure
193+
* [Selenium](https://selenium-python.readthedocs.io/installation.html#introduction): Use Selenium Python API to access all functionalities of Selenium WebDriver in an intuitive way like a real user.
194+
* [Pattern](https://github.com/clips/pattern): High level scraping for well-establish websites such as Google, Twitter, and Wikipedia. Also has NLP, machine learning algorithms, and visualization
195+
189196
## Data Manipulation
190197

191198
### Data Containers
@@ -216,6 +223,7 @@
216223
* [meza](https://github.com/reubano/meza) - A Python toolkit for processing tabular data.
217224
* [Prodmodel](https://github.com/prodmodel/prodmodel) - Build system for data science pipelines.
218225
* [dopanda](https://github.com/dovpanda-dev/dovpanda) - Hints and tips for using pandas in an analysis environment. <img height="20" src="img/pandas_big.png" alt="pandas compatible">
226+
* [CircleCi](https://circleci.com/): Automates your software builds, tests, and deployments.
219227

220228
## Feature Engineering
221229

@@ -234,19 +242,26 @@
234242
* [scikit-rebate](https://github.com/EpistasisLab/scikit-rebate) - A scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning. <img height="20" src="img/sklearn_big.png" alt="sklearn">
235243

236244
## Visualization
245+
### General Purposes
237246
* [Matplotlib](https://github.com/matplotlib/matplotlib) - Plotting with Python.
238247
* [seaborn](https://github.com/mwaskom/seaborn) - Statistical data visualization using matplotlib.
239-
* [Bokeh](https://github.com/bokeh/bokeh) - Interactive Web Plotting for Python.
240-
* [HoloViews](https://github.com/ioam/holoviews) - Stop plotting your data - annotate your data and let it visualize itself.
241248
* [prettyplotlib](https://github.com/olgabot/prettyplotlib) - Painlessly create beautiful matplotlib plots.
242249
* [python-ternary](https://github.com/marcharper/python-ternary) - Ternary plotting library for python with matplotlib.
243250
* [missingno](https://github.com/ResidentMario/missingno) - Missing data visualization module for Python.
244251
* [chartify](https://github.com/spotify/chartify/) - Python library that makes it easy for data scientists to create charts.
245252
* [physt](https://github.com/janpipek/physt) - Improved histograms.
253+
### Interactive plots
246254
* [animatplot](https://github.com/t-makaro/animatplot) - A python package for animating plots build on matplotlib.
247255
* [plotly](https://plot.ly/python/) - A Python library that makes interactive and publication-quality graphs.
256+
* [Bokeh](https://github.com/bokeh/bokeh) - Interactive Web Plotting for Python.
257+
* [Altair](https://altair-viz.github.io/): Declarative statistical visualization library for Python. Can easily do many data transformation within the code to create graph
258+
### Map
248259
* [folium](https://python-visualization.github.io/folium/quickstart.html#Getting-Started) - Makes it easy to visualize data on an interactive open street map
249260
* [geemap](https://github.com/giswqs/geemap) - Python package for interactive mapping with Google Earth Engine (GEE)
261+
### Automatic Plotting
262+
* [HoloViews](https://github.com/ioam/holoviews) - Stop plotting your data - annotate your data and let it visualize itself.
263+
* [AutoViz](https://github.com/AutoViML/AutoViz): Visualize data automatically with 1 line of code
264+
250265

251266

252267
## Deployment

0 commit comments

Comments
 (0)