Skip to content

Docker image for python datascience container with NumPy, SciPy, Scikit-learn, Matplotlib, nltk, pandas packages installed.

License

Notifications You must be signed in to change notification settings

thanhsmind/python-datascience

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

#Docker image for Python Datascience containers

SOFTWARE PACKAGES

  • musl: standard C library
  • lib6-compat: compatibility libraries for glibc
  • linux-headers: commonly needed, and an unusual package name from Alpine.
  • build-base: used so we include the basic development packages (gcc)
  • bash: so we can access /bin/bash
  • git: to ease up clones of repos
  • ca-certificates: for SSL verification during Pip and easy_install
  • freetype: library used to render text onto bitmaps, and provides support font-related operations
  • libgfortran: contains a Fortran shared library, needed to run Fortran
  • libgcc: contains shared code that would be inefficient to duplicate every time as well as auxiliary helper routines and runtime support
  • libstdc++: The GNU Standard C++ Library. This package contains an additional runtime library for C++ programs built with the GNU compiler
  • openblas: open source implementation of the BLAS(Basic Linear Algebra Subprograms) API with many hand-crafted optimizations for specific processor types
  • tcl: scripting language
  • tk: GUI toolkit for the Tcl scripting language
  • libssl1.0: SSL shared libraries

PYTHON DATA SCIENCE PACKAGES

  • numpy: support for large, multi-dimensional arrays and matrices
  • matplotlib: plotting library for Python and its numerical mathematics extension NumPy.
  • scipy: library used for scientific computing and technical computing
  • scikit-learn: machine learning library integrates with NumPy and SciPy
  • pandas: library providing high-performance, easy-to-use data structures and data analysis tools
  • nltk: suite of libraries and programs for symbolic and statistical natural language processing for English

Command

# For Python 2.7 pull $ docker pull nguyenphuongthanhf/python-datascience:2.7 # For Python 3.6 pull $ docker pull nguyenphuongthanhf/python-datascience:3.6 $ docker container run --rm -it nguyenphuongthanhf/python-datascience:3.6 python $ docker container run --rm -it -v $(pwd):app nguyenphuongthanhf/python-datascience:latest python $ docker container run -v $(pwd):/app --rm -it -u www nguyenphuongthanhf/python-datascience:latest sh ### create funny worker support $ vi ~/.bashrc ### Add code friday () { tty= tty -s && tty=--tty docker run \ $tty \ --interactive \ --rm \ --volume $(pwd):/app \ --user $(id -u):$(id -g) \ --env SSH_AUTH_SOCK=$SSH_AUTH_SOCK \ nguyenphuongthanhf/python-datascience:latest python $@ } ### RUN $ friday --version 

About

Docker image for python datascience container with NumPy, SciPy, Scikit-learn, Matplotlib, nltk, pandas packages installed.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Dockerfile 100.0%