Matrix-vector library designed for neural network construction. Features CUDA (GPU) support, OpenMP (multithreaded CPU) support, partial BLAS support, an expression-template-based implementation, PTX code generation identical to hand-written kernels, and support for automatic differentiation.
machine-learning gpu linear-algebra openmp cuda neural-networks blas neuralnetworks gpu-support bct neural-networks-and-deep-learning neural-networks-from-scratch neuralnetwork-construction blackcat-tensors
Updated Nov 8, 2020 - C++
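
The description mentions an expression-template-based implementation. As a rough illustration of that general technique, here is a minimal C++ sketch. It is not taken from this library: the names `Expr`, `Vec`, `Add`, `Scale`, and `axpy_example` are hypothetical and chosen only for the example. The idea is that `2.0 * x + y` builds a lightweight expression object rather than temporary vectors, and the arithmetic runs in a single loop when the expression is assigned.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// CRTP base: every expression node exposes size() and operator[].
// All names in this sketch are hypothetical, not the library's API.
template <class Derived>
struct Expr {
    const Derived& self() const { return static_cast<const Derived&>(*this); }
    std::size_t size() const { return self().size(); }
    double operator[](std::size_t i) const { return self()[i]; }
};

// Concrete vector that owns its storage.
struct Vec : Expr<Vec> {
    std::vector<double> data;
    explicit Vec(std::size_t n, double v = 0.0) : data(n, v) {}
    std::size_t size() const { return data.size(); }
    double operator[](std::size_t i) const { return data[i]; }
    double& operator[](std::size_t i) { return data[i]; }

    // Assigning from any expression evaluates it element by element
    // in one loop, without allocating intermediate vectors.
    template <class E>
    Vec& operator=(const Expr<E>& e) {
        assert(e.size() == size());
        for (std::size_t i = 0; i < size(); ++i) data[i] = e[i];
        return *this;
    }
};

// Lazy node for element-wise addition; stores references only.
template <class L, class R>
struct Add : Expr<Add<L, R>> {
    const L& lhs;
    const R& rhs;
    Add(const L& l, const R& r) : lhs(l), rhs(r) {}
    std::size_t size() const { return lhs.size(); }
    double operator[](std::size_t i) const { return lhs[i] + rhs[i]; }
};

// Lazy node for scalar multiplication.
template <class E>
struct Scale : Expr<Scale<E>> {
    double k;
    const E& e;
    Scale(double k_, const E& e_) : k(k_), e(e_) {}
    std::size_t size() const { return e.size(); }
    double operator[](std::size_t i) const { return k * e[i]; }
};

// Operators build expression nodes instead of computing results.
template <class L, class R>
Add<L, R> operator+(const Expr<L>& l, const Expr<R>& r) {
    return Add<L, R>(l.self(), r.self());
}

template <class E>
Scale<E> operator*(double k, const Expr<E>& e) {
    return Scale<E>(k, e.self());
}

// Hypothetical helper: evaluates 2*x + y with x filled with 1s and y with 3s.
inline Vec axpy_example(std::size_t n) {
    Vec x(n, 1.0), y(n, 3.0), out(n);
    out = 2.0 * x + y;  // the whole expression is evaluated in one pass here
    return out;
}
```

Because the nodes hold references, an expression in this sketch must be assigned within the same statement that builds it. A GPU-backed library would presumably evaluate the same kind of expression tree inside a single kernel instead of a host loop, but that part is outside the scope of this example.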