This document discusses GPU computing with CUDA and NVIDIA Tesla hardware. It provides an overview of GPU computing and how it differs from CPU computing in being optimized for data-parallel throughput rather than low latency. It also describes the key specifications of the NVIDIA Tesla C1060 GPU and Tesla streaming multiprocessor. Finally, it outlines the CUDA parallel computing architecture and programming model, including how applications use the GPU as a coprocessor through kernels launched from the CPU.