People - Anirban Ghose | Intel DevMesh

Memory Optimizations for Deep Learning Workloads on Hardware Accelerators

External Post

by Anirban Ghose, June 12, 2020

The objective of the project lies in exploring memory optimizations for speeding up training of large Deep Neural Networks (DNN) using oneAPI’s DPC++ toolchain for any general purpose heterogeneous architecture comprising multicore CPUs, integrated GPUs as well as general purpose hardware accelerators with discrete memory spaces such as discrete GPUs and FPGAs. Currently the project is in the concept stage.

Anirban Ghose

Posts

Memory Optimizations for Deep Learning Workloads on Hardware Accelerators

Login to continue

This action requires you to be logged in.