CPC G06N 3/044 (2023.01) [G06F 9/5038 (2013.01); G06N 3/045 (2023.01); G06N 3/063 (2013.01); G06N 3/084 (2013.01); G06F 2209/5021 (2013.01)] | 20 Claims |
1. An apparatus to facilitate workload scheduling comprising:
an interconnect fabric; and
one or more general purpose graphics processing units coupled with the interconnect fabric, the one or more general purpose graphics processing units to processes general purpose graphics workloads received from one or more clients via the interconnect fabric, the one or more general purpose graphics processing units including:
hardware resources;
a scheduler to schedule direct access to the hardware resources to process the workloads on behalf of the one or more clients, wherein the one or more clients are each associated with a precompiled neural network (NN) kernel; and
a gather unit to bypass zero data values and gather non-zero data values associated with the one or more clients, the non-zero data values stored sparsely in memory.
|