It however turns out that a _lot_ of customer apps today don't use those accelerators at all.
(and about the attempt at using BNNS functions, that's not offloaded, it runs on the host CPU cores w/ the AMX tightly bound accelerator)
It however turns out that a _lot_ of customer apps today don't use those accelerators at all.
(and about the attempt at using BNNS functions, that's not offloaded, it runs on the host CPU cores w/ the AMX tightly bound accelerator)