CPC G06F 9/5038 (2013.01) [G06F 9/4881 (2013.01); G06F 9/505 (2013.01); G06F 9/5016 (2013.01); G06F 9/5061 (2013.01); G06F 9/5083 (2013.01); G06N 20/00 (2019.01)] | 20 Claims |
1. A method implemented using a system comprising a hardware accelerator coupled to a plurality of processors, the method comprising:
identifying a splitting of the plurality of processors among:
a first domain that performs tasks that support the hardware accelerator in accelerating execution of a machine-learning (ML) workload; and
a second, different domain;
during runtime of the system, obtaining memory usage measurements that characterize usage of system memory by the first domain and the second domain; and
adjusting, based on the memory usage measurements, a configuration of (i) the first domain, (ii) the second domain, or (iii) both; and
adjusting utilization of the system memory by the plurality of processors in response to adjusting the configuration.
|