The Configuration Wall: Characterization and Elimination of Accelerator Configuration Overhead
Contemporary compute platforms increasingly offload compute kernels from the CPU to integrated hardware accelerators to reach maximum performance per Watt. Unfortunately, the time t...