AMD developed ACE's as part of their GCN architecture ~4 years ago, as you may remember as the 7970, Mantle allowed more control over this design and DX12 will feature Asynchronous instruction handling [mostly] the same way Mantle and its successor/clone, Vulkan, does.
I never stated anything about ACE (which isn't all that different from how most other architectures also partition their execution), rather calling them out on their practice of renaming industry standards, good programming practices, and other previously defined specifications for marketing reasons. So far nothing you have stated counters the assumption that this is just an implementation of executeIndirect, where you can program the GPU controller itself to execute multiple commands without input from the CPU.