[L0] share one immediate CL per hardware queue to eliminate dispatch overhead by pvelesko · Pull Request #1293 · CHIP-SPV/chipStar

pvelesko · 2026-06-11T14:05:55Z

Reduce Level Zero kernel dispatch overhead by sharing a single immediate command list per hardware queue instead of allocating one per submission.

…overhead Intel Arc B570 (numQueues=1): zeCommandListAppendLaunchKernel takes ~0.45ms when switching between different immediate CL handles, but only ~0.013ms when reusing the same handle. With N HIP streams each owning a private CL, launching N kernels cost N×0.45ms = 14.4ms at N=32 — identical to OCL's 3.3ms. Fix: CHIPDeviceLevel0 now maintains a SharedImmCLs_ pool keyed by (ordinal<<32|index). All streams that map to the same hardware queue share one CL handle via getOrCreateSharedImmCL(). On B570 (numQueues=1) all streams share one CL → dispatch falls to 32×0.013ms = 0.4ms → perlin-hip N=32: 3.13ms (was 13.97ms), matching OCL at 3.28ms. On multi-queue devices (e.g. PVC numQueues=4) streams get distinct shared CLs per slot, preserving current parallelism behavior. Also fix: - finish() two-flag skip (IsEmptyQueue_ && CmdListInitialized_) to avoid the ~0.4ms zeCommandListHostSynchronize overhead on never-synced CLs - Lazy checkEvents() in getEventFromPool(): only call when pool is exhausted, not on every kernel dispatch

pvelesko · 2026-06-14T09:50:31Z

/run-aurora-ci

pvelesko force-pushed the fix-l0-event-hotpath branch 2 times, most recently from 1903fa8 to 7b713bf Compare June 11, 2026 15:16

pvelesko force-pushed the fix-l0-event-hotpath branch from 7b713bf to 3cae92f Compare June 13, 2026 15:37

colleeneb mentioned this pull request Jun 14, 2026

Modifying isEmptyQueue to check LZ to confirm if the queue is empty so we can catch when it has finished early #1300

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[L0] share one immediate CL per hardware queue to eliminate dispatch overhead#1293

[L0] share one immediate CL per hardware queue to eliminate dispatch overhead#1293
pvelesko wants to merge 1 commit into
mainfrom
fix-l0-event-hotpath

pvelesko commented Jun 11, 2026

Uh oh!

pvelesko commented Jun 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

pvelesko commented Jun 11, 2026

Uh oh!

pvelesko commented Jun 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant