Design a driver abstraction layer or scheduler that optimizes agentic workflow execution across distributed hardware. The focus should be on reducing latency for multi-step reasoning tasks.
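One possible starting point for the scheduler half of this brief is sketched below. It treats a multi-step reasoning task as a DAG of steps and places each step on whichever device finishes it earliest, prioritizing steps on the longest remaining chain. This is a simplified HEFT-style list scheduler, not a full design: it ignores data-transfer costs between devices, and the names `Step`, `schedule`, and `device_speeds`, along with the cost units and relative speed factors, are assumptions made for illustration.

```python
from dataclasses import dataclass
from functools import lru_cache


@dataclass(frozen=True)
class Step:
    name: str
    cost: float        # abstract work units (hypothetical; must be positive)
    deps: tuple = ()   # names of steps that must finish first


def schedule(steps, device_speeds):
    """Place each step of a DAG on a device; return (placements, makespan).

    placements maps step name -> (device, start_time, finish_time).
    device_speeds maps device name -> work units processed per time unit.
    """
    by_name = {s.name: s for s in steps}
    children = {s.name: [] for s in steps}
    for s in steps:
        for dep in s.deps:
            children[dep].append(s.name)

    mean_speed = sum(device_speeds.values()) / len(device_speeds)

    @lru_cache(maxsize=None)
    def rank(name):
        # Upward rank: average run time of this step plus the longest chain after it.
        tail = max((rank(c) for c in children[name]), default=0.0)
        return by_name[name].cost / mean_speed + tail

    # With positive costs a parent always outranks its children, so sorting by
    # descending rank yields a valid topological order.
    order = sorted(by_name, key=lambda n: (-rank(n), n))

    device_free = {dev: 0.0 for dev in device_speeds}
    placements = {}
    for name in order:
        step = by_name[name]
        # A step becomes ready when its last dependency has finished.
        ready = max((placements[d][2] for d in step.deps), default=0.0)
        best = None
        for dev, speed in device_speeds.items():
            start = max(ready, device_free[dev])
            finish = start + step.cost / speed
            if best is None or finish < best[2]:
                best = (dev, start, finish)
        placements[name] = best
        device_free[best[0]] = best[2]

    makespan = max(finish for _, _, finish in placements.values())
    return placements, makespan


# Hypothetical four-step agent workflow on two devices of different speed.
steps = [
    Step("plan", 2.0),
    Step("retrieve", 4.0, deps=("plan",)),
    Step("search", 2.0, deps=("plan",)),
    Step("synthesize", 2.0, deps=("retrieve", "search")),
]
devices = {"fast": 2.0, "slow": 1.0}

placements, makespan = schedule(steps, devices)
print(makespan)              # 4.0
print(placements["search"])  # ('slow', 1.0, 3.0)
```

In this example the two independent steps after `plan` run in parallel, so the workflow finishes at time 4.0 instead of the 5.0 it would take to run all four steps back to back on the fast device alone. A real driver abstraction layer would likely sit beneath a scheduler like this, reporting device speeds and queue state and accounting for transfer costs between devices.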