Phase 17 — Cheatsheet: Hardware Backends & Plugins
- Platform abstraction = one place for device/dtype/default-backend/capabilities.
- Engine asks the Platform; it never hardcodes CUDA.
- New hardware = out-of-tree plugin via entry points.
- CPU backend runs on a laptop (no paging/graph kernels), great for learning.
Key upstream files
vllm/platforms/interface.pyvllm/platforms/cuda.pyvllm/platforms/cpu.pyvllm/platforms/__init__.pyvllm/plugins/
Full reference: 00-guide.md · 01-deep-dive.md