Marcus Wright

Cloud infrastructure and DevOps practitioner. Kubernetes, FinOps, and supply chain security.

143 articles

Hardware-Aware LLM Selection: Turning Model Choice Into an SRE Discipline

Why teams need reproducible model-to-hardware routing policies as local inference and heterogeneous fleets expand.

A practical framework for governments and regulated enterprises evaluating domestic AI models for broad internal deployment.

Enterprise announcements around Qwen-class on-prem models show a shift from experimentation to governed, costed, and auditable internal AI platforms.