Hi, I'm Zak!

Architecting the
Resilient, Multi-Cloud
Foundation
for AI.

Zak Hassan

Site Reliability Engineer. Platform Engineer. AI Infrastructure Specialist.

For over a decade, I've engineered the robust, scalable infrastructure that powers high-growth technology. As a strategic, embedded Site Reliability Engineer (SRE), I partner with teams to solve their most complex challenges across multi-cloud environments (AWS, GCP, Azure, OCI).

My passion lies at the intersection of platform engineering and AI/ML. I have hands-on experience building the foundational layer for machine learning, from pioneering serverless GPU infrastructure and deploying Spark on Kubernetes to managing large-scale data storage with Ceph. Whether leading zero-downtime migrations or creating reusable, tested infrastructure as code, my mission is to build the secure, automated, and reliable systems that accelerate innovation.

Learn more about me

Sharing my OpenSource experience

As a technical leader, I specialize in building reliable, self-service infrastructure while fostering a culture of innovation and continuous improvement.

I've shared my expertise at leading tech conferences including KubeCon, OpenSource Summit, and Spark Summit. I'm passionate about open source and believe great ideas can come from anyone on the team.

Zak's AI.Assist

Session only - not saved