Building Reliable Systems
Staff Software Engineer specializing in DevOps, SRE, and Cloud Infrastructure
Background
As a Staff Software Engineer at Okta and former Senior Software Engineer @ Red Hat and other companies Workday, SAP and Hootsuite, I've dedicated my career to building reliable, scalable infrastructure and improving system resilience. My journey in technology has been driven by a passion for solving complex problems and making systems more efficient. I started out my career at Red Hat working on middleware products and was early in working on AI use cases which provided me the opportunity to work with some of the best and brightest in the industry. I've seen the good, the bad and the ugly parts of taking product from idea to production at multiple companies.
Technical Focus
My expertise lies at the intersection of Site Reliability Engineering (SRE), Cloud (AWS, GCP, OCI) and AI (Langchain, OpenAI, Gemini, Cohere, AWS Bedrock). I specialize in:
- Designing and implementing large-scale distributed production systems
- Building robust CI/CD pipelines and automation workflows
- Cloud infrastructure optimization across AWS, GCP, and hybrid environments including Nvidia GPU in the cloud
- Kubernetes orchestration and container technologies
- Implementing observability and monitoring solutions with Grafana, Loki, Prometheus, and Splunk