Production Support Engineer (Site Reliability Engineer)
● Managed and supported monitoring across the production and lower environments, using observability tools like Datadog and Splunk
● Built and managed cloud resources and other infrastructure using Terraform automation
● Boosted operational efficiency by 35% with a Slack-integrated app that automated Jira support ticket creation and management,
saving support teams 3+ hours weekly
● Led a team of developers and cloud engineers to design and automate a disaster recovery procedure for newly integrated GCP
infrastructure, achieving a 50% improvement in recovery time
● Provided Tier 3 support and engineering for multiple large-scale distributed microservices and on-premises services as part of on-call rotations supporting infrastructure and services 24/7.