Download PDF

Summary

Platform Site Reliability Engineer with a decade of hands on experience on designing, supporting, and testing next generation solution for Enterprise applications in Cloud Infrastructure Platform , leveraging Infrastructure as a Code, CI/CD and proficient with DevOps practices. Self-learner, highly motivated and contributing towards a fast paced tech trend. Have implemented 2 greenfield projects and experience with support and SRE roles for another 3 clients.

Certificates

  • AWS Certified Solutions Architect Associate SAA-C02
  • Certified Kubernetes Administrator
  • Certified Kubernetes Application Developer
  • ITIL V3 Foundation Certified

Work experience

SRE

Jan 2024Present
Schuberg Philis
  • Led the development of a greenfield project for the largest pension fund in the Netherlands at Schuberg Philis.
  • Created and managed almost 50 AWS accounts part of enablement platform integrating with OKTA for seamless single sign-on functionality.
  • Whole automation was achieved using a combination of CDK TF and several terraform modules that includes but not limited to vend new AWS Accounts, SCP policies, Governance, IAM roles, networking, AWS backup, Gitlab, Terraform cloud workspaces. 
  • Designed and Managed Kubernetes platform EKS for robust production environments and implemented network polices with CNI Cilium in EKS, GitOps with ArgoCD, and vulnerability scanning with Kubescape, and writing Kyverno policies.
  • Providing support for RBAC and guiding product teams through onboarding processes for AWS, Gitlab, EKS and Terraform Cloud.
  • Ensuring compliance and security standards through Sentinel policy checks, while introducing code review processes and collaboration tools for code reusability and knowledge sharing.
  • Conducted acceptance testing for Operations and production container platforms, and the AWS enablement platform and supported developers in their journey to migrate from their cloud competence center to Schuberg Managed AWS cloud platform.
  • Instrumental in setting up managed prometheus, Grafana, Loki stack and ensuring the alertmanager integration with opsgenie.

Tech Stack : GitLab, AWS, EKS, CDK, Cilium, Karpenter, Keda, Kubescape, Terraform Cloud, Kubecost, Loki, Prometheus, Grafana, ArgoCD, OKTA, OpsGenie

Lead SRE

July 2022Dec 2023
HyperTrader Inc
  • Contributed to gethypertrader.com, a highly scalable Fintech Trading platform startup, by engineering and optimizing AWS infrastructure for seamless scalability and reliability under high-volume demands.
  • Implemented auto-scaling and managed production-grade EKS (Kubernetes) on AWS with custom operators to efficiently handle sudden traffic spikes.
  • Created CI/CD pipelines using self-hosted GitHub Actions for deployment and designed outbound proxy servers using Squid.
  • Focused on AWS Security with Guardrails, Config, and Security HUB to mitigate risks and reduced latency by implementing CDN (CloudFront) and MemoryDB, Architected the platform engineering and the AWS Platform and performed migrations.
  • Contributed to architecture design with a focus on operational cost reduction, developed a custom Terraform Provider using GoLang, and utilized ELK, Prometheus, Grafana, and CloudWatch for comprehensive observability.

Tech Stack: Github, AWS, EKS, Kubescape, Terraform, Prometheus, Grafana, Helm

Sr Cloud Native SRE

May 2020June 2022
Shell Netherlands
  • Worked on AWS Cloud and EKS and I was part of the team developed the complete infrastructure as a self service.
  • Contributed to the adoption of EKS (Kubernetes) for container orchestration, streamlining deployment processes. Managed 10 different EKS clusters with 100+ high computing nodes.
  • Collaborated with cross-functional teams to troubleshoot and resolve issues, optimising the performance of the Kubernetes clusters.
  • Actively contributed towards the Jenkins Shared Library and several Terraform modules for code reusability.
  • Integrated the Hashicorp Vault and Terraform Enterprise to the CI and CD Pipeline of jenkins and created an automated release framework on each commits.
  • Designed and set up the the observability stack involving Prometheus, Grafana, Jaeger microservice traceability tool and ESK stack using filebeats. 
  • Spearheaded operations automation initiatives, resulting in a 25% reduction in manual tasks and improved system reliability.

Sr DevOps Engineer

May 2019Apr 2020
HCL Technologies BV, Netherlands

Client Name: ING Bank

  • Spearheaded the implementation of Infrastructure as Code (IaC) practices using Ansible in private cloud.
  • Lead the design, development, and maintenance of complex systems in python, contributing to the scalability and efficiency of critical applications.
  • Conducted regular reviews of alerting thresholds and fine-tuned configurations to minimise false positives in Grafana & Prometheus Alert Manager.
  • Conducted regular performance assessments and optimisations, resulting in a 30% improvement in system response times.
  • Debugged using traceability tools like JAEGER and Open Telemetry to identify and resolve performance bottlenecks in critical applications.

Sr DevOps Engineer

Nov 2014May 2019
Emirates Telecom, Abu Dhabi, UAE
  • Led the design and implementation of a Kubernetes-based on-VMWare Vsphere production environment, ensuring high availability, scalability, and reliability using NSX-T virtualisation and networking layer.
  • Hands on Linux Experience and automation using Ansible, Shell scripting and Python.
  • Monitoring production sites and monitoring using the tools like Dynatrace.
  • Plan for system reliability by designing solutions that seamlessly operate across multiple datacenters.

Tech Stack: Jenkins, Docker Swarm, Linux, Kubernetes on VMWARE, Bitbucket, Ansible, bash, Python, Weblogic, WebSphere, JDK

Member Technical Staff

Nov 2010Oct 2014
Oracle , Bangalore, India
  • Member of Java Product support team handling global support issues related to Weblogic Server/Coherence/Java JDK & Jrockit JVM.
  • Built and configured enterprise grade High-Available Servers and authored several knowledge articles.
  • Involved in bug code fixing & solving complex customer issues including performance tuning of the Weblogic Server, this requires in depth knowledge.

PRODUCT Support Engineer

Feb 2010Oct 2010
KPIT Technologies | Pune, India

Worked for BEA Systems(Now acquired by Oracle Corp.) dealing with same kind of profile for enterprise product support before getting transitioned into Oracle.Supported JAVA/J2EE, Weblogic Server/Oracle Jrockit JDK & Sun HotSpot JDK.

Education

Bachelor of Engineering

July 2005 June 2009
RGTU Bhopal, India

Specialized : Computer Science & Engineering