About this role
We are seeking a highly skilled Site Reliability Engineer (SRE) / DevOps Engineer to support enterprise cloud and DevOps transformation initiatives. The ideal candidate has strong expertise in implementing DevOps solutions, cloud-native infrastructure, CI/CD automation, Kubernetes platform engineering, GitOps deployment models, security integration, and observability practices across AWS and Azure environments. The role requires hands-on experience with modern DevOps practices, with a strong focus on:
• CI/CD implementation using GitHub and GitHub Actions
• DevSecOps integration using JFrog and Veracode
• Infrastructure as Code using Terraform
• Kubernetes platform engineering on EKS
• GitOps deployment using ArgoCD
• AWS cloud-native services and automation
Responsibilities
• Design, implement, and maintain enterprise-grade CI/CD pipelines using GitHub and GitHub Actions.
• Automate build, deployment, testing, and release management processes.
• Implement branching strategies, pull request governance, and release approvals.
• Integrate DevSecOps controls into CI/CD pipelines.
• Manage source code repositories and deployment automation workflows.
• Integrate security tools such as JFrog Xray and Veracode into CI/CD pipelines.
• Perform artifact and container image vulnerability scanning.
• Implement security best practices for cloud and container platforms.
• Ensure compliance with enterprise security standards.
• Deploy and manage applications on the AWS cloud platform.
• Provision and manage infrastructure using Terraform and CloudFormation.
• Support cloud-native and serverless deployments.
• Manage infrastructure scalability, resiliency, and availability.
• Deploy and manage containerized applications on Amazon EKS.
• Implement GitOps deployment practices using ArgoCD.
• Support Kubernetes cluster administration, scaling, and monitoring.
• Implement service mesh technologies such as Istio, Envoy, or Gloo.
• Work extensively with AWS services, including EKS, EC2, Lambda, IAM, RDS, ElastiCache, S3, Route53, ALB/NLB, AWS Batch, Secrets Manager, SSM Parameter Store, and KMS.
• Configure secure networking, DNS, encryption, and access management.
• Implement monitoring, logging, and observability solutions using Datadog and Sumo Logic.
• Monitor application metrics, logs, APM, and infrastructure health.
• Configure proactive alerting and incident management workflows.
• Perform root cause analysis and production support activities.
• Work within Agile/Scrum teams using Jira and Confluence.
• Participate in sprint planning, backlog grooming, and release management.
• Collaborate with development, QA, cloud, and security teams.
Requirements
• Minimum of 10 years' experience
• DevOps & CI/CD
• GitHub & GitHub Actions
• JFrog & Veracode
• Terraform
• Amazon EKS
• ArgoCD
• AWS Cloud Services
• Kubernetes
• GitOps
• DevSecOps
• Infrastructure as Code (IaC)
• CI/CD & Source Control: GitHub, GitHub Actions, Bamboo, Maven, Gradle, NPM, MSBuild
• Security & Artifact Management: JFrog, Nexus, Veracode, SonarQube
• Cloud Platforms: AWS
• Infrastructure as Code: Terraform, CloudFormation, Sceptre
• Containerization & Orchestration: Kubernetes, EKS, Docker, ArgoCD, Istio, Envoy/Gloo
• Monitoring & Observability: Datadog, Sumo Logic
Nice-to-haves
• Experience with Azure.
• Experience supporting enterprise-scale DevOps and cloud transformation programs.
• Strong understanding of SRE principles and operational excellence.
• Experience with microservices architecture and distributed systems.
• Exposure to AI-enabled DevOps tools and automation.
• Multi-cloud deployment experience.