About this role
<h3 data-path-to-node="2">Who We Are</h3>
<p id="p-rc_ed02c07fb1fa8ac6-19" data-path-to-node="3"><span data-path-to-node="3,1"><span class="citation-39">At 2K, we create some of the most iconic and culture-shaping video games in entertainment, including NBA® 2K, one of the top-selling franchises in the world, and legendary titles like BioShock®, Borderlands®, Mafia, Sid Meier’s Civilization®, and XCOM®, as well as fan favorites WWE® 2K, TopSpin®, and PGA TOUR® 2K. </span></span><span data-path-to-node="3,4"><span class="citation-38">We build unforgettable experiences by pushing the boundaries of creativity, authenticity and innovation across every genre. </span></span></p>
<p id="p-rc_ed02c07fb1fa8ac6-20" data-path-to-node="4"><span data-path-to-node="4,1"><span class="citation-37">Our portfolio is brought to life by some of the most influential game development studios in the world. </span></span><span data-path-to-node="4,4"><span class="citation-36">Visual Concepts, Firaxis Games, Hangar 13, Cat Daddy Games, 31st Union, Cloud Chamber, Gearbox, HB Studios, and 2K SportsLab create world-class experiences across platforms. </span></span><span data-path-to-node="4,7"><span class="citation-35">But what truly powers 2K is our people. </span></span><span data-path-to-node="4,10"><span class="citation-34">We believe the best ideas come from teams that feel empowered, supported, and inspired. </span></span><span data-path-to-node="4,13"><span class="citation-33">As an equal opportunity employer, we are committed to fostering a diverse, inclusive workplace where people are encouraged to come as they are and do their best work. </span></span></p>
<h3 data-path-to-node="5">The Team</h3>
<p data-path-to-node="6">The 2K SRE team owns the infrastructure behind every player connection—All 2K game services, account platforms, CI/CD pipelines, and developer tooling spanning AWS, GCP, and on-premises data centers across multiple global regions. Global launch windows and live-service events push systems to their limits, and this team is expected to hold the line.</p>
<p data-path-to-node="7">Post-mortems here focus on systems, not people. Automation is the default answer to repetitive work. The infrastructure keeps millions of players connected, and the team takes that seriously!</p>
<h3 data-path-to-node="8">The Role</h3>
<p data-path-to-node="9">The Senior SRE at 2K is a hands-on technical leader—shaping production infrastructure across multiple clouds and regions while partnering with network engineers, systems architects, and game studio developers. This is an ownership role: driving technical direction, influencing reliability from architecture review through production operation, and closing the gap between what engineering ships and what players experience.</p>
<h3 data-path-to-node="10">What You'll Do</h3>
<p data-path-to-node="11"><strong data-path-to-node="11" data-index-in-node="0">Platform & Infrastructure</strong></p>
<ul data-path-to-node="12">
<li>
<p data-path-to-node="12,0,0">Design, build, and operate scalable multi-cloud and hybrid infrastructure using Terraform, Pulumi, and GitOps workflows (ArgoCD, Flux).</p>
</li>
<li>
<p data-path-to-node="12,1,0">Own Kubernetes platforms (EKS, GKE) end-to-end cluster lifecycle, multi-tenancy, networking (Istio, Cilium), and autoscaling.</p>
</li>
<li>
<p data-path-to-node="12,2,0">Push progressive delivery patterns (blue/green, canary) across game service deployments.</p>
</li>
</ul>
<p data-path-to-node="13"><strong data-path-to-node="13" data-index-in-node="0">Observability & Reliability</strong></p>
<ul data-path-to-node="14">
<li>
<p data-path-to-node="14,0,0">Build and run the full observability stack: Prometheus + Grafana + Datadog.</p>
</li>
<li>
<p data-path-to-node="14,1,0">Define SLI/SLO/error budget policies and build alerting that cuts through the noise.</p>
</li>
<li>
<p data-path-to-node="14,2,0">Lead chaos engineering exercises to surface failure modes before players encounter them.</p>
</li>
<li>
<p data-path-to-node="14,3,0">Drive incident response and post-mortems with a focus on systemic fixes and real follow-through.</p>
</li>
</ul>
<p data-path-to-node="15"><strong data-path-to-node="15" data-index-in-node="0">Automation, Security & Developer Experience</strong></p>
<ul data-path-to-node="16">
<li>
<p data-path-to-node="16,0,0">Eliminate toil through self-service provisioning, automated remediation, and intelligent scaling.</p>
</li>
<li>
<p data-path-to-node="16,1,0">Harden CI/CD pipelines (GitHub Actions, Jenkins, ArgoCD).</p>
</li>
<li>
<p data-path-to-node="16,2,0">Embed security at the platform layer through secrets management (PasswordState, 1Password, and AWS Secrets Manager) and policy-as-code (OPA/Gatekeeper).</p>
</li>
</ul>
<p data-path-to-node="17"><strong data-path-to-node="17" data-index-in-node="0">Leadership</strong></p>
<ul data-path-to-node="18">
<li>
<p data-path-to-node="18,0,0">Promote SRE practices across 2K studios through reliability reviews, runbooks, and embedded collaboration.</p>
</li>
<li>
<p data-path-to-node="18,1,0">Shape architectural decisions and author engineering RFCs that move the platform forward.</p>
</li>
</ul>
<h3 data-path-to-node="19">Required Qualifications</h3>
<ul data-path-to-node="20">
<li>
<p data-path-to-node="20,0,0"><strong data-path-to-node="20,0,0" data-index-in-node="0">Experience:</strong> 5+ years in SRE, Platform Engineering, or equivalent infrastructure work at production scale.</p>
</li>
<li>
<p data-path-to-node="20,1,0"><strong data-path-to-node="20,1,0" data-index-in-node="0">Kubernetes:</strong> Deep experience in cloud environments (EKS or GKE preferred), including networking, storage, and multi-cluster patterns.</p>
</li>
<li>
<p data-path-to-node="20,2,0"><strong data-path-to-node="20,2,0" data-index-in-node="0">Infrastructure as Code (IaC):</strong> Strong proficiency with Terraform and/or Pulumi; hands-on with Helm, Terragrunt, and GitOps tooling (ArgoCD or GitHub Actions).</p>
</li>
<li>
<p data-path-to-node="20,3,0"><strong data-path-to-node="20,3,0" data-index-in-node="0">Environments:</strong> Experience with modern and legacy tech, including AWS, GCP, VMware, and Bare metal servers.</p>
</li>
<li>
<p data-path-to-node="20,4,0"><strong data-path-to-node="20,4,0" data-index-in-node="0">Configuration Management:</strong> Server configuration using Ansible, Puppet, and AWS Systems Manager.</p>
</li>
<li>
<p data-path-to-node="20,5,0"><strong data-path-to-node="20,5,0" data-index-in-node="0">Observability:</strong> Experience with Datadog, Prometheus + Grafana, and OpenTelemetry; fluency in operationalizing SLI/SLO/error budgets inside engineering teams.</p>
</li>
<li>
<p data-path-to-node="20,6,0"><strong data-path-to-node="20,6,0" data-index-in-node="0">Software Engineering:</strong> Production-quality code in Go, Python, or TypeScript for tools, automation, and internal libraries.</p>
</li>
<li>
<p data-path-to-node="20,7,0"><strong data-path-to-node="20,7,0" data-index-in-node="0">Systems & Networking:</strong> Solid understanding of Linux internals, TCP/IP networking, DNS, and TLS proven enough to debug at the system level.</p>
</li>
<li>
<p data-path-to-node="20,8,0"><strong data-path-to-node="20,8,0" data-index-in-node="0">Incident Management:</strong> Incident response and post-mortem leadership with a track record of systemic follow-through.</p>
</li>
</ul>
<h3 data-path-to-node="21">Preferred Qualifications</h3>
<ul data-path-to-node="22">
<li>
<p data-path-to-node="22,0,0">Live-service game or large-scale consumer internet experience dealing with millions of concurrent users.</p>
</li>
<li>
<p data-path-to-node="22,1,0">Deep knowledge of Service mesh (Istio, Cilium) and advanced Kubernetes networking.</p>
</li>
<li>
<p data-path-to-node="22,2,0">Experience with FinOps and managing resources efficiently at cloud scale.</p>
</li>
<li>
<p data-path-to-node="22,3,0">Experience with AI and Agentic Development.</p>
</li>
<li>
<p data-path-to-node="22,4,0">Cloud certifications (AWS Solutions Architect, GCP Professional Cloud Architect, CKA/CKS, or equivalent).</p>
</li>
<li>
<p data-path-to-node="22,5,0">Experience mentoring SREs or leading reliability working groups.</p>
</li>
</ul>
<p id="p-rc_ed02c07fb1fa8ac6-21" data-path-to-node="23"><span data-path-to-node="23,1"><span class="citation-32">As an equal opportunity employer, we are committed to ensuring that qualified individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform their essential job functions, and to receive other benefits and privileges of employment. </span></span><span data-path-to-node="23,4"><span class="citation-31">Please contact us if you need reasonable accommodation. </span></span></p>
<p id="p-rc_ed02c07fb1fa8ac6-22" data-path-to-node="24"><span data-path-to-node="24,1"><span class="citation-30">Please note that 2K Games and its studios never uses instant messaging apps or personal email accounts to contact prospective employees or conduct interviews and when emailing, only use 2K.com accounts. </span></span></p>
<p data-path-to-node="25">#LI-Hybrid</p>