Description
About the Team
At Slack, the Cloud Engineering team is the backbone of our infrastructure — a dynamic group of Cloud Engineers, Financial Analysts, and AWS Subject Matter Experts united by a single mission: keeping Slack fast, reliable, and cost-efficient for millions of users worldwide.
We tackle unique, large-scale challenges that very few engineers ever get to work on. From designing the systems that power global real-time communication to writing software that brings deep visibility into our cloud infrastructure, our work has direct impact every single day. We partner with teams across Slack to maximize cloud value, champion cost-conscious engineering, and build a platform that scales with our ambitions. If you're energized by high availability, resilience, and the right technology choices — this is the team for you.
Slack has a positive, diverse, and supportive culture — we look for people who are curious, inventive, and work to be a little better every single day. In our work together we aim to be smart, humble, hardworking and, above all, collaborative. If this sounds like a good fit for you, read on ahead!
What you will be doing:
Lead software projects end-to-end — from scoping and architecture through delivery, iteration, and long-term ownership
Architect and build a next-generation internal platform that gives engineering teams a powerful foundation to innovate quickly
Deliver cutting-edge solutions leveraging containerization, virtualization, and a broad suite of AWS services
Author, extend, and improve Terraform modules that power infrastructure-as-code across Slack
Design and implement an in-house system to deploy, manage, and scale applications for service owners
Partner directly with development teams to identify performance bottlenecks and drive cloud efficiency improvements
Build strong, trusted relationships with service owners — serving as a go-to advisor on cloud architecture and best practices
Champion a culture of platform efficiency by sharing knowledge, writing runbooks, and leading internal enablement sessions
Mentor and grow junior engineers, scaling the impact of the team through thoughtful technical leadership
Make a measurable financial impact — driving millions of dollars in cloud cost savings annually
Participate in on-call rotation and collaborate with our operations team to triage and resolve production incidents with urgency and precision
Build observability and introspection tooling that gives engineers deep, real-time visibility into system health and bottlenecks
What you should have:
U.S. Citizenship or Permanent Residency (Green Card holder). We are unable to provide visa sponsorship for this role.
Genuine curiosity about how cloud infrastructure works — and a passion for sharing that knowledge with your team
Proven ability to analyze, optimize, and improve reliability in high-traffic, production internet applications
A strong mentoring instinct and commitment to engineering excellence: you lead by example in code reviews, testing, design docs, and debugging
Deep, hands-on AWS experience — broad familiarity across many services with deep expertise in at least a few
Demonstrated experience deploying cloud applications and managing infrastructure-as-code using Terraform and/or CloudFormation
Strong ability to troubleshoot and debug complex issues across infrastructure, applications, and distributed systems
A track record of professional software development you're proud of — you can point to real-world systems you've built, scaled, and improved
Experience working with Kubernetes (K8s)
Qualifications:
7+ years of professional experience in cloud engineering or a closely related discipline, working in a collaborative team environment
Strong computer science fundamentals: data structures, algorithms, distributed systems, programming languages, and information retrieval
Bachelor's degree in Computer Science, Engineering, or a related field — or equivalent training, fellowship, or work experience
Proficiency in one or more functional or imperative programming languages — Python, Go, or PHP preferred
Hands-on experience with software engineering, scripting, automation, and orchestration tools (e.g., Bash, Chef, Jenkins, Terraform)
Extensive, production-grade experience provisioning, configuring, and maintaining AWS environments
Experience managing large-scale Kubernetes systems (EKS or Bare Metal)
Bonus Points:
Deep expertise in core AWS services such as EKS, EC2, IAM, Fargate, S3, or Lambda
AWS Professional or Specialty certification(s)
Experience designing or operating large-scale, high-volume distributed systems
A proven history of driving significant cloud cost reductions at scale
Familiarity with observability tooling (e.g., Datadog, Prometheus, OpenTelemetry) and SRE practices