Manager, Site Reliability Engineering, Adobe I/O

5 - 10 years
System Administration
Full Time
United States
San Francisco, CA
August 2, 2019

The Challenge

We are looking for a seasoned, passionate and hands on technical leader with expertise in DevOps to join the Adobe I/O team. As a member of the Adobe I/O organization, this Manager will lead a team of highly skilled site reliability engineers to automate and build tools to ensure service uptime and resilience.

Adobe I/O provides the central development platform for delivering, managing and tracking APIs. The I/O platform is used by internal Adobe developers as well as external 3rd-party developers via This role will focus on the needs of application developers looking to integrate with Adobe cloud technologies. This team is uniquely positioned to make a measurable difference to Adobe’s developer ecosystem.

What You’ll Do

  • Manage a highly skilled team of site reliability engineers providing work assignment, check-ins and annual rewards.
  • Develop the solutions to maintain and optimize the availability and performance of services to ensure a fantastic, reliable experience for our customers.
  • Envision, design and build the tools and solutions to keep the services healthy and responsive
  • Continuously improve the techniques and processes used in Operations to optimize the costs and increase the productivity
  • Evaluate and utilize the newer technologies coming in the industry to keep the solution on the cutting edge
  • Help build engineering practices, monitors and processes to provide 99.99% availability
  • Help define prevention and recovery of incident in production and staging environments
  • Collaborate across different teams – development, quality engineering, product management, program management etc to ensure the true devops culture to get the right systems and solutions in place for agile delivery of a growing portfolio of SaaS applications, product releases and infrastructure optimizations
  • Effectively work across multiple timezones to collaborate with peers in other geographies
  • Handle escalations from different quarters – customers, client care and engineering teams, resolve the issues and effectively communicate the status across the board
  • Create a culture that supports innovation and creativity while delivering high volume in a predictable and reliable way.
  • Keep the team motivated to go beyond the expected in execution and thought leadership.

What You Need To Succeed

This position requires seasoned devops leader with strong technical, communications and problem-solving skills and the ability to engage and interact with numerous external teams. The ideal candidate would be a top notch SRE having architected large scale solutions.

Candidates should be able to demonstrate deep competency in most or all of the requirements listed below:

  • An extensive background in developing and operating large-scale cloud-based distributed applications
  • Highly Focused and able to design infrastructure solutions for scalability, reliability, high availability, performance, security, software maintainability, and operational excellence.
  • Ability to “fix the plane while in flight”.
  • Having experience in driving deployments of large scale internet hosted applications at short intervals and developing automated solutions for continuous delivery.
  • Hands on experience in Linux-based platforms, storage systems (SAN and NAS), load balancers and virtualized environments (VMware, AWS, Azure)
  • Good knowledge of different database solutions (SQL, NoSQL, BigData) and the corresponding pros and cons pertaining to each
  • Having worked on latest technologies like Hadoop, Storm, Spark etc
  • Knowledge on the newer virtualization and clustering solutions like Docker, Mesos, CoreOS, Kubernetes etc
  • Expertise in networking with deep knowledge of concepts, protocols and technologies such as TCP/IP, HTTP, VLAN, DNS, switches, routers, datacenter networks
  • Strong experience with designing, deploying and maintaining monitoring solutions such as Splunk, Nagios, Cacti, Munin, Datadog, Pingdom, Runscope, Pagerduty etc
  • Experience in configuration management and automated deployment tools like Chef, Puppet, Salt, Ansible etc
  • Experience with one or more development or scripting languages suited for system administration and automation, such as Ruby, Python, Perl, PHP, Java/Javascript, Shell

Nice To Have.

  • BS or MS in Computer Science or equivalent experience. MS preferred.
  • 7+ years of experience.
  • Familiarity with agile software development processes including software builds and source code control
  • Familiarity with service delivery and project management principles

At Adobe, you will be immersed in an exceptional work environment that is recognized throughout the world on Best Companies lists. You will also be surrounded by colleagues who are committed to helping each other grow through our unique Check-In approach where ongoing feedback flows freely.

If you’re looking to make an impact, Adobe’s the place for you. Discover what our employees are saying about their career experiences on the Adobe Life blog and explore the meaningful benefits we offer.

Adobe is an equal opportunity employer. We welcome and encourage diversity in the workplace regardless of race, gender, religion, age, sexual orientation, gender identity, disability or veteran status.

Apply for this job


Related Jobs