CNG Member of Technical Staff (Site Reliability Engineer, UK)

  • Location
    London, England, United Kingdom
  • Area of Interest
    Engineer - Software
  • Job Type
    Professional
  • Technology Interest
    None
  • Job Id
    1211736

Job Title: CNG Member of Technical Staff (Site Reliability Engineer)

Location: London, UK


As a Site Reliability Engineer on the Meraki Backend Infrastructure Team, you are responsible for everything from our server hardware and operating systems to tools for code deployment and service monitoring. You build software and systems to monitor, scale and deploy our distributed cloud services.

In this role, you will be part of a small engineering team based in our UK office in Finsbury Square, Central London. You will make crucial decisions about how to manage and scale complex, high-performance distributed systems. You will also bring your own perspective to our backend systems and continually develop innovative ways to improve how we manage the underlying infrastructure.

Main Duties of a Meraki Site Reliability Engineer:

  • Collecting metrics, crunching data and improving service monitoring to detect problems before they’re visible to our customers.

  • Building systems to automate our server lifecycle, from configuration management to server bootstrap and decommission.

  • Scaling our continuous deployment system to accommodate a rapidly growing team and increasing feature velocity without compromising stability.

  • Troubleshooting, performing root cause analysis, and resolving production issues from the network and application layers all the way down to the system level. This might include digging into source code (our own or from open source projects), hunting memory leaks, tracing bottlenecks in upstream networks, or optimizing database queries.

  • Advising other development teams as they build new products, so that those products are scalable, maintainable, and performant.

Skills, qualifications, and experience required:

  • You script or code fluently in Ruby and Bash. You are comfortable digging into other people’s source code in search of the root cause of a problem, and you automate all the things.

  • You are fluent with Git and SVN. You have experience with common DevOps practices such as continuous integration and continuous deployment.

  • You have experience using Ansible to manage a large production environment. You develop scripts and automation tools to provision and operate fleets of servers.

  • You have experience managing large, distributed monitoring platforms using Graphite, Grafana, collectd, StatsD, Logstash, Elasticsearch and Kibana.

  • You have experience building large infrastructure systems using Cisco UCS servers.

  • You care about the customer experience. You have experience in a high-pressure, fast-moving on-call environment.

  • You have experience on a pager rotation, responding to escalations quickly to minimise customer downtime.

  • You build large systems out of small components that each do one job and do it well. Experience with the Debian packaging system is essential.

  • You must be willing to travel internationally up to four times per year.

Benefits include:

  • Competitive salary and annual bonus

  • Company-funded private healthcare

  • Pension contributions

Posting Date: August 8, 2017
Closing date for applications: September 6, 2017

Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.
