Sign In
 [New User? Sign Up]
Mobile Version

SR RELIABILITY ENGINEER

Home Depot


Location:
Austin, TX
Date:
11/17/2017
2017-11-172017-12-16
Job Code:
111242
Apply on the Company Site
  •  
  • Save Ad
  • Email Friend
  • Print
  • Research Salary

Job Details

SR SOFTWARE ENGINEER - SRE - ATN - dot com - Austin TX 78721 Skip Navigation
Job Details

SR RELIABILITY ENGINEER (111242)

TX - Austin

  • Date Posted: Nov 9, 2017
  • Company: The Home Depot
  • Travel: None
  • Functional Area: Information Technology
  • Position Type: Full-Time
  • Relocation Provided: No




Position Description:

If you are passionate about large scale, mission critical software systems, and you have a maniacal focus on system availability and performance, Reliability Engineering is for you. 

As a member of our Reliability Engineering team, you will be responsible for scaling some of the largest software products in Retail by automating the application infrastructure, deployment, and monitoring of those products in production. You will also be part of a 24x7 on-call team that will lead the triage of incidents for your products using your expertise to mitigate the problem as soon as possible. Our "own what you build" mentality empowers you to make decisions quickly to deliver reliability improvements without the red tape that typically surrounds enterprise environments. Our Reliability Engineering motto is: Enable Speed with High Availability.

 You should have a passion for automating as much as possible and constantly be on the lookout for areas where operational and code efficiencies can be improved. You will work directly with product engineering teams leveraging XP principles, and, when you aren't automating all the things, you will be proactively executing destructive tests, participating in "game day" exercises, and related activities to improve the operational readiness of your product(s).


MAJOR TASKS, RESPONSIBILITES AND KEY ACCOUNTABILITIES
70% - Delivery & Execution: 
Collaborates and pairs with other product team members (UX, engineering, and product management) to create secure, reliable, scalable software solutions 
Writes custom code or scripts to automate infrastructure, monitoring services, and test cases 
Writes custom code or scripts to do "destructive testing" to ensure adequate resiliency in production 
Creates meaningful dashboards, logging, alerting, and responses to ensure that issues are captured and addressed proactively 
Contributes to enterprise-wide tools to drive destructive testing, automation, or engineering empowerment 
Identifies product enhancements (client-facing or technical) to create a better experience for the end users 
Identifies unsecured code areas and implements fixes as they are discovered with or without tooling 
Contributes to foundational code elements that can be reused many times by a product 
Contributes to meaningful architecture diagrams and other documentation needed for security reviews or other interested parties 
Defines Service Level Objectives for product(s) to constantly measure their reliability in production and help prioritize backlog work 

20% - Support & Enablement: 
Fields questions from other product teams or support teams 
Monitors tools and participates in conversations to encourage collaboration across product teams 
Provides application support for software running in production 
Proactively monitors production Service Level Objectives for product(s) 
Proactively reviews the Performance and Capacity of all aspects of production: code, infrastructure, data, and message processing 
Triages high priority issues and outages as they arise 

10% - Learning: 
Participates in and leads learning activities around modern software design and development core practices (communities of practice) 
Proactively views articles, tutorials, and videos to learn about new technologies and best practices being used within other technology organizations 
Attends conferences and learns how to apply new technologies where appropriate

NATURE AND SCOPE
Typically reports to the Software Engineer Manager or Sr. Manager.

ENVIRONMENTAL JOB REQUIREMENTS
Environment: 
Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable.
Travel: 
Typically requires overnight travel less than 10% of the time.


ESSENTIAL SKILLS:
MINIMUM QUALIFICATIONS
Must be eighteen years of age or older.
Must be legally permitted to work in the United States.

Additional Minimum Qualifications:
Proficient in an object oriented programming language (preferably Java) 
Must be legally permitted to work in the United States

Education Required:
The knowledge, skills and abilities typically acquired through the completion of a bachelor's degree program or equivalent degree in a field of study related to the job.

Years of Relevant Work Experience: 3-7 years
Physical Requirements:
Most of the time is spent sitting in a comfortable position and there is frequent opportunity to move about. On rare occasions there may be a need to move or lift light articles.
Additional Qualifications:

Preferred Qualifications:

  • Proficient in production monitoring concepts and implementation including synthetic, real user, application performance, system, log, time-series, and dashboarding. Includes tools like appdynamics, dynatrace, newrelic, splunk, grafana, ELK, etc
  • Proficient in production systems design including High Availability, Disaster Recovery, Performance, Efficiency, and Security           
  • Proficient in a modern scripting language (preferably python)
  • Proficient in a modern infrastructure automation toolkit such as Puppet or Chef
  • Proficient in a Linux or Unix based environment
  • Deep understanding of modern microservice based architectures and operations
  • Experience in destructive testing methodologies and tools such as chaos monkey
  • Experience in CI/CD automation
  • Experience in a version control systems such as Git or SVN     
  • Experience in a cloud computing platform and the associated automation patterns it provides              
  • Experience in defensive coding practices and patterns for high-availability         
  • Exposure to a modern objected oriented programming language (preferably Java)        

Knowledge, Skills, Abilities and Competencies:Cultivates Innovation: Creating new and better ways for the organization to be successful 
Action Oriented: Taking on new opportunities and tough challenges with a sense of urgency, high energy and enthusiasm 
Business Insight: Applying knowledge of business and the marketplace to advance the organization s goals 
Collaborates: Building partnerships and working collaboratively with others to meet shared objectives 
Communicates Effectively: Developing and delivering multi-mode communications that convey a clear understanding of the unique needs of different audiences 
Drives Results: Consistently achieving results, even under tough circumstances 
Global Perspective: Taking a broad view when approaching issues; using a global lens 
Interpersonal Savvy: Relating openly and comfortably with diverse groups of people 
Manages Ambiguity: Operating effectively, even when things are not certain or the way forward is not clear 
Optimizes Work Processes: Knowing the most effective and efficient processes to get things done, with a focus on continuous improvement 
Self-Development: Actively seeking new ways to grow and be challenged using both formal and informal development channels 
Situational Adaptability: Adapting approach and demeanor in real time to match the shifting demands of different situations


We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.

Apply on the Company Site
Powered ByLogo

Featured Jobs[ View All ]

Featured Employers [ View All ]