Sr. Reliability Engineer

  • Company: Capital One
  • Posted: January 13, 2017
  • Reference ID: R17501
Plano 1 (31061), United States of America, Plano, Texas

Capital One is a diversified bank that offers a broad array of financial products and services to consumers, small businesses and commercial clients. We nurture a work environment where people with a variety of thoughts, ideas and backgrounds, guided by our shared Values, come together to make Capital One a great company and a great place to work.

As a Sr. Reliability Engineer, you will support the operation of Capital One Home Loans applications and infrastructure. You will be responsible for coordinating production support activities for all major systems and related subsystems.

Your role is critical to ensuring the integrity and operation of key business systems. As an essential member of the team, you will need a deep technical understanding of the supported architecture and environments. Your responsibilities include:


  • Ensure system availability and performance. 
  • Resolve incidents, events, problems and issues. 
  • Implement changes to applications and infrastructure. 
  • Configure and update appropriate monitors and alerts. 
  • Ensure systems meet Capital One standards for security and resiliency.
  • Script Chef recipes/cookbooks for Amazon Web Services (AWS).
  • Deliver AWS-based infrastructure solutions using AWS CloudFormation (JSON) and Chef (Ruby) for configuration management.
  • Migrate on-premises applications to AWS.
  • Code automation of various system builds and tasks (e.g. automating code builds and deployments).
  • Troubleshoot a variety of issues for hosting platforms capable of running a variety of frameworks (Java, Node.js, Ruby, PHP, Python).

Basic Qualifications:

Bachelor's degree or military experience

At least 3 years of experience providing enterprise Linux-based system administration

At least 2 years of experience working with ITIL foundations for Incident, Change and Problem management 

At least 2 years of experience configuring system and application monitoring

At least 2 years of experience working with AWS cloud automation

At least 2 years of experience with Chef or Puppet

At least 1 year of experience with Git, Jenkins/Hudson, or another code repository

At least 2 years of experience with Shell or at least 1 year of experience with Ruby or Python

Additional Preferred Qualifications: 

1+ year of experience in an enterprise cloud environment 

1+ year of experience working with Incident Management

1+ year of experience working with Release Management

2+ years of experience working with Unix shell scripting

At this time, Capital One will not sponsor a new applicant for employment authorization for this position. 
