This role will be responsible for operational delivery and support of production cloud infrastructure for dedicated cloud storage offering, focusing on proactive monitoring, rapid response, and Tier 2 support for our iCloud storage services. You will be driving to improve the reliability and performance of our Operations.
You will work as an IC as part of a small team of Operations Engineers & tools developers and work with our engineering teams to deliver, build and operate the next generation of StaaS (Storage as a Service) focusing on automation, availability and performance. You will diagnose and resolve latent and systemic reliability issues across entire stack: hardware, software, services, application and network working closely with engineering teams. Drive standardization efforts across multiple disciplines and services.
Roles and Responsibilities
+ Document, communicate, evangelize, advocate for, and optimize cross functional system-level designs
+ Create and maintain technical documentation for operational readiness
+ Build, tune, troubleshoot and document systems in high availability Clusters.
+ Create and maintain security best practices
+ Configure and maintain virtual networking, including load balancers, firewalls and switches
+ Become a solid contributor on our team, and build, extend and maintain some of the key infrastructure that powers our Private Cloud platform
+ Take ownership of important components of the architecture that power our Private Cloud
+ Provide troubleshooting expertise for virtualization performance and other issues
+ Train and educate others within Technology about Cloud storage technologies.
+ Solve business needs with technology by evaluating different technology options and vendor products.
+ 6+ years industry experience with Bachelor’s Degree in CS or similar field of study OR work equivalent
+ Linux/Unix expert, preferably within a production IaaS datacenter environment and familiar with industry best practices
+ Experience with scripting (ruby/python/perl/awk/shell/etc…)
+ Experience working in a 24X7 production environment to deliver QOS and maintain SLAs
+ Experience working on large distributed and highly scalable systems
Additional Preferred Qualifications
+ Some Hands-on experience with any configuration management tools
+ Experience with firewalls, VPN, routing, switching, load balancers, monitoring, security and DNS plus
+ Understanding of Distributed systems
+ Knowledge of systems management concepts, processes and standards
When you choose our company, you join a diverse world of innovative thought leaders. At our core is a commitment to workplace diversity, the sustainability of our planet, and community corporate involvement. We offer highly competitive salaries, bonus programs, world-class benefits, and unparalleled growth and development opportunities-all to create a compelling and rewarding work environment.
We are an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy), sexual orientation, gender identity and/or expression, national origin, protected veteran status, disability, genetics, or citizenship status (when otherwise legally authorized to work) and will not be discriminated against on the basis of such characteristics or any other status protected by the laws or regulations in the locations where we operate. We encourage applicants of all ages.
**Critical Hiring Criteria:**
Engineering - Software
203 - INFRASTRUCTURE MANAGEMENT GROU
US - Washington - Seattle