Interview: Phone and skype
Conversion Salary : 160-170K
Description:I have a current requirement for Production Operations Manager in Redwood City, CA. This role will be leading a team of 10 Systems Administrators. Client seeking an experienced, hands-on Linux Systems Administration manager with extensive experience designing, deploying, and administering high-availability systems that support SaaS applications with 24x7 100% uptime requirements.
The ideal candidate will have significant experience and major accomplishments managing high-caliber teams of Linux Systems Administrators and a great track record of designing, deploying, maintaining, and continuously improving uptime, availability, performance, efficiency, and automation in critical production and non-production environments supported by thousands of Linux servers.
Job Responsibilities and Duties:- Manage a team of Linux Systems Administrators whose primary duties include:
- Day-to-day systems management and provisioning
- Front-line handling of product, system, and application alerts
- Troubleshooting
- Capacity planning
- Documentation of Standard Operating Procedures
- Developing tools to automate manual procedures and increase efficiency
- Buildout of systems and platforms to support scaling of existing environments, new applications, and initiatives, leveraging existing and new automation frameworks (e.g. Puppet, Foreman, etc.)
- Continuous improvement and optimization of monitoring and analytics systems
- Performance optimization
- Cost reduction
- Lead team and cross-team meetings (9-10 direct reports)
- Manage assignments for on-call and frontline duties
- Ensure that critical events and incidents are resolved in a timely manner
- Ensure adherence to production change control policies, identifying opportunities for process improvement and overall efficiency
- Cross-train team members to prevent knowledge and expertise gaps and single points-of-failure
- Work closely with Engineering teams to ensure that infrastructure required to support new product rollouts and capacity expansions are ready well in-advance of need
- Work closely with Customer Support teams to address customer issues in a timely fashion
Required Skills, Knowledge, and Experience: - 4+ years' management experience, with demonstrated successes in recruiting, retaining, and managing top talent
- 5+ years' experience operating SaaS, eCommerce, or Web site environments that have 100% uptime requirements
- 7+ years' experience administering RedHat Enterprise Linux Systems
- General working knowledge of MySQL
- Good understanding of network protocols and tools
- Expert-level scripting skills in in bash, python, or equivalent
- Excellent verbal and written communication skills
Nice-to-haves:- Knowledge and experience with Network Appliance Storage
- Knowledge and experience with CDNs (e.g. Akamai)
- Knowledge and experience with F5 load balancers (BigIP LTM)
- Knowledge and experience with Client blade and rackmount server management tools (VirtualConnect, SIM, OneView, etc.)
- Knowledge and experience with virtualization products and technologies, especially VMware and XenServer
Required Education and Certifications:- BS in Computer Science or equivalent discipline
- RedHat Certified System Administrator (RHCSA)
- RedHat Certified Engineer (RHCS) strongly desired