Job Title: Site Reliability Engineer
Location: Houston, TX (Hybrid)
Duration: Full time
JOB DESCRIPTION:
SRE is a critical and visible role, central to running a multi-tiered cloud infrastructure, applications and workloads across public, private and hybrid cloud environments. SRE’s are required to have in-depth knowledge of Cloud technologies. SRE’s collaborate with development engineers, architects, technical leads and IT engineers to ensure uptime for cloud applications. SRE’s are expected to build and use tooling, automation, scripting and latest best practices to ensure services remain up and running, performant, resilient and secure.
Responsibilities:
Deploy and configure new Public Cloud tenants via automation – Jenkins, Terraform, Ansible, gitlabci, atlantis.awx (ansible tower)
Utilize the most recent technologies to automate all the current manual tasks.
Kubernetes and docker understanding and troubleshooting.
Provide day-to-day support to existing customers and ensure that the team is always exceeding their expectations.
Develop system health metrics for both real time monitoring and usability recommendations.
Enforce best practices for security and reliability.
Participate in security initiatives, including access control and vulnerability testing.
Maintain documentation of the infrastructure and suggest areas for improvement.
Assist in maintaining platform availability to defined levels.
Troubleshoot and address infrastructure issues as necessary.
Collaborate with the Cloud Automation team on shared objectives on future desired state.
Assist in the validation of new automations in Azure and AWS if the need arises.
Investigate new technologies and methodologies to better support the product.
Coach and mentor other team members as needed.
Participate in an on-call rotation as required.
Requirements:
A bachelor’s degree in a technical field and 6+ years of professional work experience between IT and Public Cloud operations or customer-oriented environments (at least 4 years Cloud experience)
Experience with Public Cloud PaaS/SaaS solutions such as App Services, App Insights, Storage Accounts, Resource Groups, and monitoring tools, EC2, S3, route53, IAM
Good debugging and troubleshooting understanding in distributed systems.
Ability to understand and develop CI/CD pipelines for automations.
Excellent interpersonal and communication skills and a team player
General knowledge of designing and implementing GUI-based web pages/dashboards aimed to gather and present information to account, sales, and other stakeholder teams.
Strong System administration skills: Windows and Linux. Ability to troubleshoot and solve problems.
Strong coding skills: Jenkins, Ansible, Terraform, VBA, Bash, PowerShell, JavaScript, Python.
Strong container skills: Docker and Kubernetes administration. Ability to troubleshoot and solve problems.
Strong ID management skills: Okta, VIDM, Horizon View, Federation, users/groups management, directory integration, MFA, SSO, monitoring, automation workflows, profile management, application integration. Ability to troubleshoot and solve problems in all specified skills.
Excellent Git skills
Strong experience in AWS/Azure resources and associated managed services.
Thank you so much.
Best regards,
Sincerely Yours,
Rahul Jaiswal
Technical Recruiter
Diverse Lynx, LLC
300 Alexander Park Suite # 200
Princeton, NJ 08540
Desk: +1 732-452-1006 Ext-557
URL: http://www.diverselynx.com.
Note: Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence, and performance of the essential functions of their positions. We promote and support a diverse workforce at all levels in the company. This is not an unsolicited mail and if it is not intended for you or you are not interested in receiving our e-mails please forward this email to remove@diverselynx.com with "remove" in the subject line and mention all the e-mail addresses to be removed with any e-mail addresses, which might be diverting the e-mails to you. We are extremely sorry if our email has caused any inconvenience to you.
Last updated on Oct 20, 2023
30+ days ago
Dallas, Texas
·30+ days ago
West Chester, Pennsylvania
·30+ days ago
Seattle, Washington
·30+ days ago
Sunnyvale, California
·30+ days ago
New York, New York
·30+ days ago
San Francisco, California
·30+ days ago
30+ days ago
Remote
·30+ days ago
Remote
·30+ days ago
Des Moines, Iowa
·30+ days ago
South Jordan, Utah
·30+ days ago
Tampa, Florida
·30+ days ago
California
·30+ days ago
Minneapolis, Minnesota
·30+ days ago