Site Reliability Engineer II - Operational Readiness (Scale & Performance) (Hybrid)

HashiCorp · 30+ days ago

India - Bengaluru

Negotiable

Full-time

The Role

As a Site Reliability Engineer for the Operational Readiness team, you will play a critical role in enhancing the scalability, performance, and reliability of HashiCorp's cloud products. With at least 3 years of experience in site reliability engineering or a related field, you will lead efforts to identify, address, and mitigate operational challenges before they impact our customers. Your expertise in load testing, performance analysis, and system hardening will ensure that our services meet the highest standards of operational excellence.

You will play a pivotal role in enhancing our operational resilience and maintaining the reliability of our enterprise and cloud-based products. With a focus on overall quality, you will be at the forefront of ensuring high availability and performance across HashiCorp's offerings.

You will execute test plans and define system-wide strategies for product load and performance testing. You will work with a wide variety of tools and explore new avenues to ensure that all products meet the essential operational readiness criteria.

You will apply advanced troubleshooting techniques, such as chaos testing, to identify, organize, and advocate for novel solutions that remediate customer impact on complex interconnected systems.

Key Responsibilities

Implement best practices for system reliability, including proactive identification of potential failure points and the development of automated mitigations.
Design and execute comprehensive load testing strategies to identify performance bottlenecks and scalability limits across our cloud products.
Implement best practices and technologies to improve system resilience, ensuring high availability and fault tolerance through a chaos testing framework.
Work closely with engineering and product teams to integrate operational readiness into the development lifecycle, enhancing product stability and user satisfaction.
Build and refine tools and frameworks for automated testing, environment simulation, and incident reproduction, reducing manual effort and increasing test coverage.
Conduct in-depth analysis of testing results, documenting findings and making actionable recommendations for systemic improvements.
Develop and implement disaster recovery and backup strategies to ensure data integrity and system resilience.
Share your knowledge and expertise with team members, fostering a culture of learning and continuous improvement.

Ideal Candidate

3+ years of experience in SRE, systems engineering, or non-functional testing roles, with a focus on performance testing or system scalability.
Commitment to exploring a career in the Site Reliability Engineering field.
Proficiency in at least one programming or scripting language.
Good understanding of CI/CD processes and of maintaining quality pipelines.
Experience with version control systems (e.g., Git) and agile project management methodologies.
Exposure to cloud technologies (AWS, Azure, or GCP) and container technologies such as Nomad or Kubernetes.
Effective communication and collaboration skills, capable of working with cross-functional teams and articulating technical concepts to diverse audiences.
Experience with infrastructure as code (Terraform, CloudFormation) is a plus.
Understanding of monitoring and alerting systems is a plus.
Chaos testing experience is a plus.
Exposure to the disaster recovery domain is a plus.

Last updated on Aug 22, 2024
