Team Lead, Site Reliability

kubra · 30+ days ago
Negotiable
Full-time
Are you an experienced Site Reliability Engineer with a passion for enhancing platform stability, reliability, and efficiency?

KUBRA is growing, and we're looking for a skilled Team Lead, Site Reliability Engineer, to guide our DevOps team in optimizing our customer experience management platforms.

In this dynamic role, you will work collaboratively with cross-functional teams to apply SRE principles and drive continuous improvement. Your technical expertise will be pivotal in identifying potential issues, resolving complex problems, and leading technical and business discussions. You will leverage your experience in IT Service Delivery and Management to standardize operations, enhance service levels, and support technology system evolution.

This is a hybrid opportunity in Tempe, AZ.

What you get to do every day:

  • Ensure that infrastructure and applications perform within established Service Level Agreements (SLAs) and Service Level Objectives (SLOs).
  • Maintain well-documented standards and best practices to ensure services are built for high availability and security.
  • Implement appropriate automation and observability to achieve low and continuously improving mean time to recovery (MTTR) for service-impacting incidents.
  • Document any incidents thoroughly, along with corresponding problem records and corrective actions.
  • Participate in the Architectural Review Process for new and existing services, ensuring compliance with high-availability, observability, security, and cost efficiency standards.
  • Enhance governance processes to ensure all platform components meet current standards.
  • Lead root cause analysis for major incidents, communicating with senior stakeholders, driving problem-solving, and debugging using best practice techniques.
  • Design and conduct fault injection experiments to identify potential weak points in high-availability architecture and work with engineering teams to remediate findings.
  • Collaborate with engineering teams to optimize infrastructure for security, resiliency, and cost targets based on collected feedback.
  • Document processes and maintain records related to infrastructure procedures and strategies, ensuring appropriate alerts and support procedures are in place for quick incident remediation.

What kind of person should you be?

  • Adept at solving complex technical challenges and devising effective solutions.
  • Meticulous attention to detail to ensure high standards of availability and security.
  • Team player with strong interpersonal skills who collaborates well with others.
  • Adaptable and flexible with new technologies and environments.
  • Proactive at identifying and resolving issues before they escalate.
  • Effective communicator, capable of explaining complex technical issues to both technical and non-technical audiences.
  • Consistently committed to high-quality work.
  • Resilient under pressure, performing well in fast-paced environments.
  • Proven leadership and team management skills.

What skills do you need?

  • Bachelor’s degree in Computer Science, Engineering, Information Technology, or equivalent experience.
  • 5+ years of experience in site reliability engineering or a related field.
  • Proven leadership and team management experience.
  • Experience with systems programming languages, such as Go or Python, and shell scripting.
  • Proficient with Terraform and infrastructure as code principles.
  • Demonstrated proficiency in public cloud environments, particularly AWS.
  • Hands-on experience with Kubernetes management within AWS EKS.
  • Experience with CI/CD automation tools, such as CircleCI and ArgoCD.
  • Experience with monitoring and logging using tools like Prometheus, Grafana, OpenTelemetry, CloudWatch, and Honeycomb.
  • AWS and Kubernetes Certifications (Solutions Architect, SysOps Administrator, DevOps Engineer, CKA, CKS, CKAD, KCNA) are desirable.

What you can expect from us

  • Award-winning culture that fosters growth, diversity and inclusion for all
  • Paid day off for your birthday
  • Access to LinkedIn learning courses
  • Bi-annual performance-based bonus
  • Continued education with our education reimbursement program
  • Flexible schedules
  • Free unlimited access to our refreshment stations (fully stocked with tea, coffee and other beverages)
  • Two paid days for volunteer opportunities
  • A free premium membership for Headspace, an app geared towards mental health and wellbeing
  • Access to Perkopolis retail discounts
  • Generous benefit coverage with low premiums (+ a Health Care Spending Account)
  • RRSP Matching
KUBRA is an equal opportunity employer dedicated to building an inclusive and diverse workforce. We will provide accommodations during the recruitment process upon request. Information received relating to accommodation will be addressed confidentially. We thank all applicants for their interest; however, only candidates under consideration will be contacted.

While we value the skills and experiences listed in our job requirements, we also recognize that talent comes in many forms, and welcome applications from candidates who meet most but not all specified requirements. If you possess a strong desire to learn and grow in a dynamic work environment, apply now!

KUBRA is a fast-growing company that delivers customer communications solutions to some of the largest utility, insurance, and government entities across North America. KUBRA offers billing and payments, mapping, mobile apps, proactive communications, and artificial intelligence solutions for customers. With more than 1.5 billion customer interactions annually, KUBRA services reach over 40% of households in the U.S. and Canada. KUBRA is an operating subsidiary of Hearst.
 
Our office is small enough to allow creative individuals to flourish, yet large enough to provide long-term stability. We place a tremendous amount of responsibility on our team members to be productive, focused and self-motivated. We offer a casual work environment, competitive compensation and a stellar benefits program. 

KUBRA does not typically provide immigration-related assistance, including employment-based work visa (e.g., H-1B) sponsorship, work permit applications and extensions, permanent residence (green card) sponsorship, LMIA applications, or permanent residency nominations. Candidates must ensure they have legal authorization to work in the U.S. or Canada. All sponsorship determinations are made case by case, based on business need.

Last updated on Sep 5, 2024
