Browse
Employers / Recruiters

Site Reliability Engineer (SRE)

invisible-ai · 30+ days ago
Remote
Negotiable
Full-time
Continue
By pressing the button above, you agree to our Terms and Privacy Policy, and agree to receive email job alerts. You can unsubscribe anytime.
At Invisible AI, we are building the future of computer vision. Today, our core focus is on developing an end-to-end platform that can digitize manufacturing operations. We deploy edge AI cameras to digitize all steps of manual assembly work which helps people-driven manufacturing be accurate, reliable, and safe. Coming from the world of self-driving cars, the founders of Invisible AI have years of experience in building and deploying large-scale AI & Machine Learning pipelines. Join us and help build a company that will deliver the endless possibilities of computer vision to real-world customers!

As a Site Reliability Engineer, you will build the technology to enable our platform to deploy, run, and monitor Invisible AI’s software at scale across tens of independent deployments and thousands of devices. The SRE works closely with all other engineering teams and owns internal tools to enable faster development and deployment, like secure ephemeral debug environments, streamlined access controls, CI/CD systems, and a custom in-house device management platform for device configuration and software releases.

Responsibilities:

  • Design, build, and maintain scalable and resilient infrastructure on the edge.
  • Develop automation and infrastructure-as-code solutions using Terraform, Ansible, and scripting languages (Python, Bash).
  • Deploy and manage containerized applications using Docker and related technologies.
  • Ensure system observability by building and optimizing monitoring systems, particularly using Prometheus.
  • Troubleshoot and optimize Linux-based systems (e.g., Red Hat, CentOS, Ubuntu).
  • Collaborate with security teams to implement robust security practices and ensure compliance with best practices.
  • Work closely with software engineers to improve system performance, reliability, and deployment pipelines.
  • Support and maintain networking infrastructure, including troubleshooting protocols and configurations.
  • Manage cloud and on-premise infrastructure, with a focus on automation and scalability.
  • Contribute to incident response, postmortems, and process improvements.

Requirements:

  • 5+ years of experience building and managing infrastructure at scale, particularly on the edge.
  • Proficiency in Python, Docker, Linux systems, and scripting (Bash, Python).Strong expertise with infrastructure automation tools (Terraform, Ansible).Experience managing observability and monitoring systems, particularly Prometheus.
  • Deep understanding of networking concepts and protocols.
  • Familiarity with cloud platforms (AWS, Azure, Google Cloud) is a plus.
  • Experience with Windows Services/VMs is a plus.
  • Excellent problem-solving skills, with attention to detail.
  • Strong communication and collaboration skills to work across teams.
  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent experience.
Our compensation package plays a big part in how we value your impact on our mission. Our base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The estimated base salary guideline range for this role is between $110,000-$170,000 and may be modified. This will vary based on various factors, including market and individual qualifications objectively assessed during the interview process. In addition to base salary, your compensation package will include additional components such as equity, sales incentive pay (for sales roles), and benefits. Invisible AI is an equal-opportunity employer. We do not discriminate based on age, ethnicity, gender, nationality, religious belief, or sexual orientation.

Last updated on Jan 21, 2025

See more

About the company

More jobs at invisible-ai

Analyzing

San Francisco, California

 · 

30+ days ago

Tulsa, Oklahoma

 · 

30+ days ago

More jobs like this

Analyzing

New York, New York

 · 

30+ days ago

San Francisco, California

 · 

30+ days ago

Web Engineer
U
Upworthy ·  Viral content for social good

 · 

30+ days ago

Remote

 · 

30+ days ago

Remote

 · 

30+ days ago

Des Moines, Iowa

 · 

30+ days ago

South Jordan, Utah

 · 

30+ days ago

Tampa, Florida

 · 

30+ days ago

Web Site Designer
TT
The Talently ·  AI recruitment platform

California

 · 

30+ days ago

Apttus CPQ Developer
C
crjdnwsnowo2i4nz45b1teboszrxlg0351vr73gpqw7yanury9u287prckhdnkww

Minneapolis, Minnesota

 · 

30+ days ago

Developed by Blake and Linh in the US and Vietnam.
We're interested in hearing what you like and don't like! Live chat with our founder or join our Discord
Changelog
🚀 LaunchpadNov 27
Create a site and sell services based on your resume.
🔥 Job search dashboardNov 13
Revamped job search UI with a sortable grid, live filtering, bookmarks, and application tracking.
🫡 Cover letter instructionsSep 27
New Studio settings give you control over AI output.
✨ Cover Letter StudioAug 9
Automatically generate cover letters for any job.
🎯 Suggested filtersAug 6
Copilot suggests additional filters above the results.
⚡️ Quick applicationsAug 2
Apply to jobs using info from your resume. Initial coverage of ~200k jobs in Spain, Germany, Austria, Switzerland, France, and the Netherlands.
🧠 Job AnalysisJul 12
Have Copilot read job descriptions and extract out key info you want to know. Click "Analyze All" to try it out. Click on the Copilot's gear icon to customize the prompt.
© 2024 RemoteAmbitionAffiliate · Privacy · Terms · Sitemap · Status