Browse
Employers / Recruiters

DevOps Engineer/Site Reliability Engineer (SRE with Azure Cloud)

datamaxis · 25 days ago
Negotiable
Full-time
Continue
By pressing the button above, you agree to our Terms and Privacy Policy, and agree to receive email job alerts. You can unsubscribe anytime.

The ecommerce Platform Operations team is responsible for the stability, reliability, release and deployment of our B2B & B2C ecommerce platforms. The team’s primary function is to increase the efficiency of the organization through well designed automation and infrastructure. As a Site Reliability engineer you will work closely with various infrastructure & application development teams to increase stability and reliability via the enablement of various Telemetry concepts. You will also be responsible for effective operations of the ecommerce platform via efficient automation & execution of operational processes. If you’re someone who doesn’t mind participating in on-call support, and enjoys troubleshooting production issues and implementing remediation, this position is for you!

Required skills:

• Expert level experience with operating ATG Commerce ecommerce platform (OR) building custom Java / Java EE customer-facing solutions on Azure Cloud environment (AKS).
• 3+ Years Azure Experience
• Hands on experience with containerization, Kubernetes, and micro services.
• Experience with Cloud infrastructure and application monitoring following methodologies such as RED or USE.
• Familiarity with APM monitoring tools such as Splunk APM, AppDynamics and/or Azure AppInsights
• Familiarity with Infrastructure monitoring tools such as Graphana, Prometheus, Azure monitor, Log Analytics (KQL queries)
• Experience with log collection tools and analysis, as well as infrastructure performance and optimization practices
• Experience with DevOps automation platforms such as Jenkins, Artifactory, ACR, and/or Azure DevOps
• Experience with CI/CD provisioning and managing Azure Infrastructure
• Participate in after-hours on-call rotation and after-hours maintenance window activities as needed
• Experience performing Root Cause Analysis (RCA) for application and infrastructure related issues
• Solid grasp of various performance monitoring methodologies, as well as 2+ years of hands-on experience configuring monitoring tools such as Azure Application Insights, New Relic, and Splunk is required. Strong experience with other telemetry tools, including AppDynamics, Extrahop, vSphere, Solarwinds Orion, SAM, etc. will be considered.
• Top candidate will have experience or thorough understanding of incident workflows (preferably using New Relic). Must have experience enriching alerts for faster root-cause detection and incident resolution.
• Must be experience configuring monitors for business transactions, service end points, etc., as well as setup health rules for triggering alerts.
• Detailed knowledge of relational databases, Ex: MS SQL, MySQL (OR) NoSQL DB like Cosmos DB. Must be able to construct SQL queries and configure them with telemetry.
• Strong scripting (bash, python, shell) skills.
• Self-starter with the ability to quickly learn new tools and tool features. Must be able to handle multiple tasks and priorities within a fast-paced work environment
• Must be highly motivated and dependable with excellent communication skills.
• Bachelors in Computer Science or other four-year degree in a relevant field is required

Preferred Skills:

• Experience using Terraform to perform infrastructure as code
• Deep working knowledge with Azure networking, Application Gateway, APIM, IAM Policy and network security.
• Able to deploy and manage Azure storage.
• Experience with Azure Active Directory management and design experience a plus
• Production support experience with E-commerce websites.
• Experience with tracking, measuring, and reporting KPIs like MTBI, MTRS, MTTD, etc.

Last updated on Jun 26, 2024

See more

About the company

More jobs at datamaxis

Analyzing

Pune, Maharashtra

 · 

30+ days ago

Chennai, Tamil Nadu

 · 

30+ days ago

Mundelein, Illinois

 · 

30+ days ago

Pune, Maharashtra

 · 

30+ days ago

Remote

 · 

30+ days ago

Developed by Blake and Linh in the US and Vietnam.
We're interested in hearing what you like and don't like! Live chat with our founder or join our Discord
Changelog
🚀 LaunchpadNov 27
Create a site and sell services based on your CV.
🔥 Job search dashboardNov 13
Revamped job search UI with a sortable grid, live filtering, bookmarks, and application tracking.
🫡 Cover letter instructionsSep 27
New Studio settings give you control over AI output.
✨ Cover Letter StudioAug 9
Automatically generate cover letters for any job.
🎯 Suggested filtersAug 6
Copilot suggests additional filters above the results.
⚡️ Quick applicationsAug 2
Apply to jobs using info from your CV. Initial coverage of ~200k jobs in Spain, Germany, Austria, Switzerland, France, and the Netherlands.
🧠 Job AnalysisJul 12
Have Copilot read job descriptions and extract out key info you want to know. Click "Analyze All" to try it out. Click on the Copilot's gear icon to customize the prompt.
© 2024 RemoteAmbitionAffiliate · Privacy · Terms · Sitemap · Status