Cloud Ops Engineer

laagencia · 30+ days ago
Negotiable
Full-time

POSITION SUMMARY
The Cloud Ops Engineer will support Amazon Web Services (AWS) and Linux/Windows environments. The Cloud Ops Engineer will be responsible for all aspects of the production lifecycle, maintenance, and administration, including but not limited to infrastructure automation, continuous integration and deployment, product release and support, running a scalable production environment for hosting the ARCOS platform, maintaining application/database availability, and ensuring continuous 24x7 production uptime of our services.

The Cloud Ops Engineer needs to be familiar with AWS, Apache, Tomcat, PostgreSQL, Oracle, Ansible, Jenkins, Jira, Confluence, and SaaS operations.

ESSENTIAL JOB FUNCTIONS

  • Design, develop, and maintain scalable AWS solutions and infrastructure, including but not limited to EC2, RDS, S3, DynamoDB, ElastiCache, and Route 53.
  • Develop tooling and processes to automate the deployment of SaaS-based applications and their underlying operating systems and infrastructure.
  • Perform PostgreSQL and Oracle database administration, including maintenance, troubleshooting, tuning, optimization, installation, upgrades, backup/recovery, and data migration.
  • Partner with Engineering, Development, Quality Assurance, Professional Services, and Technical Support to ensure the success of the assigned product offerings and schedules.
  • Engage in Agile team practices such as daily standups, backlog refinement, release planning, and sprint planning.
  • Coordinate configuration changes, installs, and upgrades with appropriate development teams and product owners while following company change control procedures.
  • Participate in capacity planning to determine future infrastructure needs.
  • Participate in 24x7 on-call responsibilities, maintaining the availability and performance of all customer-facing production services.
  • Triage and participate in the resolution of complex problems, including network connectivity issues, that span multiple tiers of application/infrastructure.
  • Implement monitoring and reporting capabilities to assist engineering in rapidly identifying issues.
  • Actively monitor supported systems and respond promptly to security or usability concerns.
  • Review application logs and analyze events using cloud-native services (e.g., CloudWatch, CloudTrail) or third-party SIEM tools (e.g., Splunk).
  • Upgrade systems and processes as required for enhanced functionality and security compliance.
  • Maintain product service level agreements.
  • Accurately document all processes and procedures for routine and non-routine tasks.
  • All other duties and responsibilities as assigned.

QUALIFICATIONS, REQUIREMENTS, AND SKILLS

  • Bachelor’s degree in Computer Science or related field, or equivalent work experience.
  • 4-5 years of system administration experience, ideally in global management and operations of highly trafficked production applications. Experience working in a 24x7 SaaS environment is preferred.
  • 4-5 years of experience designing solutions for and managing AWS services, including but not limited to EC2, RDS, S3, DynamoDB, ElastiCache, WAF/Shield, Route 53, IAM, and Directory Service.
  • Experience with Linux and Windows system administration, automation, and performance tuning.
  • Experience with configuration management and infrastructure as code tools such as Ansible and Terraform.
  • Experience with Apache, Nginx, Tomcat, and Node.js/PM2.
  • Experience with scripting languages, including Bash, Python, and PowerShell.
  • Knowledge of CI/CD technologies and best practices.
  • Knowledge of PostgreSQL, Oracle, Docker, Jira, and Confluence.
  • Advanced knowledge of system vulnerability management and security best practices.
  • Solid understanding of networking concepts and troubleshooting. 
  • Proven ability to work effectively with highly reliable, highly available mission-critical technologies, with demonstrated attention to detail and results while meeting deadlines.
  • Ability to manage deployment automation, SaaS operations, internal and external SaaS infrastructure, security, and cost management.
  • Solid understanding of technical issues and opportunities related to modern cloud infrastructure and operations.
  • Action-oriented, decisive approach to work, with a willingness to take a hands-on role when needed to ensure deliverables are met on time.
  • High energy, motivated self-starter with the ability to take direction and manage tasks with minimal supervision within an energized, collaborative, and entrepreneurial environment.
  • Excellent written and verbal communication skills.

Production Support/On-Call Duties:

As a key member of our engineering team, you will address escalated production issues from customer support. Your responsibilities will include:

  • Participating in a rotational on-call schedule to handle significant production issues.
  • Rapidly diagnosing and resolving technical challenges that arise in production.
  • Collaborating with customer support and engineering teams for seamless issue resolution.
  • Maintaining clear communication and documentation during and after incidents.
  • Leveraging these experiences to contribute to continuous process improvement.

Compensatory Time for On-Call Work:

We value work-life balance and recognize the extra effort required during on-call rotations. For hours spent actively working on-call, compensatory time off is provided, unless the law requires otherwise. This ensures your commitment is appropriately acknowledged. Please coordinate with your manager regarding the approval and scheduling of compensatory time, to align with team needs and workload. Your contribution is essential in maintaining the smooth operation of our systems and in upholding high standards of customer satisfaction.

ARCOS is committed to creating an environment of mutual respect where equal opportunities are available to all. We embrace the diversity of our team members and are dedicated to creating an inclusive environment for all employees. Discrimination will not be tolerated within our organization; we encourage people from all walks of life to apply. We stand behind the belief that the more diverse and inclusive we are, the more impactful our work will be. All employment is decided based on qualifications, merit, and business need.

Last updated on Oct 18, 2024
