NinjaTrader is an investor-backed, growth-stage fintech company with an award-winning platform and over 1.7 million users. We are building products and services that empower active traders to easily analyze and react to data from the world's leading financial markets. Located in Chicago, our unique employee-centric company culture is one that our team finds inviting, energizing and fun. Please visit www.ninjatrader.com to learn more about our business.
We are seeking a highly skilled and experienced engineering leader to join our Platform team, leading the Cloud Infrastructure and SRE team. You will help us scale our cloud infrastructure for continuous and programmatic optimization of cloud resources, build and support a robust incident response program, and drive down costs while enhancing performance and utilization. As a technical leader in our technology organization, your team's work will have a profound impact on our core high-throughput, low-latency trading application, directly influencing our business's bottom line.
In this role, you will:
- Mentor and manage a team of SREs, fostering a culture of collaboration, innovation, and excellence in execution
- Own technical decisions for the team, ensuring alignment with developers and compliance with security policies and industry standards
- Implement and maintain robust operational practices, including incident management, monitoring, alerting, and capacity planning
- Coordinate on-call support across global time zones, ensuring 24/7 coverage and efficient handovers
- Lead initiatives related to the design, deployment, and maintenance of critical infrastructure components
- Oversee release processes and ensure smooth deployments, minimizing downtime and impact on users
- Conduct thorough post-incident reviews, identifying root causes and implementing preventive measures
- Hire the best talent to help build and grow NinjaTrader’s infrastructure team
Key Position Requirements:
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience
- 8+ years of progressive engineering experience in Site Reliability or adjacent disciplines (DevOps, Platform, Backend Engineering, etc.), with a strong background in cloud service providers, ISPs, or similar service-oriented networking companies
- 3+ years of management experience with a consistent track record of managing technical and geographically distributed teams, including coaching, performance management, career development, and hiring
- Production experience supporting a 24/7 cloud-based environment and leading Incident Management at scale
- Exceptional troubleshooting, debugging, and diagnostic skills for cloud and web-based technologies using industry-standard observability tools and frameworks
- Proven experience in designing, deploying, and managing large-scale cloud infrastructure
- Experience writing production-quality code in a major programming language such as Java, Scala, Python, C++, or Go
- Strong scripting skills in languages such as Python, Bash, or equivalent
- In-depth knowledge of networking, security, and identity management in cloud environments
- Excellent problem-solving and communication skills, with the ability to articulate technical problems concisely to a non-technical audience
Our Core Benefits Include:
- 15+ days PTO per year
- 7 paid holidays annually
- 401k with Company Match
- Health, Vision, Dental Coverage
- Life and Disability Insurance covered 100% by NinjaTrader
- And more!
We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal opportunity workplace.
•
Last updated on Aug 22, 2024