Listing Description
About the Position
Do you want to work at a high-growth company where your impact is seen and rewarded? Are you looking for the autonomy to do your best work? We are seeking a Staff DevOps / Site Reliability Engineer to help shape the future of our production infrastructure. As a key contributor on our growing engineering team of 100, you will become an expert on the problems facing our engineers and help them move faster.
Responsibilities
- Maintain and improve robust monitoring of production services
- Investigate and fix system failures, and then share root-cause analysis with the rest of the team
- Implement robust CI/CD and testing pipelines for a wide range of services and tools
- Automate development workflows, infrastructure tasks, and security auditing
- Improve and optimize infrastructure through data-driven analysis
- Work closely with engineers/QA to build environments and solutions for rapidly evolving needs
- Handle infrastructure support requests from the engineering organization
- Help create a roadmap for the DevOps team to support a quickly growing engineering organization
- Administer infrastructure consisting of Windows and Linux servers and containers
- Drive the modernization of our Windows-based monolith, breaking it up into a modern infrastructure running in Kubernetes
Basic Qualifications
- Proficient in automating repetitive tasks with code
- 3+ years of industry experience in Windows systems
- 3+ years of industry experience in Linux systems
- 5+ years of industry experience in Software Engineering
- 5+ years of industry experience in SRE/DevOps engineering
- Experience maintaining infrastructure for services hosted in a cloud environment
- Excellent, proactive communication skills for working with remote engineers in Austin, San Francisco, and across the country
- Experienced in Git, Git workflows, and Git history maintenance
- Familiar with monitoring the health of applications running in the cloud
- Familiar with Continuous Integration (CI) and Continuous Delivery (CD) and the tools required to implement them
- Thrive in a fast-paced startup environment, ready to contribute from day one
Preferred Qualifications
- Experience automating and maintaining infrastructure for high-traffic services hosted in a multi-account AWS architecture
- Experience with Terraform, Ansible, or similar infrastructure-as-code or configuration-as-code (IaC, CaC) tools
- Experience building, optimizing, and automating CI/CD (Jenkins) flows for API, Web, and Data services
- Experience managing production loads on Redis, SQL Server, Elasticsearch, or similar technologies at scale
- Experience configuring and building automated monitoring systems that provide early warning to minimize service degradation and downtime
- Strong experience writing code across multiple languages and environments in a collaborative setting
- Experience with C#, .NET, JavaScript, and/or TypeScript
- Proficient in using Git (GitHub) in a multi-branch environment
- BS/MS in Computer Science or a related technical field
#LI-Remote
Listing Details
- Citizenship: Not Provided
- Incentives: Not Provided
- Education: Not Provided
- Travel: Not Provided
- Telework: Not Provided