Job Summary
Key Skills and Attributes Required
- Knows their way around a Unix/Linux shell, can write shell scripts, and understands Linux internals
- Experience monitoring and operating large-scale production systems on linux based environments and in cloud based environments such as AWS
- Experience debugging complex operational problems, particularly in a microservice environment
- Experience with usage and management of container based technologies such as Docker or Kubernetes
- Experience with TCP/IP networking, troubleshooting on L3/4/7
- Excellent communication skills, both verbal and written
- Experience with software engineering, software development, and system operations
- Knows Python, Java, Node.js, Go or similar
- Understands messaging between services
- Has hands-on experience using source control (Git, GitHub) and feature branching strategies
- Has experience with a variety of open-source databases (MySQL, Postgres, Redis, Cassandra, etc.)
Preferred
- Experience with DevOps engineering or SRE
- Experience with monitoring and observability such as with Datadog, New Relic and Nagios
- Experience automating infrastructure, testing, and deployments using tools like or Terraform
- Experience with configuration management, such as with Puppet, Ansible or Chef
- Monitoring and troubleshooting service issues on AWS
- Understands the idea behind Chaos Engineering