As a Sr. DevOps Engineer, you will be a key member of a highly technical team of DevOps engineers responsible for designing, coding, and supporting the infrastructure and tools that underpin all SoundThinking Inc. production and development environments. Working closely with our development teams, you will focus on all phases of our operational systems, from design and development to maintenance, automation, and monitoring.
Essential Duties/Responsibilities:
- Partner closely with our Engineering teams to develop infrastructure and tools that assist in the automation and management of SoundThinking Inc. services. Throughout the software design cycle, maintain a keen focus on the scalability, observability, automation, reliability, security, and quality of our services as deployed in AWS.
- Build and implement tools and frameworks to automate our system and software deployments, including disaster recovery and multi-region replication, with a goal of minimizing mean time to recovery.
- Lead technology initiatives that enable others to deliver their services to our customers with a minimal amount of friction and a high degree of quality.
- Deliver internal infrastructure services such as monitoring, logging, and data services to our internal users.
- Through the use of GitOps, be responsible for building, maintaining, and automating the infrastructure components used to deliver SoundThinking services.
Minimum Qualifications:
- Bachelor’s degree, preferably in computer science or equivalent, and 7+ years’ experience with IT infrastructure, Linux administration, networking, and security, or 10+ years of equivalent experience.
- Five (5) or more years' experience working with AWS, including IAM, Route 53, VPC, EKS, ECR, EC2, S3, Security Groups, and other components. Some experience with Azure is a plus.
- Five (5) or more years' experience with Configuration Management and Infrastructure Management systems such as CloudFormation, Argo CD, Terraform, Ansible, Chef, Puppet, or others.
- Hands-on experience running and managing workloads in a containerized environment using Helm and Kubernetes, preferably running in EKS.
- Experience administering databases, preferably RDS Aurora or Postgres, and Redis.
- Solid understanding of microservices architecture in monorepo and polyrepo form.
- Practical and demonstrable skill with an administrative language such as Python, Bash, or Ruby. Some knowledge of Node.js, Java, or Go is beneficial.
- Hands-on experience with system and application metric collection and alerting services such as Graphite, Grafana, Prometheus, Sysdig, or others, with a keen focus on what makes a system observable.
- Understanding of and experience with continuous integration and continuous deployment patterns and tools, including Jenkins, TeamCity, GitLab CI, and GitHub Actions.
- A strong understanding of modern system, network, and service-related security best practices.
- Firm technical grasp of Linux administration, including protocols such as DNS, LDAP, SMTP, HTTP, TCP/IP, TLS, etc.
- Solid understanding of networking and distributed computing concepts.
- Superior troubleshooting skills.
- Excellent communication skills to support working with a distributed team located mostly in the US Eastern time zone.
- Keen attention to detail and excellent documentation skills, covering architecture, incidents, change control, diagrams, runbooks, and checklists. SoundThinking undergoes SOC 2 Type 2 audits, for which accurate documentation is essential.
- Willingness to participate occasionally in an on-call Tier 3 rotation as one of the engineering experts supporting our 24/7 operation.
- Must be able to pass a CJIS background check.
- Candidate must be a US Citizen or Green Card Holder.
Bonus Skills:
- Experience with CJIS compliance and CJI deployments in AWS GovCloud.
- Experience with non-relational databases such as MongoDB or Cassandra.
- Experience operating real-time analytics infrastructures such as Storm, Hadoop, HBase, or Amazon EMR.
Hybrid Workplace:
SoundThinking follows a hybrid schedule for employees who live within 50 miles of one of our office locations: Fremont, CA; Tucson, AZ; Washington, D.C.; and Iselin, NJ. Employees are expected to work onsite 3 days per week; the specific days depend on the office location.