Are you interested in building the technical foundation of the worldwide transition to clean energy? Do you enjoy working with a highly motivated and talented team to deliver mission-critical software? Voltus is growing our Site Reliability Engineering (or “Platform”) team to help deploy, manage, troubleshoot, and enhance our Platform and tools for our internal and external customers.
As a Site Reliability Engineer, you will be responsible for deploying and maintaining our core Platform, which consists of HashiCorp’s Nomad, Consul, and Vault systems in AWS. In addition, you will help manage and maintain our monitoring systems, which currently include Prometheus and Datadog.
You will build innovative automated solutions and tools to help debug and resolve problems in production and prevent them from recurring. Further, you will proactively use monitoring data, logs, and trend analysis to seek out system weaknesses and fix them before they cause production issues.