Site Reliability Engineer (Remote)
Title: Site Reliability Engineer
Job Type: Long Term Contract
Location: 100% Remote work
JOB REQUIREMENTS:
Experience setting up an SRE practice
Must be hands-on. The role focuses on automation and operational support rather than infrastructure setup.
Must know and apply SRE best practices
Experience with pipelines is required, both CI/CD and data ingestion pipelines
Working knowledge of the following is needed only to the extent required to support them: Immuta, Starburst, Collibra, Databricks, Alteryx, and Tableau
Experience mentoring others
Experience working in an Agile environment
Experience as an SRE in the following:
Stability of the application: Tuning observability and configuration settings to meet customer expectations.
Lead with data: Have a data-driven mindset.
Empowering our users and engineers: Ensuring that tools and pipelines are configured according to best practices, and educating support staff and engineers on those practices.
Automation: Where feasible, automate tasks and processes to reduce engineering toil and errors.
- CI/CD Tools: e.g., Jenkins, GitLab
- Familiarity with AWS. The SRE will not be focused on building the infrastructure but must understand it to support it.
- Orchestration and environment management tools (Puppet, Kubernetes, Ansible, Terraform)
- Monitoring tools (Splunk, Dynatrace, Datadog)
Thorough understanding of APIs, gateways, orchestrators, databases, networking, monitoring, configuration management and security best practices for a production environment.
Experience programming and scripting on UNIX/Linux (e.g., Python or Bash).
Experience implementing data and advanced analytics solutions, plus SaaS or related cloud experience.
Pluses:
Experience working with FedRAMP