
SRE Engineer

  • Full Time, onsite
  • Sky Solutions LLC
  • Hybrid: 3 days a week onsite required starting week 1, United States of America
Salary undisclosed


Hi,


Hope you are doing well.

I am looking for a candidate with experience as an SRE Engineer.

Title: SRE Engineer

Location: Fort Worth, TX (Only candidates already local to Dallas will be considered).

Duration: Long term

Address: 1 Transformation Way, Fort Worth, TX 76155

3 days a week onsite required starting week 1

Interviews will be 2 rounds and will include a Dynatrace assessment and an onsite final round.

Must be on our W2

Top 3 to 5 skillsets / years of experience:

  • 3+ years of experience as a developer in a technology organization.
  • 2+ years of experience working with Site Reliability Engineering (SRE), DevOps, or Infrastructure teams.
  • Familiarity with SRE practices such as incident management, SLOs/SLIs, and automation.
  • Experience with tools such as Dynatrace, Thousand Eyes, ServiceNow and similar tools.
  • Experience with Python and Java
  • Understanding of cloud platforms (AWS, Google Cloud Platform, Azure), CI/CD pipelines, and observability tools (Prometheus, Grafana, ELK Stack).
  • Previous experience working directly with SRE teams or DevOps.

Surrounding team/key projects:

  • Will join the existing Resiliency team
  • Key project will be increasing Resiliency presence and practices in IT

We are looking for a dedicated and experienced Developer to guide and facilitate our Site Reliability Engineering (SRE) teams.

As an SRE Developer, you will be responsible for enabling our SRE teams to deliver reliable and scalable services that ensure optimal performance and uptime of our production systems. You will work closely with the SRE engineers, product owners, and other cross-functional teams to ensure seamless collaboration, remove obstacles, and drive continuous improvement.

Key Responsibilities:

  • Assist SRE teams in defining and achieving goals by organizing and facilitating ceremonies such as daily stand-ups, sprint planning, sprint reviews, and retrospectives.
  • Align SRE activities with Agile methodologies, focusing on incident management, problem resolution, and reliability improvement.
  • Identify and remove impediments or blockers that may hinder the team's progress.
  • Track and analyze key performance indicators (KPIs) related to reliability, system performance, and team productivity. Report these metrics to stakeholders and leadership.
  • Drive continuous improvement initiatives across SRE processes, leveraging feedback from retrospectives and performance data.
  • Work closely with SRE engineers, product owners, and stakeholders to align the team's work with organizational goals.
  • Advocate for SRE best practices such as monitoring, alerting, automation, and system health reviews to ensure system stability and availability.
  • Coordinate and facilitate post-incident reviews, ensuring teams identify and implement action items to prevent future occurrences.

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • 3+ years of experience as a developer in a technology organization.
  • 2+ years of experience working with Site Reliability Engineering (SRE), DevOps, or Infrastructure teams.
  • Familiarity with SRE practices such as incident management, SLOs/SLIs, and automation.
  • Experience with tools such as Dynatrace, Thousand Eyes, ServiceNow and similar tools.
  • Experience with Python and Java
  • Excellent communication, leadership, and facilitation skills.
  • Understanding of cloud platforms (AWS, Google Cloud Platform, Azure), CI/CD pipelines, and observability tools (Prometheus, Grafana, ELK Stack).

Preferred Skills:

  • Previous experience working directly with SRE teams or DevOps.
  • Understanding of infrastructure-as-code tools like Terraform or CloudFormation.
  • Knowledge of containerization and orchestration technologies such as Docker and Kubernetes.

Language & Communication Skills:

  • Ability to communicate effectively, both verbally and in writing, with all levels within the organization
  • Ability to explain technical concepts and adjust messaging based on the audience, including non-technical groups
  • Ability to influence through outstanding interpersonal, collaboration, and negotiation skills
  • Ability to work well within a team environment, as well as independently
