Epicareer Might not Working Properly
Learn More
J

SRE DevOps Engineer

Salary undisclosed

Apply on

Availability Status

This job is expected to be in high demand and may close soon. We’ll remove this job ad once it's closed.


Original
Simplified

Looking only for independent consultants

Who Are We Looking For?

We are seeking a DevOps Engineer to be a pivotal and transformational member of our engineering team. In this role, you will work closely with agile teams to analyze, design, build, and test high-quality cloud deployment methodologies and systems. As part of the team under the leadership of architects, you will play a key role in building, migrating, and operating cloud-based platforms.

This position holds full responsibility for managing our platform and application pipelines, overseeing the flow of code updates, and creating/monitoring highly available cloud infrastructure. You will gain hands-on exposure to multiple technologies, focusing on cloud platform services.


Technical Skills Required:

  • Experience:

    • 4+ years of overall experience, with 2+ years in a Site Reliability Engineer (SRE) role managing IaaS, PaaS, and microservices on AWS Cloud Platform.
    • Strong background in monitoring, troubleshooting, and providing application/infrastructure support.
  • Key Expertise:

    • Release engineering, CI/CD automation, and build tools.
    • Performing health checks and proactive issue identification (alerts, verifications, etc.).
    • Debugging and fixing infrastructure and application issues alongside engineering or DevOps teams.
    • AWS/OCI platform hands-on experience, including services like EC2, S3, VPC, RDS, CloudTrail, and EKS.
    • Integration with tools like Code Deploy and GitHub Actions.
    • Expertise in monitoring tools like CloudWatch and Datadog.
    • Experience with infrastructure automation tools like CloudFormation (CFT) and Terraform.
  • Cloud and SRE Skills:

    • Proficient in Infrastructure as Code, Cloud Networking, Containerization, and SRE principles.
    • Migrating and implementing applications from on-premise to cloud environments.
    • Hands-on experience with Kubernetes, AWS CloudFormation, and Shell Scripting.

Process and Soft Skills:

  • Strong understanding of ITIL practices, including Change Management and Incident Management.
  • Exceptional communication and collaboration skills, with a self-starter attitude.
  • Practical exposure to emerging cloud database technologies.
  • Proven ability to work independently in a matrix organization with multiple managers.
  • Keeping pace with industry trends, technological innovations, and evolving customer requirements.

Responsibilities:

  • Support rapid development and engineering productivity via CI/CD pipelines and build tools.
  • Monitor application and infrastructure health, proactively resolving issues.
  • Plan and implement infrastructure capacity, upgrades, and monitoring.
  • Participate in daily standups, production reviews, and on-call rotations for escalations.
  • Contribute to the design and improvement of deployment architecture, focusing on reliability, high availability, and efficiency.
  • Research and customize tools to enhance observability and resilience.
  • Maintain comprehensive SRE-related documentation, including solution repositories and Root Cause Analysis reports.

Preferred Certification:

  • AWS Public Cloud Certification (optional but preferred).

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
Report this job