Epicareer Might not Working Properly
Learn More
Y

Site Reliability Engineer

  • Full Time, onsite
  • Yashco Systems, Inc.
  • Hybrid3 days onsite a week, United States of America
Salary undisclosed

Apply on


Original
Simplified

NOT FOR RECRUITING AGENCIES WE CANNOT SPONSER

Sr. SRE ( Site Reliability Engineer) - KINDLY NOTE THAT THIS IS NOT A DEVOPS ROLE

Someone with hands one experience in Data DevOps/ DataOps/ No- SQL, Kafka , Databricks, Kubernetes, Kafka , Terraforms

HYBRID ROLE IN SEATTLE WA

Imp Note

This is a Sr. SRE role and not devops role

Azure Cloud -AKS - must have this experience

Databricks Notebooks must have this experience

NO-SQL Database - Cassandra, Mongo, PostGres- must have this experience

Kubernetes must have this experience

Kafka- skill level expert is required for this role

Terraform- skill level expert is required for this role

Pl match skills before submitting resumes

Core skills needed -

Azure Clous, AKS Scalability, monitoring, deployment, check logs, ensure node and pod health.

Databases include - Cassandra, Mongo, PostGres

Databricks Notebooks There are a lot of jobs on Databricks experience with Databricks to know how a notebook is created and run - run queries against the database and finding discrepancies and perform fixes.

Based microservices, responsible for deployment, scripting language is python.

Should have an understanding around terraform.

Emphasis on Logs and Monitoring (datadog and splunk)

Summary of Experience

  • Requires 10-12 years experience in the IT industry
  • Requires 9+ years of software and DevOps development engineering
  • Experience in working with cloud environment Azure preferred.
  • Experience with Kubernetes, Azure Kubernetes (AKS) preferred.
  • Experience with using Kafka, Event Hub, NATS or any messaging broker.
  • Experience with Cassandra, PostgresSQL, Mongo, Elastic Search, Cosmos DB
  • Experience on Azure DevOps, Jenkins/ Python / Terraform / Ansible
  • Experience with Databricks
  • Experience with DataDog, Splunk or other logging and APM tools.
  • Experience in working with Linux environment.

Summary of Key Responsibilities

Responsibilities and essential job functions include but are not limited to the following:

Responsible for health of production system

Develop monitoring dashboards

Configure alerts and automate process for system recovery

Monitor alerts and take proactive steps to resolve system issues

Troubleshoot production issues

Lead production troubleshooting calls

Responsible for patches and updates on production systems.

Design and build cutting-edge, multi-micro service solutions to support Starbucks s growth worldwide.

Helping CI/CD team during rolling out application and infrastructure globally.

Collaborates with development team, other Information Technology (IT) team s developer leads. Initiates process improvements for new and existing systems.

Participates in a production support rotation that includes pager responsibilities.

Ability to accurately break down complex application designs into component deliverables and estimate design and development timelines

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
Report this job