Site Reliability Engineer
Who we are:
ShorePoint is a fast-growing, industry-recognized, and award-winning cybersecurity services firm with a focus on high-profile, high-threat, private- and public-sector customers who demand experience and proven security models to protect their data. ShorePoint subscribes to a work-hard, play-hard mentality and celebrates individual and company successes. We are passionate about our mission and about going above and beyond to deliver for our customers. We are equally passionate about an environment that supports creativity, accountability, diversity, inclusion, and a focus on giving back to our community.
The Perks:
As recognized members of the Cyber Elite, we work together in partnership to defend our nation's critical infrastructure while building meaningful and exciting career development opportunities in a culture tailored to individual technical and professional growth. We are committed to the belief that our team members do their best work when they are happy and well cared for. In support of this philosophy, we offer a comprehensive benefits package, including major carriers for health care providers. Highlighted benefits offered: 18 days of PTO, 11 holidays, 80% of insurance premium covered, 401k, continued education, certification maintenance and reimbursement, etc.
Who we're looking for:
We are seeking a skilled Site Reliability Engineer to support dynamic, fast-paced environments in the public sector. This role involves engineering, operating, and monitoring cyber data collection systems based on architecture and system designs. The Site Reliability Engineer will collaborate closely with data engineers, architects, and security analysts to ensure system reliability and continuously drive operational excellence. This is an exciting opportunity to shape the future of government cyber data modernization capabilities.
What you'll be doing:
- Provide operational engineering support for cyber data systems in government environments.
- Build and integrate IT best practices and operational excellence across all areas of the project.
- Ensure system uptime by building observability and reliability into large, distributed security data infrastructures.
- Develop systems to monitor data ingestion and storage of large and diverse datasets.
- Produce dashboards and reports to inform the team about operational metrics, system status, performance, and capacity.
- Operate and maintain cloud-based infrastructure and platform services.
- Manage incidents, determine root causes, and develop plans to prevent future outages.
- Collaborate in an Agile DevOps team environment, using tools for continuous integration and delivery (CI/CD).
- Communicate effectively with team members and maintain documentation for technical procedures.
What you need to know:
- Strong experience in IT operations and troubleshooting issues related to data connections and sources.
- Proven expertise in implementing IT best practices (e.g., IT Service Management, ITIL, Change Management, Configuration Management).
- Advanced Linux systems administration skills.
- Familiarity with cloud infrastructure services (e.g., VPC, EC2, IAM) and cloud platform security and configuration services.
- Experience with distributed event data storage and analytical tools (e.g., Elasticsearch, Splunk).
- Familiarity with DevOps Infrastructure-as-Code tools (e.g., Terraform, Ansible, Git, CI/CD).
- Proficiency in system automation programming languages (e.g., Python, Bash).
Must-haves:
- Bachelor's degree in Cybersecurity, Computer Science, Information Systems, Mathematics, Engineering, or a related field, or an additional 3-5 years of relevant experience.
- 5+ years of experience operating and maintaining large-scale IT systems, or in system or application integration.
- Demonstrated ability to apply critical thinking to transform undefined tasks into actionable processes and workstreams.
- Proficiency in administering Linux operating systems.
- Experience with Elasticsearch infrastructure (version 8.x and newer).
- Proven experience in producing dashboards and reports using tools like Elasticsearch and Grafana.
- Experience querying APIs to extract and utilize performance metrics.
- AWS cloud experience, including use of the AWS CLI.
- Intermediate proficiency in Python programming (e.g., AWS Lambda, Boto3).
- Experience with infrastructure automation (e.g., Ansible, Terraform).
- U.S. citizenship, in compliance with federal contract requirements, and eligibility to obtain a Q clearance.
Beneficial to have the following:
- Familiarity with government cloud environments.
- Knowledge of cyber data analytics.
- Experience with distributed systems integration.
- Understanding of information security principles.
- Familiarity with Splunk infrastructure.
- Familiarity with NISTIR 8112 standards.
Where it's done:
- Remote (Herndon, VA).