
Senior DevOps Engineer

Salary undisclosed

Job Description

Senior DevOps Engineer
This position will be hybrid initially
Overview:
An East Tennessee R&D facility is seeking highly qualified individuals to play a key role in improving the security, performance, and reliability of the client's computing environments.
In this role, you will work within the HPC Clusters Group inside of the Systems Section to support numerous activities of the center.
The HPC Clusters Group administers and supports the division's HPC computing infrastructure, which includes system installation, deployment, acceptance, performance testing, upgrades, problem diagnosis, and troubleshooting. The HPC Systems Section administers and supports the division's computing, networking, and storage systems.
Major Duties/Responsibilities:
  • Work with the team to define and implement best practices and standards within the organization.
  • Automate systems administration tasks utilizing open-source configuration management tools.
  • Identify automation opportunities to improve DevOps operations, make recommendations to management, and lead the implementation of improvements.
  • Review architecture, offer recommendations for improvements, and lead implementation efforts.
  • Troubleshoot systems and solve problems across multiple platforms (dev/test/prod).
  • Work with the team to adopt a software-defined infrastructure and infrastructure-as-code paradigm.
  • Embrace continuous integration and continuous delivery (CI/CD) processes. Train and mentor junior-level staff in these processes.
  • Evaluate new technology options and vendor products; rely on expertise to recommend new technology and products to management.
  • Work with CI/CD technologies such as GitLab runners.
  • Work with configuration management tools such as Puppet and Ansible.
  • Identify and document IT best practices that will improve the systems deployment function.
  • Ensure the secure and effective operation of computing systems through compliance with ORNL procedures and IT Internal Operating Procedures.
  • Work with other systems engineers and vendors to resolve hardware and software issues.
  • Answer escalated helpline calls in addition to primary project work.
  • Monitor systems performance.
  • Install and configure software, including both commercial and open-source packages.
  • Maintain documentation/notes on software builds and installs.
Basic Qualifications:
  • Bachelor's degree in computer science or a related technical subject, or an equivalent combination of education and experience.
  • A minimum of 7 years of experience managing UNIX/Linux Systems.
  • A minimum of 2 years of experience managing container infrastructure using Docker.
  • A minimum of 2 years of experience using configuration management and automation tools such as Git, Ansible, Puppet, or other CI/CD pipeline tools.
  • Fluency in at least one scripting language such as Bash, Python, Go or equivalent.
  • Must be able to obtain a federal security clearance.
Preferred Qualifications:
  • Working knowledge of multiple operating systems.
  • Experience with RHEL 7/8.
  • Knowledge of networking fundamentals including TCP/IP, traffic analysis, common protocols, and network diagnostics.
  • Experience with performance and diagnostic tools for benchmarking, analysis, and tuning of systems, networking, and storage.
  • Experience with Nagios, Zabbix, Ganglia, and other network and device monitoring systems.
  • Previous experience working in a government, scientific, or other highly technical environment.
  • A background of contributing to open-source projects or avocational endeavors such as hacker/maker spaces is desirable.
  • Technical documentation skills, including the ability to prepare simple documentation web pages.
  • Excellent interpersonal skills suitable for user support and ability to work well with peer system administrators.
  • Excellent written and verbal communication skills.
  • Ability to work independently and demonstrated analytical and problem-solving skills.
  • Demonstrated ability to balance complex research and security requirements.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.