CLS (Centralized Logging Solution) Specialist
Apply on
- Help monitor and maintain CLS performance, availability, and capacity.
- Help maintain application container images.
- Offer solutions for ingestion of logs to Splunk via cloud native solutions.
- Maintain all infrastructure as code.
- Provide operations monitoring of CLS platform to enable proactive issue identification, response, and resolution.
- Recommend and execute improvements to the existing CLS architecture and design with growth and scalability in mind to optimize performance, stability, reliability, and agility.
- Responsible for reporting on current infrastructure status, and planning for future usage.
- Responsible for Beats agent deployments and container infrastructure analysis, optimization, and capacity planning.
- Maintain CI/CD pipelines for configuration deployments to applications.
- Support large-scale deployments with data feeds from multiple on premise and cloud data centers.
- Upgrade, install, configure monitoring solution for AWS for Windows and Linux servers.
- Utilize automation tool such as Terraform, Ansible, AWS Cloud Formation, Azure Resource Manager, or similar.
- Participate in a rotating on call schedule and weekly off hours maintenance.
What qualifications do I need?
- Candidate background eligibility requirements are United States citizen or be a Permanent Resident and have lived in the United States for at least 3 years, clean criminal background and able to obtain a Public Trust (High-Risk) Position
- Bachelor s degree in computer science, electronics engineering or other engineering or technical discipline OR AWS/Azure Certification (AWS Professional / Specialty Cert. OR Azure Expert / Advanced Cert.) OR 4 years of relevant experience in one of the VAECOT suite of tools (Science Logic, Dynatrace, Turbot, AppDynamics)
- Minimum of three (3) years of experience in leading technical teams to achieve objectives and outcomes.
- Minimum of six (6) years setting up, configuring, and using AWS cloud operational tools to ensure service level agreements and performance targets are met, and continued compliance with policies, standards and guidelines
- Minimum of three (3) years specific to monitoring Centralized Logging Solution (CLS)/Splunk
- Subject matter expertise with ALL VAEC Cloud Service Providers which currently includes Microsoft Azure and Amazon Web Services (AWS)
- Experience with programming with Python or equivalent (e.g., Powershell, AWS or Azure CLI)
- Knowledge of enterprise logging, with a focus on security event logging
- A solid understanding of cloud concepts, either using Azure or AWS semantics
- Experience in one or more of the VAECOT suite of tools, shown below.
Candidates that do not meet the minimum qualifications will not be considered.
- Multiple Microsoft or Amazon cloud certifications
- Previous Federal Government experience
- Strong ability to foster collaborative work in dynamic team environment
- Strong creative, analytical and problem solving and trouble-shooting skills
- Strong knowledge (recent experience) with the following technologies: storage, servers, data centers, networking
- Strong technical experiences working migrations or systems development as well as coordinating from a business perspective
- Strong understanding of SDLC concepts, full lifecycle development for systems/applications
VAEC Operational Tools (VAECOT)
Some experience in one or more of the following tools:
Third party tools
Application Performance Monitoring: Dynatrace, AppDynamics
Cloud Security: Nessus, NetSkope, Enterprise Security External Change Council, Identity and Assessment Management, Continuous Monitoring as a Service, McAfee, eMASS, Centrify
Cloud Governance: Turbot
DevOps/Configuration Management/Help Desk: Ansible, Service Desk, ScienceLogic, ServiceNow, SPLUNK, Jira ServiceDesk, Cloudockit, GitHub
Containerization: Red Hat OpenShift
Migration: CloudKey, Version One
Reporting: Apptio
Cloud Service Provider (CSP) Operational Tools Tools/Services
AWS Security: System Manager (Explorer and OpsCenter), CloudWatch, Config, CloudTrail, Elasticsearch (Kinesis DataStreams), GuardDuty, Inspector, Key Management Service (KMS), Security Hub, Directory Service, Identity and Access Management, Resource Access Manager, Cognito, Secrets Manager, Certificate Manager, Artifact
Aws Monitoring and Logging: QuickSight, Eventbridge (AWS Kinesis DataStreams), Simple Notification Service (SMS), Elasticsearch (AWS Kinesis DataStreams), CloudTrail, CloudWatch
Aws Networking: Virtual Private Cloud (VPC), Route S3, API Gateway, Direct Connect, AppStream 2.0, Transit Gateway, Elastic Loadbalancer, Firewall Manager, WAF & Shield
AWS Storage: Cloud Tiering Services to S3 from On-Prem, Simple Storage Services (S3), S3 Glacier, Storage Gateway, Elastic File System (EFS), Backup
Azure Security: Monitor (Log Analytics and ASC), Event Hubs, Security Center (ASC), Information Protection (AIP) , Key Vault, PowerBI, Network Watcher (Performance Monitor), Monitor (Log Analytics and ASC)
Azure Monitoring and Logging: Information Protection (AIP), Advance Threat Protection, Security Center (ASC), Information Protection (AIP), Key Vault, Active Directory, Role Based Access Control (RBAC), Resource Manager (ARM), Resource Graph (ARG), Active Directory B2C, Key Vault, App Service, Service Trust Portal
Azure Networking: Virtual Network, Traffic Manager, DNS, Application Gateway, Express Route, Web Apps, FrontDoor, VPN Gateway, Loadbalancer, Firewall
Azure Storage: NetApp File Service, Storage (Blobs, Disks, Files, Queues, Tables), Storage Archive Access Tier, StorSimple, Files, Backup