L2 Cloud Engineer with AWS, Observability, LogicMonitor, Dynatrace, Data Center Migration|| Remote|| Must have linkedin and 9+ years of exp.|| Max. rate is $55/hr on C2C ||
L2 Cloud Engineer with AWS, Observability, LogicMonitor, Dynatrace, Data Center Migration
Remote
Must have linkedin and 9+ years of exp.
Seeking experienced L2 Cloud Engineer Observability with expertise in LogicMonitor and Dynatrace to support a data center migration project. The ideal candidate will be responsible for implementing, managing, and optimizing observability, performance monitoring, and alerting across on-premise and cloud environments to ensure a seamless migration with minimal disruption.
Experience: 9 years of relevant experience
Client Location: Onshore, US
Mode of work: Remote; Needs to travel to client site when required by the client. Work Time Zone: EST or CST
Rate : $55/hr on C2C
Detailed Job Description
Key Responsibilities:
1. Observability & Performance Monitoring:
- Deploy, configure, and manage LogicMonitor and Dynatrace for real-time monitoring of data center workloads before, during, and after migration.
- Set up dashboards, alerts, and reports to track server, network, application, and cloud performance.
- Monitor on-premise and cloud workloads to ensure system availability, latency optimization, and proactive issue resolution.
- Handle L2-level escalations related to AWS services and infrastructure.
- Perform routine patching, updates, and backup management.
- Monitoring, Incident Management & Troubleshooting:
- Monitor AWS resources using CloudWatch, CloudTrail, and third-party tools (Datadog, Splunk, etc.).
- Respond to alerts and incidents. Act as L2 escalation for performance-related issues during data center migration.
- Maintain system uptime and availability as per SLAs.
- Analyze historical and real-time performance data to detect potential bottlenecks before
migration.
- Work closely with networking, cloud, and infrastructure teams to troubleshoot latency,
connectivity, and system stability issues.
- Perform Root Cause Analysis (RCA) for post-migration incidents and optimize observability
configurations accordingly.
- Automation & Optimization:
- Automate LogicMonitor and Dynatrace configurations using APIs and scripting (Python, PowerShell, or Bash).
- Fine-tune monitoring thresholds and alerting rules to minimize false positives and improve incident response efficiency.
- Implement AI-driven analytics using Dynatrace for predictive issue detection and proactive resolution.
- ContinuousImprovement:
- Stay abreast of emerging trends and technologies in observability and performance monitoring.
- Recommend and implement enhancements to existing monitoring and alerting systems.
- Collaboration & Documentation:
- Work closely with migration teams, infrastructure engineers and other teams to ensure observability best practices are followed.
- Provide detailed documentation on monitoring configurations, troubleshooting steps, and migration-related performance benchmarks.
- Train L1 support teams on handling performance-related alerts and escalating critical issues.
Qualifications:
- Bachelor s degree in Computer Science, Information Technology, or a related field.
- 3-5 years of experience in cloud engineering, with a focus on observability and monitoring.
- Proven experience in data center migrations and cloud infrastructure management.
Technical Skills:
- Hands-on experience with LogicMonitor and Dynatrace for real-time observability and performance analysis.
- Proficiency in cloud-native monitoring tools such as AWS CloudWatch, Azure Monitor, Prometheus, Grafana, ELK Stack, OpenTelemetry, and Jaeger.
- Strong understanding of cloud platforms (AWS, Azure, Google Cloud Platform) and their observability frameworks
- Knowledge of Kubernetes, containers, and microservices monitoring.
- Experience in tracking performance metrics pre- and post-migration to validate system
stability
- Understanding of network dependencies, firewalls, and security configurations impacting
migration observability
- Experience in providing real-time performance insights for stakeholders during cutover
windows.
- Ability to automate tasks using Python, PowerShell, Bash, or Terraform.
- Experience in integrating monitoring tools with ITSM (ServiceNow, Jira) and alerting platforms
(PagerDuty, OpsGenie).
- Experience in monitoring Windows & Linux servers, databases, and network infrastructure.
- Experience in data center migrations for application and infrastructure performance analysis.
- Experience with scripting and automation using languages such as Python, Shell, or Go.
- Manage AWS Identity and Access Management (IAM) roles, policies, and permissions.
- Implement security best practices, including encryption, logging, and access controls.
- Troubleshoot networking issues in VPC, Security Groups, Route Tables, and VPNs.
Soft Skills:
- Excellent problem-solving abilities and attention to detail.
- Strong communication and collaboration skills.
- Ability to work independently and manage multiple tasks effectively.
- Work with L3 engineers, architects, and security teams to ensure best practices.
- Maintain detailed documentation for infrastructure, troubleshooting guides, and SOPs.
- Provide technical support and training to L1 engineers when required.
Preferred:
- Dynatrace Associate or Professional Certification.
- LogicMonitor Certified Professional.
- Certifications in cloud platforms (e.g., AWS Certified Solutions Architect, Azure Administrator).
- Experience with infrastructure as code tools like Terraform or Ansible.
- Familiarity with DevOps practices and CI/CD pipelines.
- Familiarity with ITIL processes, change management, and incident handling.
Thanks,
KK
L2 Cloud Engineer with AWS, Observability, LogicMonitor, Dynatrace, Data Center Migration
Remote
Must have linkedin and 9+ years of exp.
Seeking experienced L2 Cloud Engineer Observability with expertise in LogicMonitor and Dynatrace to support a data center migration project. The ideal candidate will be responsible for implementing, managing, and optimizing observability, performance monitoring, and alerting across on-premise and cloud environments to ensure a seamless migration with minimal disruption.
Experience: 9 years of relevant experience
Client Location: Onshore, US
Mode of work: Remote; Needs to travel to client site when required by the client. Work Time Zone: EST or CST
Rate : $55/hr on C2C
Detailed Job Description
Key Responsibilities:
1. Observability & Performance Monitoring:
- Deploy, configure, and manage LogicMonitor and Dynatrace for real-time monitoring of data center workloads before, during, and after migration.
- Set up dashboards, alerts, and reports to track server, network, application, and cloud performance.
- Monitor on-premise and cloud workloads to ensure system availability, latency optimization, and proactive issue resolution.
- Handle L2-level escalations related to AWS services and infrastructure.
- Perform routine patching, updates, and backup management.
- Monitoring, Incident Management & Troubleshooting:
- Monitor AWS resources using CloudWatch, CloudTrail, and third-party tools (Datadog, Splunk, etc.).
- Respond to alerts and incidents. Act as L2 escalation for performance-related issues during data center migration.
- Maintain system uptime and availability as per SLAs.
- Analyze historical and real-time performance data to detect potential bottlenecks before
migration.
- Work closely with networking, cloud, and infrastructure teams to troubleshoot latency,
connectivity, and system stability issues.
- Perform Root Cause Analysis (RCA) for post-migration incidents and optimize observability
configurations accordingly.
- Automation & Optimization:
- Automate LogicMonitor and Dynatrace configurations using APIs and scripting (Python, PowerShell, or Bash).
- Fine-tune monitoring thresholds and alerting rules to minimize false positives and improve incident response efficiency.
- Implement AI-driven analytics using Dynatrace for predictive issue detection and proactive resolution.
- ContinuousImprovement:
- Stay abreast of emerging trends and technologies in observability and performance monitoring.
- Recommend and implement enhancements to existing monitoring and alerting systems.
- Collaboration & Documentation:
- Work closely with migration teams, infrastructure engineers and other teams to ensure observability best practices are followed.
- Provide detailed documentation on monitoring configurations, troubleshooting steps, and migration-related performance benchmarks.
- Train L1 support teams on handling performance-related alerts and escalating critical issues.
Qualifications:
- Bachelor s degree in Computer Science, Information Technology, or a related field.
- 3-5 years of experience in cloud engineering, with a focus on observability and monitoring.
- Proven experience in data center migrations and cloud infrastructure management.
Technical Skills:
- Hands-on experience with LogicMonitor and Dynatrace for real-time observability and performance analysis.
- Proficiency in cloud-native monitoring tools such as AWS CloudWatch, Azure Monitor, Prometheus, Grafana, ELK Stack, OpenTelemetry, and Jaeger.
- Strong understanding of cloud platforms (AWS, Azure, Google Cloud Platform) and their observability frameworks
- Knowledge of Kubernetes, containers, and microservices monitoring.
- Experience in tracking performance metrics pre- and post-migration to validate system
stability
- Understanding of network dependencies, firewalls, and security configurations impacting
migration observability
- Experience in providing real-time performance insights for stakeholders during cutover
windows.
- Ability to automate tasks using Python, PowerShell, Bash, or Terraform.
- Experience in integrating monitoring tools with ITSM (ServiceNow, Jira) and alerting platforms
(PagerDuty, OpsGenie).
- Experience in monitoring Windows & Linux servers, databases, and network infrastructure.
- Experience in data center migrations for application and infrastructure performance analysis.
- Experience with scripting and automation using languages such as Python, Shell, or Go.
- Manage AWS Identity and Access Management (IAM) roles, policies, and permissions.
- Implement security best practices, including encryption, logging, and access controls.
- Troubleshoot networking issues in VPC, Security Groups, Route Tables, and VPNs.
Soft Skills:
- Excellent problem-solving abilities and attention to detail.
- Strong communication and collaboration skills.
- Ability to work independently and manage multiple tasks effectively.
- Work with L3 engineers, architects, and security teams to ensure best practices.
- Maintain detailed documentation for infrastructure, troubleshooting guides, and SOPs.
- Provide technical support and training to L1 engineers when required.
Preferred:
- Dynatrace Associate or Professional Certification.
- LogicMonitor Certified Professional.
- Certifications in cloud platforms (e.g., AWS Certified Solutions Architect, Azure Administrator).
- Experience with infrastructure as code tools like Terraform or Ansible.
- Familiarity with DevOps practices and CI/CD pipelines.
- Familiarity with ITIL processes, change management, and incident handling.
Thanks,
KK