Machine Learning Engineer at REMOTE

  • Full Time, remote
  • Innoworklab Technologies Pvt Ltd
  • Remote, United States of America
Salary undisclosed

Title - AWS Machine Learning Engineer
Location - Remote

Job Description:

We are seeking an experienced and motivated AWS Machine Learning Engineer to join our team. This role focuses on using AWS cloud infrastructure and machine learning tools to design, build, deploy, and maintain robust machine learning solutions. The ideal candidate will be deeply familiar with Python, ML frameworks such as PyTorch and TensorFlow, and AWS tools such as SageMaker and Lambda. They will have strong knowledge of CI/CD processes and a passion for optimizing ML workflows to support business-driven use cases.

Key Responsibilities:

  1. ML Solution Design & Deployment: Collaborate with data scientists to understand ML models (XGBoost, deep learning models, etc.) and create scalable, efficient infrastructure for distributed calculations and deployment on AWS.
  2. AWS Services Expertise: Work with services like EC2, S3, SageMaker, and CloudWatch to design, implement, and monitor machine learning pipelines. Set up and manage AWS accounts, S3 buckets, and other foundational AWS infrastructure.
  3. SageMaker & Model Deployment: Deploy and manage machine learning models in SageMaker Studio, using containerized environments and following best practices for model registries and for monitoring real-time and batch inference.
  4. Teach Data Engineers: Train and mentor data engineers to productionize existing machine learning models on AWS, ensuring successful deployment and maintenance in a production environment.
  5. CI/CD Pipelines for ML: Implement continuous integration/continuous delivery (CI/CD) pipelines for both code and ML models, handling model experimentation, testing, and monitoring.
  6. AWS Engineering: Build AWS architecture using CloudFormation, Terraform, and other infrastructure-as-code tools to support machine learning operations (MLOps).
  7. Cost Optimization: Ensure efficient use of resources, selecting appropriate EC2 instances for different ML workloads and optimizing model inference to reduce costs.
  8. Monitoring & Troubleshooting: Use AWS CloudWatch for error tracking and performance monitoring. Develop strategies to improve performance and reliability.
  9. Innovative Use Cases: Proactively explore new use cases and solutions on AWS to improve ML processes and support various business functions.
  10. Collaboration & Learning: Work with cross-functional teams, including data scientists, software engineers, and AWS specialists, to deliver high-quality solutions. Curiosity and willingness to learn new tools and services are essential.

Key Skills & Qualifications:

  1. Python Expertise: Advanced knowledge of Python for machine learning applications, including ML frameworks such as PyTorch, TensorFlow, and XGBoost.
  2. AWS Proficiency: Strong experience with core AWS services, including EC2, S3, SageMaker, and CloudWatch, plus an understanding of account setup, infrastructure basics (e.g., ALBs), and automation tools (CloudFormation, Terraform).
  3. CI/CD Process: Understanding of software CI/CD and ML CI/CD, including pipelines for code, model experimentation, testing, and deployment.
  4. MLOps Knowledge: Familiarity with MLOps practices, including model experimentation, testing, monitoring, and version control.
  5. Containerization: Experience working with containers on AWS (Docker, ECR, ECS) and deploying containerized ML solutions in SageMaker.
  6. AWS Certifications: Preferred.
  7. Cloud Infrastructure Expertise: Ability to choose appropriate infrastructure resources for different jobs, focusing on cost-effectiveness and performance.
  8. Monitoring and Inference Optimization: Experience with real-time and batch inference monitoring and optimization for cost-effective ML model deployment.
  9. Collaboration & Growth: Willingness to mentor junior engineers and train data engineers, or grow in the role (for entry-level candidates), and openness to learning new AWS tools and technologies.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.