Epicareer Might not Working Properly
Learn More

Data Engineer

Salary undisclosed

Apply on


Original
Simplified

Position Overview:

We are seeking a skilled Data Engineer to ensure the availability, quality, and integration of data essential for analysis, business operations, and decision-making. The role involves transforming raw datasets into analytics-ready, curated data, supporting data science, BI, and various analytical functions. You will design, develop, test, and maintain scalable data processing systems, collaborating closely with Data Architects, Data Scientists, business, and IT teams.

Key Responsibilities:

  • Data Acquisition: Gather and store data in formats suitable for data preparation and business applications.
  • Data Preparation: Cleanse, validate, integrate, and transform raw data into curated datasets ready for analytics.
  • Data Publishing: Release data for reuse by other teams, ensuring quality and usability.

Business Analysis & Technical Leadership:

  • Engage proactively with business units to identify opportunities to add value through data.
  • Translate business requirements into technical specifications and vice versa.
  • Elicit, define, and manage technical and business requirements to ensure solutions align with business goals.
  • Lead and participate in design reviews, ensuring alignment with business needs.
  • Identify opportunities for process improvement and reuse of existing services and systems.
  • Stay current with emerging tools and technologies to inform IT strategy and promote innovative data solutions.

Core Responsibilities:

  • Architect and build scalable, high-performance data pipelines following data lakehouse and data warehouse standards.
  • Collaborate with business stakeholders to define data requirements and provide technical solutions.
  • Ensure effective transition of data applications to service management teams, including thorough knowledge transfer.
  • Lead the system testing process, resolving defects and managing resources efficiently.
  • Partner with vendors and influence solution development to ensure adherence to technical direction and data integration strategies.

Key Requirements:

  • Education: Bachelor's degree in Computer Science, Information Technology, Management Information Systems, or equivalent experience.
  • Experience:
  • 5+ years of development experience with core tools and technologies (SQL, Python, AWS – Lambda, Glue, S3, Redshift, Athena, IAM roles & policies, PySpark).
  • 3+ years of experience with Agile Development and CI/CD pipeline deployment using GitHub.
  • 2+ years of job orchestration using Airflow.
  • Expertise in data modeling and management of large datasets.
  • Experience with security models for large-scale data solutions.

Preferred Qualifications:

  • Experience working in regulated environments and adhering to internal quality policies.
  • Familiarity with AWS database technologies and cloud infrastructure.
  • Understanding of data warehousing, integration, and data architecture.
  • Industry experience in pharmaceuticals, healthcare, or early drug discovery.
  • Knowledge of Sales & Marketing business processes and systems.
  • Experience defining technical standards, best practices, and design principles.
  • A strong curiosity and desire to innovate.