Data Engineer
Apply on
Job Description
Manifest Solutions is currently seeking a Python Data Engineer for a hybrid position in Columbus, OH.
Data Engineer is responsible for developing Life Sciences content curation and delivery system for the purpose of building life sciences databases that empower proprietary life sciences search technologies. This role encompasses developing and deploying scientific software solutions in the life sciences information space to support transformational initiatives, delivering both short and long-term results to the business.
Develops data transformation and integration pipelines and infrastructure foundations of life sciences content in support of scientific databases and data curation.
Combines strong software development and data engineering skills with a working knowledge of basic biology/chemistry/physics to develop sophisticated informatics solutions that drive efficiencies in content curation and workflow process.
Applies data transformation and other data-engineering software development capabilities to contribute to the building of new scientific information management systems supporting scientific database building activities.
Education/Experience
4-year degree in computer science, engineering, informatics, or equivalent experience
Minimum of 4 years of software development experience
Competencies/Technologies
Proficiency in Python
Proficiency in other programming languages such as JavaScript/TypeScript/Java
Proficiency in Linux/Unix environments
Experience building applications for public cloud environments (AWS preferred)
Experiencewith databases technologies (NoSQL, relational, property graph, RDF/triple store)
Experience with data engineering tools and techniques is highly desired
Experience with AWS DevOps tools (git, Cloud Development Kit, CDK Pipeline) is highly desired
Experience building applications using AWS Serverless technologies such as Lambda, SQS, Fargate, S3 is highly desired
Experience working with XML and XPath is highly desired
Experience with MarkLogic/Xquery is a plus
Experience with Apache Airflow is a plus
Experience building containerized applications (Docker, Kubernetes) is a plus
Strong communication, organizational savvy, interpersonal skills
Self-motivated with the ability to work with minimal supervision
Innovates and continuously improves; focuses on areas of highest potential