Support Engineer- Spark HDI
Apply on
Top Skills:
Top Skills:
- Minimum 5+ years experience with HD Insights; Spark
2. Minimum 5+ years experience with Azure Fundamentals and Azure PAAS (desirable)
3. Minimum 5+ years experience with Excellent Verbal and written Comms.
Job Description:
Job responsibilities:
Designing and implementing highly performant data ingestion pipelines from multiple sources using Apache Spark and/or Azure Databricks and/or HDInsights
Delivering and presenting proofs of concept to of key technology components to project stakeholders.
Developing scalable and re-usable frameworks for ingesting of geospatial data sets
Integrating the end to end data piple-line to take data from source systems to target data repositories ensuring the quality and consistency of data is maintained at all times
Working with event based / streaming technologies to ingest and process data
Working with other members of the project team to support delivery of additional project components (API interfaces, Search)
Evaluating the performance and applicability of multiple tools against customer requirements
Working within an Agile delivery / DevOps methodology to deliver proof of concept and production implementation in iterative sprints.
Qualifications
Strong knowledge of Data Management principles
Experience in building ETL / data warehouse transformation processes
Direct experience of building data pipelines using HDInsights and Apache Spark (preferably Databricks).
Experience using geospatial frameworks on Apache Spark and associated design and development patterns
Microsoft Azure Big Data Architecture certification.
Hands on experience designing and delivering solutions using the Azure Data Analytics platform (Cortana Intelligence Platform) including Azure Storage, Azure SQL Data Warehouse, Azure Data Lake, Azure Cosmos DB, Azure Stream Analytics
Experience with Apache Kafka / Nifi for use with streaming data / event-based data