Job Description
6-8 years of experience developing data solutions in Python using the Spark framework.
Desired Candidate Profile
- Designs, modifies, and builds new and scalable data processes.
- Ability to perform root cause analysis and identify performance bottlenecks in Spark Jobs.
- Expert in data engineering, building data pipelines, and implementing algorithms in a distributed environment.
- Ability to design and develop a parallel-processing data platform in PySpark.
- Performs the data analysis required to troubleshoot data-related issues and assists in their resolution.
- Strong proficiency in SQL.
- Cloud knowledge, especially Azure.
- Collaborates with stakeholders, IT, database engineers and other scientists.
- Hands-on knowledge of Azure Synapse and Azure Data Factory is a plus.