6 - 8 years’ of experience on developing data solutions in Python using Spark framework.
Desired Candidate Profile
Designs, modifies, and builds new and scalable data processes.
Ability to perform root cause analysis and identify performance bottlenecks in Spark Jobs.
Expert in Data Engineering and building data pipelines, implementing Algorithms in a distributed environmentExpert in Data Engineering and building data pipelines, implementing Algorithms in a distributed environment.
Ability to design and develop parallel processing data platform in PySpark.
Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
Strong Proficiency in SQL.
Cloud knowledge especially Azure.
Collaborates with stakeholders, IT, database engineers and other scientists.
Hands-on knowledge in Azure Synapse and Azure Data Factory is a plus.