Incident & Support: Provide technical support for cloud-based data platforms (data warehouses, pipelines, distributed computing) and swiftly resolve production incidents including performance degradation, stability issues, and data reliability concerns.
Root Cause Analysis (RCA): Perform and document root cause analysis for long-term prevention.
Monitoring & Observability: Proactively monitor system health and data pipeline performance using cloud-native tools, developing dashboards, alerts, and reporting frameworks for real-time insight.
Automation: Build and maintain automation scripts using Python, PowerShell, and Bash to reduce repetitive tasks and enhance operational efficiency.
Platform Improvement: Suggest and implement improvements to increase platform resilience, reliability, and performance.
Collaboration: Work closely with Data Engineers, Full Stack Support Engineers, Data Scientists, and client-facing teams for troubleshooting and resolution.
Knowledge Base: Write, maintain, and share runbooks and troubleshooting guides.
On-Call: Be available for extended working hours during critical outage events.
Education: Bachelor’s degree in Computer Science, Information Systems, Engineering, or a closely related field.
Experience: 3+ years of hands-on experience in support engineering, cloud operations, or data engineering within a cloud environment (Microsoft Azure preferred).
Data Platform: Strong practical experience with cloud-hosted data platforms, including data warehouses, pipeline orchestration services, and distributed compute engines.
Analytics Platforms: Experience working with modern scalable analytics platforms such as Databricks, Spark, Azure Synapse, or Microsoft Fabric.
Containerization: Familiarity with container orchestration and virtualization technologies like Kubernetes and Docker.
Monitoring: Familiarity with cloud-native monitoring and observability tools.
Automation: Ability to build and maintain automation scripts using languages like Python, PowerShell, and Bash (implied by the job responsibilities).
Troubleshooting: Proven ability to investigate and resolve issues using SQL/T-SQL, Python, and Spark workloads.
Operations: Knowledge of incident management practices (escalation, resolution, and prevention) and experience upholding high standards for reliability in business-critical production systems.
Benefits:
IF you meet the above requirements and want to make a career-changing move, apply today by emailing your CV to itcareers@hireresolve.za.com
Specialists in Civil, Structural, Mechanical Engineering, Information Technology, Mining, Manufacturing and Finance Careers! Hire Resolve is one of the larger and more agile South African recruitment companies that focus on placing professionals and skilled people in permanent employment and contract employment. We prefer and focus on working with top quality professionals and candidates in South Africa and Africa. Hire Resolve has successfully placed Engineering, Mining, IT, Manufacturing and Finance professionals with top firms across the Western Cape, Eastern Cape, KwaZulu Natal, Gauteng and in Africa. Hire Resolve has assisted candidates to find jobs at over 100 JSE listed companies of which many are global companies with offices and operations in South Africa and Africa. It is for this reason that we are well respected in the industries we operate in and in the recruitment industry.
You have successfully created your alert.
You will receive an email when a new job matching your criteria is posted.
Please check your email. It looks like you haven't verified your account yet. Here's what you're missing out on:
Didn't receive the link? Resend Verification Link