– Design, implement, test, and optimize software features that improve observability for Azure Core services. – Develop scalable, resilient systems for telemetry, detection, and automated recovery across global deployments. – Collaborate with engineering and product teams to integrate observability solutions with existing systems. – Contribute to open-source frameworks and adopt best practices for observability and reliability. – Actively participate in incident response and ensure high availabi…
Required Qualifications
- Bachelor's Degree in Computer Science or related technical field AND 7+ years technical engineering experience with coding in languages including, but not limited to(C, C++, C#, Java, JavaScript, or Python) - (OR equivalent experience).Hands-on experience with any cloud platform. 2+ year(s) experience designing and building distributed systems or cloud-scale services. 2+ year(s) of proficiency with observability concepts (telemetry, logging, metrics, detection) and operational excellence. - 1+ year(s) of collaborating across teams and delivering exceptional solutions in a fast-paced environment.
Preferred Qualifications
- Bachelor's Degree in Computer Science or related technical field AND 7+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python - OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python - OR equivalent experience. Experience with service reliability engineering and incident management for mission-critical systems. Familiarity with open-source frameworks and standards related to observability. Demonstrated ability improving system reliability and performance at scale.
Original Posting
This role is sourced from Microsoft. Apply on Microsoft careers page