Build and scale our inferencing cloud across our flagship AOAI Service and the growing Model as a Service model families. Collaborate with appropriate stakeholders (e.g., project manager, technical lead) to determine user requirements. Lead discussions on the architecture of products and solutions, and create architecture proposals by testing design hypotheses. Lead by example within the team by producing extensible and maintainable code. Apply debugging tools and examine logs to verify as…
Required Qualifications
* Bachelor's degree in Computer Science or a related technical discipline and 4+ years of technical engineering experience coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python; or equivalent experience
* Experience with Python, PyTorch, LLMs, and generative AI
* Experience with distributed systems design and implementation
* Proficiency in Agile development practices and Continuous Integration/Continuous Deployment (CI/CD)
* Passion for machine learning, artificial intelligence, data science, LLM scaling, and LLM inferencing engines
* Experience working on large-scale projects or applications
* Good communication skills and ability to collaborate with diverse remote teams
* Quick learner with a passion for solving complex and exciting problems
* Familiarity with Azure is a plus
Original Posting
This role is sourced from Microsoft. Apply on the Microsoft careers page.