Design, develop, and deploy end-to-end AI/ML systems, including data ingestion, model training, evaluation, and integration into production environments. Ship and maintain large-scale AI applications, with a focus on performance monitoring and continuous improvement. Stay current with the latest advancements in GenAI, LLMs, and AI frameworks.
Responsibilities
Build and optimize applications leveraging LLMs and open-source GenAI frameworks such as LangChain, LlamaIndex, Haystack, Transformers, AutoGen, and DSPy. Implement advanced GenAI techniques including agent-based orchestration, prompt engineering, retrieval-augmented generation (RAG), and model fine-tuning. Write production-grade software in Python and C# or Java, ensuring maintainability, scalability, and performance. Collaborate with cross-functional teams to translate business requirements into technical solutions. Conduct rigorous evaluation of AI models using appropriate metrics and benchmarks. Optimize models for latency, throughput, and accuracy in real-world scenarios. Work closely with data scientists, product managers, and other engineers to drive AI initiatives. Prototype and experiment with emerging technologies to assess feasibility and impact.
Required Qualifications
Bachelor's Degree in Computer Science, Data Science or related technical field Minimum 9+ Years of total Experience. 5+ years technical engineering experience with coding in languages including C#, Java AND Python Should have 5+ years in Data Science experience 3 years of experience with LLMs and open-source GenAI frameworks, such as LangChain, LlamaIndex, Haystack, or equivalents (e.g., Transformers, AutoGen, DSPy), including agent-based orchestration, prompt engineering, retrieval-augmented generation (RAG), and fine-tuning and evaluation.โ Proficiency in writing production-quality software code in one or more modern programming languages (Python, C#) 3+ years experience developing software systems end-to-end, from design to implementation. 2+ years experience in shipping at least 2 large scale ML/AI-based services or applications on cloud platforms (Azure, AWS, GCP, etc.)
Original Posting
This role is sourced from Microsoft. Apply on Microsoft careers page