Set technical direction in multilingual speech models, speech LLMs, and model customization, with impact on accuracy, latency, and compute. Mentor and influence peers, sharing expertise and fostering a growth-oriented, inclusive team culture. BS/MS/.
Responsibilities
Build novel data generation solutions to synthesize complex speech scenarios and fine-tune models. Build data analysis metrics and tools to understand model results, identify gaps, and guide solutions.
Required Qualifications
PhD degree in CS/EE or a related field with a strong focus on speech recognition systems, machine learning, and AI technology innovation.
8+ years of experience in speech or machine learning in an academic or industrial setting, or 8+ years of software development experience with an aptitude for software design, coding, and quality.
Demonstrated problem-solving skills in speech and machine learning.
Proven track record of delivering impactful results and high-quality solutions in complex technical environments.
Strong programming skills in Python, C++, or similar languages, with experience in large-scale data processing and distributed computing.
Effective verbal and written communication skills.
Experience with speech/audio processing, multilingual model development, or voice agent technologies.
Familiarity with Azure, cloud-based AI platforms, or enterprise-scale deployment of speech solutions.
Contributions to open-source projects, patents, or publications in top-tier conferences or journals.
Demonstrated leadership in driving technical direction, influencing cross-functional teams, and mentoring peers.
Original Posting
This role is sourced from Microsoft. Apply on the Microsoft careers page.