Microsoft
Senior Machine Learning Research Engineer
Job Description
Responsibilities
- Driving model/HW codesign
- Developing and analysing novel NN architectures
- Inventing novel low-precision data formats
- Inventing novel model architectures
- Collaborating with data scientists and ML researchers
- Interfacing with HW architecture teams
- Interfacing with SW framework teams
Qualifications
- MS/PhD in Machine learning, Computer Architecture/Systems, High-Performance Computing or related areas
- Extensive experience in ML systems/Model optimizations/Efficient model architecture
- Track record of original research and delivering novel results in ML systems area
- Hands on experience with frameworks such as PyTorch/TensorFlow/TensorRT
- Deep knowledge of CNN/transformer architecture and optimization strategies – quantization, sparsity, NAS, sharding, KV Cache, Flash Attention
- Strong programming skills in Python/C/C++
- Experience in implementing low-level linear algebra/BLAS kernels and performance optimisations
- Outstanding communication skills
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.