Microsoft is a highly innovative company that collaborates across disciplines to produce cutting-edge cloud technology that changes our world. The Cloud Server Infrastructure (CSI) team in Microsoft’s Azure C+E division is responsible for delivering server infrastructure for Microsoft’s online services. The hardware for operating these services (over 200 and counting) comprises hundreds of thousands of servers spread globally, running applications that reach hundreds of millions of users every day. Our customer base is growing rapidly, our infrastructure investments are multiplying, and the size of our global infrastructure is increasing by the day - along with the scale of our challenges. Learn more about our team and projects here: https://azure.microsoft.com/en-us/blog/ecosystem-momentum-positions-microsoft-s-project-olympus-as-de-facto-open-compute-standard/
We are seeking a motivated performance engineer to work on all aspects of the performance and architecture of Artificial Intelligence (AI) chips and systems, including understanding frameworks, analyzing neural networks, and characterizing performance on deployed hardware. The development environment is highly collaborative and involves leveraging other teams as well as understanding customers’ and partners’ requirements. Development targets sophisticated programmable AI processors in a cloud server environment. An appropriate candidate will be able to draw on a broad HW/SW background to independently tackle complex problems and provide innovative solutions.
Primary responsibilities include:
• Building simulation infrastructure, creating stimulus and running simulations to analyze neural networks and AI applications.
• Compare potential chip and system architectures.
• Guide implementation tradeoffs.
• Define, review, and verify performance monitoring features in target silicon.
• Use performance monitoring features to assess usage patterns and identify performance bottlenecks in deployed HW.
• Perform performance analysis to identify opportunities for HW acceleration.
MS required. Computer science and/or computer engineering preferred. PhD preferred.
5+ years of development experience in hardware or software in all stages of a release cycle (i.e. architecture, design, algorithms, implementation, performance testing or validation).
3+ years working with ASIC or silicon design and development.
Strong proficiency developing functional or performance simulation in industry or academia.
Strong proficiency with register-level hardware interfaces and specifications.
Interest in working in a cross-functional team environment.
Excellent communication skills.
The following is a plus:
A mix of hardware and software experience.
Familiarity with TensorFlow, CNTK, Caffe, Python, etc.
Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to email@example.com.
A little about us:
Microsoft offers training and employment opportunities to help you turn your military experience and skills into a civilian technology career.