AMD AI Compiler Development Engineer
Requirements
Strong technical and analytical skills in C++ development in a Linux environment. Ability to work as part of a team while also working independently: defining goals and scope and leading your own development effort.
KEY RESPONSIBILITIES:
• Optimize Deep Learning Frameworks: In-depth experience enhancing and optimizing frameworks like TensorFlow and PyTorch for AMD GPUs in open-source repositories.
• Develop GPU Kernels: Create and optimize GPU kernels to maximize performance for specific AI operations.
• Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance.
• Collaborate with GPU Library Teams: Work tightly with internal teams to an…
Job Responsibilities
THE ROLE: MTS Software Development Engineer on teams building and optimizing deep learning applications and AI frameworks for AMD GPU compute platforms. Work as part of an AMD development team and the open-source community to analyze, develop, test, and deploy improvements that make AMD the best platform for machine learning applications.
1. Work closely with hardware/architecture engineering and software teams to understand the hardware and software requirements; participate in hardware/software co-design of AI chips and functional verification of the instruction set.
2. Design, implement, and maintain the AI chip compiler and toolchain, including compiler algorithm design and performance tuning of network models.
3. Design and implement the deep learning software stack.
We are now looking for a Deep Learning Performance Software Engineer! We are expanding our research and development for deep learning and seek excellent Software Engineers and Senior Software Engineers to join our team. We specialize in developing GPU-accelerated deep learning software. Researchers around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in numerous areas. Join the team that builds software to enable new solutions. The ability to work in a fast-paced, customer-oriented team is required, and excellent communication skills are necessary.
What you’ll be doing:
• Develop deep learning compilers
• Develop highly optimized deep learning kernels
• Optimize end-to-end performance
• Do performance optimization, analysis, and tuning
• Develop production-quality software that ships as part of NVIDIA's AI software stack, including optimized large language model (LLM) support.
• Analyze the performance of important workloads, tuning our current software, and proposing improvements for future software.
• Work with cross-collaborative teams of deep learning software engineers and GPU architects to innovate across applications like generative AI, autonomous driving, computer vision, and recommender systems.
• Adapt to the constantly evolving AI industry by being agile and excited to contribute across the codebase, including API design, software architecture, performance modeling, testing, and GPU kernel development.