AMD Software Development Engineer

Experienced hire, full-time, Engineering. Location: Shanghai. Status: Hiring.

Requirements


Skilled engineer with strong technical and analytical expertise in C++ development within Linux environments. The ideal candidate will thrive in both collaborative team settings and independent work, with the ability to define goals, manage development efforts, and deliver high-quality solutions. Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices are essential.

KEY RESPONSIBILITIES:
- Deep Learning & LLM Framework Optimization: Optimize major DL/LLM frameworks (TensorFlow, PyTorch, vLLM, SGLang) for AMD GPUs and contribute improvements upstream.
- GPU Kernel & Operator Optimization: Develop and tune GPU kernels and performance-critical operators to maximize throughput and minimize latency.
- Model & Architecture Optimization: Adapt and optimize LLM architectures (e.g., Llama, Qwen, DeepSeek) and apply advanced techniques like FlashAttention, PagedAttention, and quantization.
- End-to-End Performance Engineering: Perform comprehensive profiling to identify bottlenecks and implement system, memory, and communication optimizations across multi-GPU and multi-node setups.
- Compiler & Pipeline Acceleration: Leverage advanced compiler technologies and graph compilers to enhance the full deep learning and inference pipeline.
- Research & Advanced Techniques: Prototype and integrate emerging optimization methods such as speculative decoding and weight-only quantization into production systems.
- Cross-Team & Open-Source Collaboration: Collaborate with internal GPU library teams and open-source maintainers to align improvements and ensure seamless upstream integration.
- Software Engineering Excellence: Apply robust engineering practices to deliver maintainable, reliable, and production-quality performance optimizations.

MANDATORY EXPERIENCE:  Inference Frameworks, Model Architectures & Optimization Expertise: Deep practical experience with vLLM or SGLang, mastery of modern LLMs (e.g., DeepSeek, Qwen), strong theoretical grounding in Transformer/Attention/MoE/KV Cache, an…

Job Responsibilities


THE ROLE:  As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference performance across multi-GPU and multi-node systems. You will engage with both internal GPU library teams and open-source maintainers to ensure seamless integration of optimizations, utilizing cutting-edge compiler technologies and advanced engineering principles to drive continuous improvement.
Related topics (including English-language materials): C++, Linux, large language models, development frameworks, TensorFlow, PyTorch, vLLM, SGLang, kernels
Related Positions

Amazon, experienced hire, Software

- Lead the development of innovative new Android features, applications, and initiatives across the organization.
- Investigate, prototype, and deliver new and innovative software applications.
- Deliver high-quality software by working in a diverse, team-focused Agile/Scrum environment.
- Instil best practices for software development and documentation, ensure designs meet requirements, and deliver quality work.
- Support development activities by being onsite with partners and vendors.

Updated 2025-09-26, Shenzhen
Apple, experienced hire, Software

• Define and deliver scalable test software architecture usable across multiple product lines.
• Build drivers, applications, protocols, frameworks, and utilities that power Apple test systems.
• Collaborate with cross-functional partners in Hardware, Software, Operations, and CoreOS.
• Develop and deploy calibration and restore software solutions for new product introductions.
• Expand CI/CD pipelines with automation, testing frameworks, and diagnostic utilities.
• Investigate and resolve issues with hands-on debugging and performance optimization.
• Partner with and lead vendors to deliver robust, high-quality software solutions.
• Drive continuous improvement in software design, system efficiency, and development processes.

Updated 2025-09-04, Shenzhen
AMD, experienced hire, Enginee

THE ROLE:  As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your strong experience will be critical in enhancing GPU kernels, deep learning models, and training/inference performance across multi-GPU and multi-node systems. You will engage with both internal GPU library teams and open-source maintainers to ensure seamless integration of optimizations, utilizing cutting-edge compiler technologies and advanced engineering principles to drive continuous improvement.

Updated 2025-10-25, Shanghai
AMD, experienced hire, Enginee

THE ROLE: The Khronos3D driver team is part of AMD's platform software engineering organization, G&E, and is responsible for Vulkan and OpenGL driver engineering across AMD GPU/APU products. We are looking for a talented Linux graphics driver engineer to join our team to drive and improve AMD graphics driver quality and to achieve successful software stack deliveries and customer product deployments.

Updated 2025-11-14, Shanghai