千问千问事业部-大模型算子高级研发专家-杭州/北京/广州
社招全职3年以上地点:北京 | 杭州 | 广州状态:招聘
任职要求
1. 精通C++/CUDA/Python,具备扎实的计算机体系结构、并行计算和高性能计算基础,能够独立完成复杂GPU Kernel的设计、实现、调优和工程化落地; 2. 深入理解GPU硬件架构与性能优化方法,熟悉Tensor Core、Memory Hierarchy、Shared Memory、Register、Warp Scheduling、异步流水等机制,具备系统化性能分析和瓶颈定位能力; 3. 熟悉大模型推理核心算子,包括Attention、MLP、MoE GEMM、…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 参与大模型训练、推理核心算子的设计、开发与性能优化,覆盖Attention、MLP、MoE GEMM、RMSNorm、RoPE、Sampling、KV Cache读写、量化/反量化等关键算子,支撑千亿/万亿参数模型的低延迟、高吞吐推理; 2. 面向NVIDIA、AMD及其他通用AI加速硬件,研发高性能Kernel实现方案,充分利用Tensor Core、Shared Memory、异步流水、Persistent Kernel等硬件能力,持续提升算子吞吐、延迟和资源利用率; 3. 参与FP8、FP4、INT8、INT4等低比特推理相关算子研发与优化,推动量化算子、算子融合、图级优化与推理框架协同落地,降低端到端推理成本; 4. 针对线上真实负载开展系统化性能分析、Benchmark、性能归因与问题定位,解决算子性能瓶颈、稳定性和工程化落地问题,沉淀可复用的优化方法和工程实践。
包括英文材料
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
内核+
https://www.youtube.com/watch?v=C43VxGZ_ugU
I rummage around the Linux kernel source and try to understand what makes computers do what they do.
https://www.youtube.com/watch?v=HNIg3TXfdX8&list=PLrGN1Qi7t67V-9uXzj4VSQCffntfvn42v
Learn how to develop your very own kernel from scratch in this programming series!
https://www.youtube.com/watch?v=JDfo2Lc7iLU
Denshi goes over a simple explanation of what computer kernels are and how they work, alonside what makes the Linux kernel any special.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
还有更多 •••