美团【北斗】大模型训推引擎工程师(后训练/推理方向)
校招全职核心本地商业-业务研发平台地点:北京 | 上海状态:招聘
任职要求
【任职资格】 必备项: 1.2027届本科及以上学历,计算机、人工智能等相关专业; 2.具备良好的计算机基础素养和分析解决问题的能力,熟练掌握C/C++或Python编程语言; 3.学习能力强,对AI Infra方向有技术热情,富有极客精神; 4.具备扎实的机器学习和深度学习基础,熟悉GPU、NPU硬件架构,熟悉CUDA,CCL,RDMA或者任意机器学习DSL; 5.具有良好的沟通协作能力,工作积极主动。 加分项: 1.具备有影响力的大规模分布式系统、高性能计算项目经验; 2.在OSDI、SOSP、NSDI、MLSys、ICML、ICLR、NeurIPS等等顶级会议上有论文发表; 3.在高影响力开源项目(如vLL…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
方向一:面向Agent RL的LLM分布式后训练系统 1.多模态与超长上下文训练优化:针对万级以上超长序列(Long Context)及多模态输入,研究高效的上下文并行(CP/SP)、序列并行策略及分布式通信拓扑,优化内存足迹,支撑长文本与复杂多模态后训练。 2.计算资源弹性调度与容错容灾:研究超大规模集群下的弹性训练技术(Elastic Training),实现节点动态加入/退出、无感故障自动恢复(Fault Tolerance)与训练超参数自动寻优,提升集群算力有效利用率(MFU)。 3.Agent RL算法基础设施:构建面向Agent强化学习的高效Rollout引擎、多模型(Policy/Value/Reward/Reference)多阶段流水线编排及动态负载均衡机制;设计并建设支持高并发、低延迟的工具调用(Tool-use)虚拟环境/沙箱模拟器交互系统。 4.训推一致性协同设计(Co-design):打通训练与推理底层架构,研究在后训练阶段(如在线RL、Iterative DPO)如何实现训练框架与高性能推理引擎的深度融合,降低在线反馈延迟,实现训推一体化高效演进。 方向二:大模型高效推理与加速技术 1.面向Agent场景的动态推理架构:针对Agent多轮对话、工具调用等复杂交互带来的极致动态性,研究智能KV Cache管理优化策略(如动态复用、分级存储、长短时记忆重组),解决长序列及高并发下的显存瓶颈。 2.极致模型压缩:研究前沿的模型压缩技术,包括先进的高比例量化(INT4/FP4/低比特混合精度)、结构化/非结构化稀疏(Sparsity),并开发对应的硬件友好型Mega算子,压榨硬件极限性能。 3.下一代投机采样技术: 深入研究并落地最前沿的解码加速技术,包括Draft-Target分离部署架构、Structured Speculative Decoding等异构投机采样方案;探索多Token预测(MTP)在线联合验证、无轻量模型依赖的自投机(Self-Speculative)等免草稿模型加速路径。 4.前沿推理系统深度开发:跟踪并引领AI Infra业界最新进展,深度参与并优化主流开源推理框架,进行架构级的核心代码重构、极致的通信优化与算子融合,推动最新infra成果在核心业务的高效落地。
包括英文材料
学历+
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
还有更多 •••