滴滴26届正式批-Post-training 研发工程师-基础产品
校招全职工程-后端类地点:杭州 | 北京状态:招聘
任职要求
1、2026届毕业生,本科及以上学历,计算机、数学、统计学、自动化等相关专业优先 2、熟悉Post-Training流程,深入了解Agentic RL训练,包括但不限于PPO、DPO、GRPO等算法 3、具备大模型训练框架开发能力,包括pytorch、megatron-lm等 4、具备强化学习框架开发能力,包括openRLHF、verl等 5、具备一线的C++/Python工程能力,精通数据结构和常用算法,掌握各种编译、调试、性能分析工具,熟悉并行编程(CUDA/Triton等)优先。
工作职责
1、参与大模型训练框架的开发、维护 2、参与Agentic RL 训练框架的开发、优化 3、和算法一起在网约车场景落地Agentic RL。
包括英文材料
学历+
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
相关职位
校招工程-后端类
1、参与滴滴内部 post-training 框架研发,聚焦 LLM + RL 方向,设计框架架构与技术路线,提升其扩展性、稳定性与效率 2、优化框架性能,如训练速度、显存占用等,降低训练成本,为 LLM + RL 训练提供有力技术支撑 3、协同业务团队,将 LLM 能力在业务场景落地,根据业务需求定制训练方案并评估验证模型 4、关注行业前沿,引入有价值的技术到公司框架和模型中,探索新算法与方法,推动技术创新。
更新于 2025-08-25
校招产品类
1、熟悉LBS产品市场需求,通过与客户沟通、行业分析来定义地图要素的当前和未来的需求 2、制定产品目标或愿景,向相关方清晰传达产品的战略和规划 3、推动某项产品从需求调研、到发布上线,领导各功能团队来定义和实现,如模型规格、工具平台、生产运营、应用引擎等 4、管理整个产品线的生命周期,从产品规划一直到应用效果 5、主导制定某项产品的用途(usage)、范围(scenario)、模型(model)、规格(specifications)、手册(SOP)、用法(rules)等文档 6、主导定义和设计某项产品的工作流程(workflow & pipeline)、合格质量标准(access quality)、交付内容(content、format、release note、etc.) 7、熟悉并能够使用前沿的技术来提高产品的质量,加快产品迭代的速度,降低产品的成本(TOC)。
更新于 2025-08-21