DiDi Class of 2026 Campus Recruitment (Regular Batch) - AI Infra Engineer - L Lab
Campus hire | Full-time | Engineering - Backend | Location: Beijing | Status: Hiring
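Requirements
1. Graduating in 2026 with a bachelor's degree or above; majors in computer science, mathematics, statistics, automation, or related fields preferred.
2. Familiar with the post-training pipeline, with deep knowledge of RL, including but not limited to RM, PPO, DPO, and GRPO.
3. Able to develop large-model training frameworks, including PyTorch and Megatron.
4. Able to develop reinforcement learning frameworks, including OpenRLHF and Verl.
5. Strong hands-on C++/Python engineering skills; proficient in data structures and common algorithms; comfortable with compilation, debugging, and performance-analysis tools; familiarity with parallel programming (CUDA, Triton, etc.) is a plus.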
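Responsibilities
1. Contribute to the development of DiDi's internal post-training framework, focusing on LLM + RL: design the framework architecture and technical roadmap, and improve its scalability, stability, and efficiency.
2. Optimize framework performance, such as training speed and GPU memory usage, to reduce training cost and provide solid technical support for LLM + RL training.
3. Work with business teams to bring LLM capabilities into business scenarios, tailoring training plans to business needs and evaluating and validating the resulting models.
4. Follow industry developments, bring valuable techniques into the company's frameworks and models, and explore new algorithms and methods to drive technical innovation.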
Includes English-language materials
Education
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Large language models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
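PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.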
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Megatron
https://www.youtube.com/watch?v=hc0u4avAkuM
Reinforcement learning
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in Python.
Data structures
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial (Java)
CUDA
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Learn how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
Related positions
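Campus hire | Engineering - Backend
1. Help build the machine learning platform's AI infra, optimizing the full training and inference pipeline for a range of deep learning scenarios, including platform products, training and inference framework integration, storage acceleration, and GPU virtualization.
2. Track and research forward-looking deep learning technologies, and explore how to apply new techniques to internal scenarios.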
Updated 2025-08-18
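Campus hire | Robotics
1. Apply advanced deep learning algorithms to agent behavior understanding, prediction, and planning problems in autonomous driving.
2. Research and develop novel VLM and VLA models that understand and predict the behavior of other traffic participants in real time and produce sound motion plans.
3. Develop AI-based behavior models and apply them to on-vehicle prediction and planning systems or to Smart/Sim Agents in simulation.
4. Design and optimize AI-based online prediction and planning systems to improve real-time performance and robustness.
5. Develop and refine offline systems for model training, including data mining, data processing, and model evaluation and visualization.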
Updated 2025-08-18
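Campus hire | Data
1. Contribute to the product design and delivery of a one-stop big data development platform, including but not limited to metadata management, task development, task scheduling, data integration, data governance, and intelligent development features.
2. Handle cross-department communication efficiently, identify customer pain points accurately, understand the problems different users face in day-to-day development and analysis, design the platform's feature modules around business scenarios, and deliver complete solutions.
3. Coordinate upstream and downstream teams, own the delivery of product features and subsequent product promotion, and take responsibility for product outcomes.
4. Track industry developments, understand the strengths and weaknesses of comparable products, build a medium- and long-term plan for the platform based on internal needs, deliver it on schedule, and keep improving the product's competitiveness.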
Updated 2025-08-18