
哈啰智驾决策规划算法工程师/专家(RL方向)-【自动驾驶】
社招全职算法地点:北京 | 上海状态:招聘
任职要求
1. 计算机科学、自动化、机器学习、机器人学等相关专业本科及以上学历; 2. 具备扎实的强化学习理论基础,熟悉主流RL算法(PPO、SAC、TD3、IQL等),并至少在一项自动驾驶或机器人项目中具有RL算法落地经验; 3. 熟练掌握Python/C++,熟悉PyTorch等深度学习框架,具备大规模强化学习分布式训练(如Ray、Kubernetes)或仿真平台开发经验者优先; 4. 熟悉自动驾驶决策规划常见方法(如MDP/POMDP、搜索与优化算法),并能够将强化学习与传统规划方法(如MPC、Lattice)结合解决实际问题; 5. 具备良好的数学基础…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 负责基于强化学习(如Online/Offline RL、Model-based RL)的自动驾驶行为决策与运动规划算法研发,重点解决结构化道路(高速、城市快速路)及非结构化场景(自动泊车)中的动态交互与博弈问题; 2. 针对复杂动态场景(密集车流、无保护路口、人车混流),设计基于数据驱动的决策规划算法,通过大规模分布式训练系统提升策略的智能性、安全性及泛化能力; 3. 构建与迭代仿真环境(如CARLA、NVIDIA Isaac)与世界模型,推动强化学习策略的仿真训练与实车迁移(Sim2real),形成“真实数据→仿真训练→实车验证”的闭环优化; 4. 参与全栈自动驾驶决策控制系统的开发,对接感知、预测、端到端模块,实现基于强化学习的决策规划算法在车载平台上的部署、性能优化与实车路测; 5. 跟踪强化学习在自动驾驶领域的前沿进展(如大模型与RL结合、逆强化学习、多智能体博弈),进行技术预研与算法创新,推动研究成果在量产项目中的应用与落地。
包括英文材料
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
学历+
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
自动驾驶+
https://www.youtube.com/watch?v=_q4WUxgwDeg&list=PL05umP7R6ij321zzKXK6XCQXAaaYjQbzr
Lecture: Self-Driving Cars (Prof. Andreas Geiger, University of Tübingen)
https://www.youtube.com/watch?v=NkI9ia2cLhc&list=PLB0Tybl0UNfYoJE7ZwsBQoDIG4YN9ptyY
You will learn to make a self-driving car simulation by implementing every component one by one. I will teach you how to implement the car driving mechanics, how to define the environment, how to simulate some sensors, how to detect collisions and how to make the car control itself using a neural network.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
还有更多 •••
相关职位
社招3年以上技术类-开发
我们是蚂蚁网络基础设施团队,为蚂蚁提供稳定高效的网络连接服务。我们致力于研发和建设一流的网络基础设施,通过技术驱动来不断优化通算和智算业务连接成本、性能和稳定性。 ● 负责蚂蚁AIDC网络架构设计、演进以及交换机或网卡协议栈等研发优化落地 ● 通过算法-框架-工程协同实现成本和性能全局最优
更新于 2025-04-03杭州
社招3年以上技术类-开发
我们是蚂蚁集团网络技术团队,为蚂蚁集团全站提供通智一体、稳定高效的网络基础设施产品、平台和服务。 ● 负责推理网关核心能力设计和开发; ● 通过创新的流量调度算法减少推理成本;
更新于 2025-04-03北京|杭州|成都
社招3年以上技术类-开发
我们是蚂蚁集团网络技术团队,为蚂蚁集团全站提供通智一体、稳定高效的网络基础设施产品、平台和服务。 ● 负责集合通信库的设计和研发; ● 通过稳定性建设和通信优化提升大模型训练效率和减少推理成本;
更新于 2025-04-03北京|杭州|成都