DiDi Algorithm Engineer (VLM/LLM) (J250603009)
Experienced hire | Full-time | Technical | Location: Hangzhou | Status: Hiring
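Requirements
* Master's degree or above in computer science, artificial intelligence, machine learning, or a related field
* Background in VLMs (e.g., the Qwen-VL series, the InternVL series) or LLMs (e.g., the GPT series, LLaMA), with experience in large-model pre-training, fine-tuning, inference, or engineering deployment
* Proficiency in at least one deep learning framework (JAX, PyTorch, TensorFlow) and familiarity with toolchains such as Hugging Face Transformers
* Background in behavior prediction, time-series models, or imitation learning is preferred
* Proficiency in Python and C++
* Experience in model performance optimization (pruning, quantization, distillation, training and inference acceleration) is preferred
* Strong teamwork and communication skills, and a passion for technical innovation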
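Responsibilities
* Design and implement foundation models for behavior prediction and motion planning in autonomous driving, based on Vision-Language Models (VLM) and Large Language Models (LLM)
* Use multimodal pre-trained large models for trajectory generation and fusion, improving the foundation model's ability to understand and predict the intent of other traffic participants
* For on-vehicle and cloud deployment, carry out algorithm-level model performance optimization, such as compression, pruning, distillation, and training and inference acceleration, to ensure model usability, real-time system performance, and efficient resource utilization
* Work closely with the algorithm, software, and systems teams to drive model integration and deployment on simulation and real in-vehicle platforms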
Includes English-language materials
Machine Learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Education
Large Models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
GPT
https://www.youtube.com/watch?v=kCc8FmEb1nY
We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3.
Deep Learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
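PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.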
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for complete beginners, with full examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Llama
https://github.com/LlamaFamily/Llama-Chinese
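The Llama Chinese community: aggregates the latest Llama learning resources in real time and aims to build the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use.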
https://www.llama.com/docs/overview/
This guide provides information and resources to help you set up Llama including how to access the model, hosting, how-to and integration guides.
JAX
https://docs.jax.dev/en/latest/notebooks/thinking_in_jax.html
JAX is a library for array-oriented numerical computation, with automatic differentiation and JIT compilation to enable high-performance machine learning research.
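Related Positions
Campus recruitment | AIDU program
VLM model track:
- Design and implement a dedicated autonomous-driving VLM that combines open-source VLM large models with the autonomous driving domain, achieving semantic understanding of complex scenes and outputting decision semantics or behavior semantics;
- Carry out the corresponding model research, design, development, and deployment, covering both large server-side models and small on-vehicle models.
VLM data closed-loop track:
- Handle core algorithm work on the data crawling, mining, and automatic labeling needed for VLM training and evaluation;
- Handle core algorithm work on obtaining training and evaluation data, using industry large models for data generation, labeling, and similar tasks.
Updated 2025-05-19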
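Campus recruitment | AIDU program
VLM model track:
- Design and implement a dedicated autonomous-driving VLM that combines open-source VLM large models with the autonomous driving domain, achieving semantic understanding of complex scenes and outputting decision semantics or behavior semantics;
- Carry out the corresponding model research, design, development, and deployment, covering both large server-side models and small on-vehicle models.
VLM data closed-loop track:
- Handle core algorithm work on the data crawling, mining, and automatic labeling needed for VLM training and evaluation;
- Handle core algorithm work on obtaining training and evaluation data, using industry large models for data generation, labeling, and similar tasks.
Updated 2025-07-23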