小鹏汽车端侧模型部署工程师
社招全职地点:上海状态:招聘
任职要求
1、熟练掌握Python/C++编程语言,熟悉深度学习框架(如PyTorch、TensorFlow、ONNX等); 2、熟悉模型部署流程,包括模型导出、转换、优化、打包等; 3、熟悉模型压缩与优化技术,如量化、剪枝、蒸馏等; 4、熟悉主流推理引擎(如TensorRT、OpenVINO、ONNX Runtime等),具备推理加速经验者优先; 5、熟悉Linux系统开发与调试,具备脚本编写能力; 6、熟悉自动驾驶系统架构,了解模型在感知、决策、控制等模块中的应用; 7、熟悉CUDA生态,具备CUDA算子开发或优化经验者优先; 8、了解异构计算平台(如GPU、NPU、DSP)的部署与优化,有相关经验者优先; 9、熟悉典型自动驾驶/多模态模型结构,包括感知、决策、控制等模块的集成与部署。
工作职责
1、负责深度学习模型的部署与优化,确保模型在目标硬件平台上的高效运行; 2、参与模型压缩、量化、剪枝等优化技术的研究与实现,提升模型在边缘设备上的推理效率; 3、与算法团队协作,将训练完成的模型封装为可部署的推理服务,支持多平台适配; 4、负责模型部署后的性能监控与调优,确保模型在不同硬件平台上的稳定性与一致性; 5、编写自动化部署脚本与测试用例,提升部署效率与自动化水平; 6、跟踪前沿技术,探索模型部署在边缘计算、嵌入式系统等场景下的应用潜力; 7、参与CUDA生态的底层算子开发与优化,提升模型在GPU平台上的推理性能; 8、与硬件团队协作,优化模型在异构计算平台(如GPU、NPU、DSP)上的部署方案;
包括英文材料
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
ONNX+
https://github.com/onnx/tutorials
Open Neural Network Exchange (ONNX) is an open standard format for representing machine learning models.
[英文] Introduction to ONNX
https://onnx.ai/onnx/intro/
This documentation describes the ONNX concepts (Open Neural Network Exchange).
推理引擎+
https://www.youtube.com/watch?v=_dvk75LEJ34
https://www.youtube.com/watch?v=XtT5i0ZeHHE
TensorRT+
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
脚本+
[英文] Scripting language
https://en.wikipedia.org/wiki/Scripting_language
https://zhuanlan.zhihu.com/p/571097954
一个脚本通常是解释执行而非编译。脚本语言通常都有简单、易学、易用的特性,目的就是希望能让程序员快速完成程序的编写工作。
自动驾驶+
https://www.youtube.com/watch?v=_q4WUxgwDeg&list=PL05umP7R6ij321zzKXK6XCQXAaaYjQbzr
Lecture: Self-Driving Cars (Prof. Andreas Geiger, University of Tübingen)
https://www.youtube.com/watch?v=NkI9ia2cLhc&list=PLB0Tybl0UNfYoJE7ZwsBQoDIG4YN9ptyY
You will learn to make a self-driving car simulation by implementing every component one by one. I will teach you how to implement the car driving mechanics, how to define the environment, how to simulate some sensors, how to detect collisions and how to make the car control itself using a neural network.
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
相关职位
社招
负责自动驾驶端侧大模型的部署与优化工作; 研究并落地大模型优化相关技术,包括模型量化、算子优化等,推动在自动驾驶业务中的应用; 参与模型部署与优化工具链的研发工作; 与算法团队协同配合,完成从模型训练到部署的全链路优化,确保软硬件之间的高效协同。
更新于 2025-07-08
社招3年以上
1、负责将复杂的AI模型优化并部署到端侧硬件平台,比如NVIDIA等系列(包括Orin、Xavier、Nano等)。 2、优化模型以满足实时处理的需求,并确保在资源受限的设备上高效运行。 3、负责实现优化模型中部分算子实现,使其在合适的平台(CPU/GPU/DSP等)部署运行,提升模型效率。 4、参与跨部门项目,确保技术解决方案满足业务需求。 5、持续跟踪最新的AI研究进展,并评估其在端侧部署中的可行性。 6、编写技术文档,包括部署指南、性能评估报告和故障排除手册等。
更新于 2024-11-28