
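SenseTime Model Deployment Engineer
Experienced hire | Full-time | 5+ years | Algorithm Engineering | Location: Shanghai | Status: Hiring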
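Requirements
1. Bachelor's degree or above in computer science, electronic engineering, automation, or a related field;
2. At least 5 years of work experience, including at least 3 years of embedded development;
3. Familiarity with computer system architecture and with software performance optimization and acceleration;
4. Familiarity with at least one mainstream inference framework, such as vLLM, HuggingFace Transformers, or TensorRT;
5. Familiarity with AI training frameworks (TensorFlow, PyTorch, Caffe, etc.) is a plus;
6. Deployment experience on NVIDIA, MediaTek, Qualcomm, Horizon Robotics (地平线), or similar platforms is a plus;
7. Proficiency in C/C++, Python, Git, CMake, Makefile, and other basic skills.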
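Responsibilities
Model Deployment and Optimization Engineer (on-device)
1. Deploy models on on-device (Linux/Android) platforms;
2. Develop and deploy large models on NPU/DSP/GPU/CPU;
3. Handle quantization and inference performance optimization of large models on on-device platforms (NV/MTK/Qualcomm, etc.);
4. Develop testing tools for large models.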
Includes English materials
Education
vLLM
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
TensorRT
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
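PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.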
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
C
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
This handbook follows the 80/20 rule: you will learn 80% of the C programming language in 20% of the time.
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction to all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for absolute beginners, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction to all of the core concepts in Python.
Git
https://www.youtube.com/watch?v=rH3zE7VlIMs
Learn Git from start to finish in this full course written by ThePrimeagen.
CMake
https://cmake.org/getting-started/
We want to give you the resources you need to confidently leverage CMake as your build system of choice.
https://learnxinyminutes.com/zh-cn/cmake/
CMake is a cross-platform, open-source build automation tool. With it you can test, compile, or create installation packages for your source code.
https://www.youtube.com/watch?v=7YcbaupsY8I
CMake introduction for absolute beginners.
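Related positions
Experienced hire
Deploy and optimize large models on-device for autonomous driving; research and put into practice large-model optimization techniques, including model quantization and operator optimization, and drive their adoption in the autonomous driving business; take part in developing the model deployment and optimization toolchain; work closely with the algorithm team to optimize the full pipeline from model training to deployment, ensuring efficient coordination between software and hardware.
Updated 2025-07-08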

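Experienced hire | Algorithm track
Responsibilities: 1. Deploy and optimize end-to-end autonomous driving models on different hardware platforms, and take part in model evaluation; 2. Design and implement a toolchain for evaluating model consistency, ensure cross-platform consistency, and identify and resolve discrepancies; 3. Take part in software-hardware co-optimization design: collaborate with hardware engineers, contribute to hardware design and optimization, and improve model execution efficiency on proprietary hardware platforms.
Updated 2025-09-09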

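Experienced hire | Algorithm Engineering
1. Deploy and optimize end-to-end autonomous driving models on different hardware platforms, and take part in model evaluation; 2. Design and implement a toolchain for evaluating model consistency, ensure cross-platform consistency, and identify and resolve discrepancies; 3. Take part in software-hardware co-optimization design: collaborate with hardware engineers, contribute to hardware design and optimization, and improve model execution efficiency on proprietary hardware platforms.
Updated 2025-10-11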