
商汤部署工程师
社招全职5年以上系统开发地点:上海状态:招聘
任职要求
1、5年以上工作经验,3年以上的嵌入式开发工作经验; 2、熟悉深度神经网络以及大模型的主流网络架构; 3、至少熟悉一种主流推理框架,如vLLM、HuggingFace Transformers,TensorRT等; 4、具有算子优化能力能力; 5、有NVIDIA、MediaTek、Qualcomm、地平线等平台部署经验者优先; 6、熟练掌握C/C++、Python、Git、CMake、Makefile等基本技能。
工作职责
1、负责端上(Linux/Android)平台的模型部署; 2、负责大模型在NPU/DSP/GPU/CPU开发与部署; 3、负责大模型在端侧的量化及推理性能优化; 4、 负责大模型测试工具的开发;
包括英文材料
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
vLLM+
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
TensorRT+
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Git+
https://www.youtube.com/watch?v=rH3zE7VlIMs
Learn Git from start to finished in this full course written by ThePrimeagen.
CMake+
https://cmake.org/getting-started/
We want to give you the resources you need to confidently leverage CMake as your build system of choice.
https://learnxinyminutes.com/zh-cn/cmake/
CMake 是一个跨平台且开源的自动化构建系统工具。通过该工具你可以对你的源代码进行测试、编译或创建安装包。
https://www.youtube.com/watch?v=7YcbaupsY8I
CMake introduction for absolute beginners.
相关职位

社招5年以上算法工程
模型部署与优化工程师(端侧) 1.负责端上(Linux/Android)平台的模型部署; 2.负责大模型在NPU/DSP/GPU/CPU开发与部署; 3.负责大模型在端侧(NV/MTK/Qualcomm等)的量化及推理性能优化; 4.负责大模型测试工具的开发;
更新于 2025-05-21
社招3年以上W9692
职位描述 1. 将图像算法落地到具体嵌入式平台上,解决工程化问题。 2. 实现跨平台的图像算法SDK开发,优化性能和资源。 3. 设计测试方案, 评估图像算法的性能和准确性。 4. 与数据公司合作,提供方案以获取符合需求的图像数据集。
更新于 2023-10-16
校招
1. 音频算法优化:分析和优化现有的音频处理算法,以提高性能和效率; 将深度学习算法推理用C/C++实现。;对算法做定点化,以便更高效运行在嵌入式系统中; 完成AI算法在芯片上的部署,必要时需要考虑量化噪声等问题进行模型重新训练; 2. 系统集成:将音频/健康算法集成到嵌入式系统中,确保与其他系统组件的无缝协作;参与系统架构设计,确保音频处理模块的高效集成; 3. 代码开发和维护:编写高质量、可维护的代码,遵循最佳编程实践;进行代码审查,确保代码质量和性能; 4. 测试和验证:和测试工程师合作,验证算法的功能和性能;解决算法处理中的问题和故障,确保系统的稳定性和可靠性。
更新于 2025-09-05