大疆中/高级高性能计算工程师(推理优化)
社招全职4年以上嵌入式地点:深圳状态:招聘
任职要求
1. 硕士及以上学历,4年及以上相关经验; 2. 熟悉至少1款主流移动端处理器的芯片架构和NN优化策略,有基于NPU/DSP/GPU的NN和图像算法深入部署调优经验; 3. 熟悉至少1种主流NN部署框架,包括但不限于QNN/coreml/MNN/ncnn/caffe/tensorflow等; 4. 了解常用的模型压缩技术,包括不限于蒸馏、剪枝、量化、稀疏等; 5. 熟练掌握C/C++/python编程,具备良好的软件工程习惯; 6. 具备良好的学习能力,自驱力和沟通协调能力。
工作职责
1. 负责NN算法、图像算法在主流移动端处理器上的部署和优化,达成模型(含大模型)推理的耗时/功耗等目标; 2. 负责NN部署框架设计、开发实现、算子优化和工具链维护; 3. 负责撰写相关业务设计文档。
包括英文材料
学历+
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Core ML+
[英文] Getting Started
https://apple.github.io/coremltools/docs-guides/source/introductory-quickstart.html
Core ML Tools can convert trained models from other frameworks into an in-memory representation of the Core ML model.
https://developer.apple.com/machine-learning/core-ml/
Core ML is optimized for on-device performance of a broad variety of model types by leveraging Apple silicon and minimizing memory footprint and power consumption.
https://www.youtube.com/watch?v=g3yj9_DHrME
Bring the power of machine learning directly to your apps with Core ML.
MNN+
https://github.com/alibaba/MNN?tab=readme-ov-file#intro
MNN is a highly efficient and lightweight deep learning framework.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
相关职位
社招嵌入式
1. 大规模及中小规模模型分布式训练的性能优化,包括数据读取、算子优化、通信优化、显存优化等,加速训练过程,提升训练系统稳定性、资源利用效率及面向目标平台(如嵌入式设备)的可部署性; 2. 持续分析、优化大规模多机集群及中小规模训练任务的性能,与算法同事协作优化训练系统的整体效率和稳定性; 3. 负责云端推理服务的性能优化与落地,包括模型转换、计算图优化、算子融合、低精度推理(INT8/FP16)、推理框架适配(如TensorRT等),提升推理吞吐量、降低延迟和资源消耗; 4. 跟进业内先进的训练框架、推理框架及训练/推理优化技术,推动其在业务中的实践。
更新于 2025-06-24
社招4年以上嵌入式
1. 负责自研芯片AI编译器方案设计及开发实现(侧重点为高能效比与加速器的高利用率); 2. 负责开发编译器后端优化Pass,如指令调度、内存分配等,最大化发挥NPU算力; 3. 负责开发编译器性能调优工具链,支持模型推理效率分析和自动化优化。
更新于 2025-05-22
校招AI/算法类
专注于大模型系统优化、异构计算的前沿技术研究和落地,研究领域包括不限于高性能大模型系统架构、LLM-as-a-Service技术等。 岗位职责: 1. 负责大模型轻量化及推理优化的研究,支持大模型在云侧及端侧的高效推理及微调; 2. 负责端上大模型及AI智能体运行引擎的研发和部署。
更新于 2025-07-23