大疆高级嵌入式工程师(高性能计算)
社招全职8年以上嵌入式地点:深圳 | 上海状态:招聘
任职要求
1. 8年以上编译器开发经验,精通C++/Python,熟悉TVM/MLIR/LLVM等工具; 2. 深入理解NPU/GPU架构及计算特性,具备芯片软硬件协同优化经验; 3. 具备完整的AI编译器开发经验; 4. 熟悉AI模型压缩、量化等技术,了解自动驾驶、大模型等场景的编译需求。
工作职责
1. 负责AI编译器技术路线制定,主导自研AI编译器整体架构设计; 2. 负责NN编译优化技术方案,包括图优化、算子融合、内存分配等核心模块设计; 3. 负责NN编译与硬件协同设计,参与NPU指令集和架构设计,芯片性能验证等; 4. 负责跟踪AI编译器领域技术演进,规划技术演进方向与推动落地。
包括英文材料
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
LLVM+
https://llvm.org/docs/GettingStarted.html
Welcome to the LLVM project!
https://llvm.org/docs/tutorial/
This is the “Kaleidoscope” Language tutorial, showing how to implement a simple language using LLVM components in C++.
https://mcyoung.xyz/2023/08/01/llvm-ir/
“LLVM” is an umbrella name for a number of software components that can be used to build compilers.
https://www.youtube.com/watch?v=Lvc8qx8ukOI
This is the first lecture from the "Programming Language with LLVM" course where we build a full programming language similar to JavaScript from scratch, using LLVM compiler infrastructure.
自动驾驶+
https://www.youtube.com/watch?v=_q4WUxgwDeg&list=PL05umP7R6ij321zzKXK6XCQXAaaYjQbzr
Lecture: Self-Driving Cars (Prof. Andreas Geiger, University of Tübingen)
https://www.youtube.com/watch?v=NkI9ia2cLhc&list=PLB0Tybl0UNfYoJE7ZwsBQoDIG4YN9ptyY
You will learn to make a self-driving car simulation by implementing every component one by one. I will teach you how to implement the car driving mechanics, how to define the environment, how to simulate some sensors, how to detect collisions and how to make the car control itself using a neural network.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
相关职位
社招4年以上嵌入式
1. 负责自研芯片AI编译器方案设计及开发实现(侧重点为高能效比与加速器的高利用率); 2. 负责开发编译器后端优化Pass,如指令调度、内存分配等,最大化发挥NPU算力; 3. 负责开发编译器性能调优工具链,支持模型推理效率分析和自动化优化。
更新于 2025-05-22
社招5年以上软件
1. 负责相机业务软件核心框架的开发,保证其高性能、稳定性、低耦合等; 2. 负责框架基础组件/业务的方案设计、运营、提效; 3. 负责效能工具的发掘、设计、开发; 4. 负责支持产品化阶段遇到的框架性能、稳定性问题分析等。
更新于 2025-07-03
社招嵌入式
1. 大规模及中小规模模型分布式训练的性能优化,包括数据读取、算子优化、通信优化、显存优化等,加速训练过程,提升训练系统稳定性、资源利用效率及面向目标平台(如嵌入式设备)的可部署性; 2. 持续分析、优化大规模多机集群及中小规模训练任务的性能,与算法同事协作优化训练系统的整体效率和稳定性; 3. 负责云端推理服务的性能优化与落地,包括模型转换、计算图优化、算子融合、低精度推理(INT8/FP16)、推理框架适配(如TensorRT等),提升推理吞吐量、降低延迟和资源消耗; 4. 跟进业内先进的训练框架、推理框架及训练/推理优化技术,推动其在业务中的实践。
更新于 2025-06-24