蔚来AI算子开发工程师
社招全职3-5年算法地点:杭州 | 上海 | 合肥 | 深圳状态:招聘
任职要求
岗位要求: 1.熟悉C/C++、python编程,有较好的编程习惯和编程基础; 2.熟悉常用数据结构及算法,如堆/栈/队列/树/图等; 有如下经验优先考虑: 1.熟悉常用NN算子的实现方式,有算子开发和优化经验,包括Conv,DeConv,激活,Pool,Attention等; 2.有计算机体系结构背景,熟悉芯片架构/微架构(包括CPU/GPU/SoC/FPGA等),有软硬件协同设计经验; 3.熟悉GPGPU硬件架构,熟悉CUDA,熟悉cuDNN,有深度学习计算框架优化经验尤佳; 4.有复杂算法(如图像与视频处理、计算机视觉等)CUDA/OpenCL/汇编级优化经验尤佳;
工作职责
职位描述: 1.负责AI处理器的高性能算子方案设计,功能开发,性能优化工作; 2.负责算子自测试用例的设计和开发,保证算子的质量提升和 CI 看护; 3.探索AI高性能算子的软硬件协同优化方案,提升硬件的利用效率 4.开发算子库开发所需的功能和性能分析工具,提升算子库开发效率
包括英文材料
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
编程规范+
[英文] Google Style Guides
https://google.github.io/styleguide/
Every major open-source project has its own style guide: a set of conventions (sometimes arbitrary) about how to write code for that project. It is much easier to understand a large codebase when all the code in it is in a consistent style.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
SOC+
https://www.arm.com/resources/education/books/modern-soc
The aim of this textbook is to expose aspiring and practising SoC designers to the fundamentals and latest developments in SoC design and technologies using examples of Arm Cortex-A technology and related IP blocks and interfaces.
https://www.arm.com/resources/education/education-kits/introduction-to-soc
To produce students with solid introductory knowledge on the basics of SoC design and key practical skills required to implement a simple SoC on an FPGA and write embedded programs targeted at the microprocessor to control the peripherals.
https://www.youtube.com/watch?v=dokgLSAhqHI
A key part of the digital innovation revolution has been the embrace of the SoC, or system-on-chip.
FPGA+
https://nandland.com/fpga-101/
These are the fundamental concepts that are important to understand when designing FPGAs.
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
OpenCV+
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
OpenCL+
https://developer.nvidia.com/opencl
OpenCL™ (Open Computing Language) is a low-level API for heterogeneous computing that runs on CUDA-powered GPUs.
https://engineering.purdue.edu/~smidkiff/ece563/NVidiaGPUTeachingToolkit/Mod20OpenCL/3rd-Edition-AppendixA-intro-to-OpenCL.pdf
we will give a brief overview of OpenCL for CUDA programers.
[英文] Hands On OpenCL
https://handsonopencl.github.io/
An open source two-day lecture course for teaching and learning OpenCL
https://leonardoaraujosantos.gitbook.io/opencl/chapter1
Open Computing Language is a framework for writing programs that execute across heterogeneous platforms.
https://ulhpc-tutorials.readthedocs.io/en/latest/gpu/opencl/
OpenCL came as a standard for heterogeneous programming that enables a code to run in different platforms.
https://www.youtube.com/watch?v=4q9fPOI-x80
This presentation will show how to make use of the GPU from Java using OpenCL.
相关职位
校招算法
1. 参与面向空间数据流推理芯片架构、基于 MLIR 框架的AI 算子编译器/算子库开发; 2. 参与包括AI大模型在内的各类 AI 负载的端到端编译,自动编译生成算子代码实现; 3. 根据模型负载和硬件架构特性,对AI负载进行热点分析和性能优化; 4. 参与软硬一体 AI 推理芯片架构的落地。
校招算法
1. 参与面向空间数据流推理芯片架构、基于 MLIR 框架的AI 算子编译器/算子库开发; 2. 参与包括AI大模型在内的各类 AI 负载的端到端编译,自动编译生成算子代码实现; 3. 根据模型负载和硬件架构特性,对AI负载进行热点分析和性能优化; 4. 参与软硬一体 AI 推理芯片架构的落地。
校招算法
1. 参与面向空间数据流推理芯片架构、基于 MLIR 框架的AI 算子编译器/算子库开发; 2. 参与包括AI大模型在内的各类 AI 负载的端到端编译,自动编译生成算子代码实现; 3. 根据模型负载和硬件架构特性,对AI负载进行热点分析和性能优化; 4. 参与软硬一体 AI 推理芯片架构的落地。