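Kuaishou [Kuai Star-X] GenAI Heterogeneous Computing Architecture and Optimization Engineer
Campus recruitment | Full-time | J1020 | Location: Beijing | Status: Hiring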
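Qualifications
1. Technical skills: Proficient in C/C++ and Python development on Linux, with a solid grounding in computer architecture, operating systems, and compiler principles. In-depth understanding of the low-level implementation of deep learning frameworks (for example, computation graph optimization and runtime scheduling in TensorFlow/PyTorch). Familiar with at least one mainstream heterogeneous computing architecture (such as NVIDIA CUDA, AMD ROCm, or Google TPU) and its programming model. Experience with high-performance operator development or model training/inference optimization is a plus.
2. Experience: Hands-on project experience in areas such as AI chip evaluation, model optimization, or high-performance computing. Publications at top conferences (ASPLOS, ISCA, MLSys, etc.) or research experience with AI compiler technologies (MLIR, TVM, etc.) are a plus.
3. General qualities: Strong algorithmic thinking, system architecture design, and engineering implementation skills. A keen interest in frontier AI technology, with the ability to learn quickly and solve complex technical problems.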
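Responsibilities
1. Evaluate, select, and deeply optimize heterogeneous computing chips (GPU, NPU, ASIC, etc.), and build a compute evaluation framework oriented to business scenarios.
2. Lead the design and implementation of AI inference engines on target chips, delivering millisecond-level low latency and high-throughput inference.
3. Optimize the design and implementation of large-scale model training frameworks, improving distributed training efficiency and shortening model iteration cycles.
4. Develop high-performance operator libraries, break through chip compute bottlenecks, and maximize hardware utilization.
5. Drive innovation in heterogeneous programming paradigms, lowering model migration costs and improving development efficiency.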
Includes English-language materials
Linux
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
C
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
This handbook follows the 80/20 rule: you will learn 80% of the C programming language in 20% of the time.
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, beginner-friendly, with complete examples, based on the latest Python 3 version.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in Python.
Deep Learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
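PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.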
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
CUDA
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Learn how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
System Design
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
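Related positions
Experienced hire | 7+ years | Product operations
1. Drawing on multi-perspective insight and analysis and in line with company strategy, define mobile phone product strategy and selling-point direction to ensure product competitiveness; strong strategic insight and analytical skills are essential to support P+3 work.
2. Plan and define products, produce the product brief, and ensure that product competitiveness and product goals are realized in the project.
3. Handle integrated marketing and steer marketing direction so that the product's tone and core benefits are communicated.
4. Manage the product's go-to-market, take part in setting the sales strategy, and ensure healthy profit and loss and achievement of sales targets across the full product lifecycle.
5. Stay sensitive to the industry, hold high standards for product experience, bring a strong user mindset, and continuously drive product and experience improvements.
Experienced hire | 3+ years | TEG | Technology
1. Take part in system architecture design and module development for the embodied intelligence platform, opening up the lab's algorithm and data capabilities and providing stable, efficient, and secure services.
2. Take part in building the embodied intelligence simulation platform, supporting iterative optimization of algorithms for multimodal perception, decision-making and planning, motion and manipulation, and human-robot interaction.
3. Take part in building the cloud-edge collaboration system, the security protection system, and resource allocation and optimization capabilities.
4. Take part in building the robotics community ecosystem, including developer toolkits, technical forums, and resource-sharing centers.
Updated 2025-06-12
Experienced hire | TEG | Product
1. Develop a deep understanding of the algorithm models and technical services offered by the embodied intelligence open platform; promote the platform to developers in the robotics industry and follow up on their results; organize developer-facing activities to grow the platform's user coverage and activity.
2. Run operations for the embodied intelligence developer community, provide technical support for developers' questions about related algorithms, datasets, and simulation environments, and maintain developer relations.
3. Work closely with the R&D team, relay market needs and customer feedback promptly, and drive continuous product iteration and optimization.
Updated 2025-05-29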