安克创新助理高性能计算工程师
校招全职地点:深圳状态:招聘
任职要求
1. 计算机、电子、自动化等相关专业,硕士及以上学历; 2. 熟悉C++、Python、CUDA编程语言; 3. 熟悉PyTorch/Megatron/DeepSpeed等业界主流训练框架,熟悉TensorRT-LLM/vLLM/SGLang等大模型推理引擎; 4. 良好的团队合作能力和沟通能力,能够与跨职能团队紧密合作,共同完成项目目标; 5. 有优秀的问题解决能力和自我驱动能力,能够独立思考并解决技术挑战。
工作职责
1. 设计并优化AI大模型训练框架,通过混合并行加速、训推一体复用等技术,提升大模型训练性能; 2. 针对模型的训练和推理任务进行底层代码级优化(CPU/GPU/异构计算); 3. 研发高效的故障定位系统和容错机制,保障大规模训练的稳定性,监控训练任务日志,快速识别和修复问题。
包括英文材料
学历+
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
DeepSpeed+
https://www.youtube.com/watch?v=pDGI668pNg0
TensorRT+
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
vLLM+
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
SGLang+
[英文] Install SGLang
https://docs.sglang.ai/get_started/install.html
SGLang is a fast serving framework for large language models and vision language models.
https://github.com/sgl-project/sgl-learning-materials
推理引擎+
https://www.youtube.com/watch?v=_dvk75LEJ34
https://www.youtube.com/watch?v=XtT5i0ZeHHE
相关职位
社招技术类-算法
负责蚂蚁数据分析平台的数据分析智能助理Copilot、数据分析Manus等智能化产品的算法研究、SFT、RFT模型优化,以及海量数据规模下的高性能数据分析算子研发。
更新于 2025-07-21
社招3-5年网易游戏(互娱)
1、负责BI平台服务端架构设计与核心代码编写,支撑海量数据实时分析、可视化交互及高性能查询 2、深入研究AI技术与BI场景的结合点,设计并实现AI智能助理、统一知识库、智能数据洞察、智能舆情等创新功能 3、主导技术难题攻关,解决分布式计算、低延迟响应等技术挑战 4、跟进行业技术趋势,推动大数据与AI技术在游戏业务场景的落地
更新于 2025-10-17