阿里巴巴1688-AI Infra工程-杭州
社招全职2年以上地点:杭州状态:招聘
任职要求
1. 熟悉Linux开发环境,熟练掌握Python、C/C++等一种或多种语言; 2. 熟练掌握vllm、sglang、rtp-llm等大模型推理加速框架,以及kvcache、pd分离、投机采样等大模型推理加速技术; 3. 熟悉CUDA,有…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
主导/参与1688的AI Infra建设: 1. llm推理框架的研发与优化,训练/强化学习框架的研发与优化,解决1688电商域的模型使用问题; 2. 算法-软件-硬件协同优化(异构并行计算、AI编译、稀疏量化、混部与弹性等),发挥1688集群的计算潜力; 3. 研究业界前沿的AI算法、系统和硬件,探索面大模型AI在线服务或离线批处理的最佳系统。
包括英文材料
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
vLLM+
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
还有更多 •••
相关职位
社招3年以上技术-开发
1. 负责llm后训练训推调优与框架优化,优化负载均衡策略,提升训练和推理效率; 2. 负责rl训练工程环境搭建,包括mcp工具,沙箱,agent等环境,确保其在处理大规模训练时的性能,提高其性能和稳定性; 3. 对设计与实现的功能进行测试和调优,保证其在不同环境下的运行效率。
更新于 2025-09-19杭州
社招3年以上技术-开发
1. 负责llm后训练训推调优与框架优化,优化负载均衡策略,提升训练和推理效率; 2. 负责rl训练工程环境搭建,包括mcp工具,沙箱,agent等环境,确保其在处理大规模训练时的性能,提高其性能和稳定性; 3. 对设计与实现的功能进行测试和调优,保证其在不同环境下的运行效率。
更新于 2026-01-15杭州
社招3年以下网易有道
1.结合HPC和AI前沿技术,设计和优化大模型训练和推理框架,负责模型优化、算子优化、图优化、分布式优化等,提升计算效率 2. 负责云侧或端侧大模型和小模型推理服务开发、性能优化、上线等工作
更新于 2025-11-03北京