阿里巴巴研究型实习生-超长序列下数据负载感知的MoE模型并行训练加速
实习兼职阿里巴巴研究型实习生地点:北京 | 杭州状态:招聘
任职要求
1、精通C/C++、Java、Python等至少一门编程语言,了解至少一种主流机器学习建模框架; 2、对于机器学习和数学有较强的理解能力,有机器学习领域经验(计算机视觉、语音、NLP、推荐系统)优先; 3、热衷于数据结构和算法、在ACM大赛成绩优异者优先; 4、有实际成果并发表在国际顶级会议、期刊者优先; 5、熟悉异构计算和高性能计算,有GPU、端侧等…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
针对超长序列输入,异构样本数据,设计和实现超大规模分布式大模型训练框架。结合多种上下文长度、多种模态的样本数据,适配稀疏混合专家模型、稀疏注意力机制等新型大模型结构。对算子优化、通信优化进行深入的挖掘,消除模型并行、流水并行等实现的瓶颈。设计数据感知的训练方案,消除异构数据对效率影响。联合多种新硬件,达到极致的训练推理成本,推进大模型在国内最大的电商场景的业务落地。 职位职责包括但不限于: 1. 设计和实现机器学习系统所需要的大规模分布式计算系统,参与机器学习全生命周期(训练、推理、MLOps、CI/CD、AB testing)组件的开发与优化。 2. 构建业界领先的超大规模稀疏训练引擎,通过软硬件协同以及工程算法联合优化,持续榨取高性能异构硬件的性能潜力,不断推高分布式训练的性能天花板。 3. 构建业界先进的多模态、大语言训练引擎,通过持续的分布式训练优化,不断提升算法的训练规模,提升硬件的使用效率。 4. 构建超大规模全流程机器学习平台,覆盖从特征工程、样本构建到模型量化交付,并通过构建SQL计算引擎、分布式特征服务、样本湖等,提升特征样本计算存储效率。 5. 图学习、联邦学习、强化学习等多个AI方向算法工程解决方案的探索和落地。 6. 在机器学习系统的前沿领域(如分布式训练、软硬协同设计等)参与应用驱动的研
包括英文材料
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
还有更多 •••