美团【基座大模型北斗实习】大模型多模态训练研究
实习兼职核心本地商业-基础研发平台地点:北京 | 上海状态:招聘
任职要求
1、精通 C++/Python,熟悉 CUDA 编程、NCCL 通信库或 RDMA 网络优化; 2、具有很强的学习能力、复杂问题归纳梳理能力、沟通和团队协作能力,具备能够深度钻研技术的耐心; 3、至少深入研读过 Megatron-LM,…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
简介:多模态训练pretrain和posttrain的前沿研究,可根据个人背景和研究兴趣选择以下方向之一深入推进: 1、超长序列的高效pretrain训练方案。 2、基于投机采样的方式加速多模态RL的训练效率。 3、针对compute use场景的大规模agentic RL 高效训练方案探索。
包括英文材料
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
NCCL+
https://developer.nvidia.com/nccl
The NVIDIA Collective Communication Library (NCCL) implements multi-GPU and multi-node communication primitives optimized for NVIDIA GPUs and networking.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
还有更多 •••