美团【LongCat大模型人才校招】基座大模型工程架构专家
校招全职核心本地商业-基础研发平台地点:北京 | 上海状态:招聘
任职要求
1.具备良好的计算机基础素养和分析解决问题的能力,熟练掌握C++或Python。 2.学习能力强,对机器学习系统优化有技术热情,富有极客精神。 3.熟悉PyTorch框架和TVM/MLIR等编译优化技术的优先。 4.熟悉GPU、NPU硬件架构,熟练使用CUDA,…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1.面向多种算力硬件和高性能网络设计分布式训练架构,包括样本IO优化、计算图编译与执行、多维度并行策略、多模型交互流程等,支持万亿参数模型在几万张GPU集群高效稳定训练,实现多种模态的基座和推理模型的高效稳定训练。 2.面向多种算力、网络环境和应用场景,设计并实现高性能的模型推理架构,应用量化、剪枝等模型压缩方法,持续降低推理成本。 3.通过手工优化方法,对特化模型子结构和硬件设备上实现SOTA性能,持续迭代基于编译的优化方案,提升通用优化的适用性、优化效果以及对新硬件的覆盖能力。 4.管理及优化全公司算法团队硬件资源,通过算法预估与启发式策略,对全公司万级别节点的大规模GPU/CPU集群构建精细化调度服务能力,持续提升资源使用效率。 【为什么是我们】 1.业界前列的算力规模,海量数据和丰富的应用场景,挑战与机遇并存。 2.协同算法团队深度参与大模型项目,Codesign设计并训练行业领先的大模型。 3.从数据规模、集群体量、算法和业务复杂度多个维度提供了技术挑战和锻炼发展的机会,个人成长速度快。 4.追求卓越和鼓励创新的团队氛围。
包括英文材料
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
NCCL+
https://developer.nvidia.com/nccl
The NVIDIA Collective Communication Library (NCCL) implements multi-GPU and multi-node communication primitives optimized for NVIDIA GPUs and networking.
还有更多 •••