miHoYo [Early Batch] LLM Researcher (Pretrain Modeling)
Campus recruitment, full-time, Programming & Technology. Location: Shanghai. Status: hiring.
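Requirements
1. Master's or PhD degree in computer science, artificial intelligence, or a related field.
2. Relevant research experience in natural language processing, large language model research, or machine learning.
3. Solid coding and algorithm fundamentals, and proficiency with deep learning frameworks such as PyTorch.
4. Effective communication and collaboration skills, and a passion for exploring new technologies and driving technical innovation.
Bonus points
1. Highly cited papers published at top conferences such as NeurIPS/ICML/ACL/EMNLP.
2. Awards in programming contests such as ACM/ICPC, NOI/IOI, or TopCoder.
3. Experience training large models at scale, including hands-on participation in large-model training; understanding of distributed training frameworks and the associated performance tuning and resource management; familiarity with Megatron.
4. In-depth understanding and analysis of different LLM architectures (e.g. MoE, sparse attention); published papers exploring model architecture are a further plus.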
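Responsibilities
1. Dig deep into the training framework, quickly pinpoint problems that arise during training, analyze model behavior over the course of training, and work with the infra team to ensure the training strategy is correct.
2. Keep up with frontier techniques in the field and research new LLM architectures to improve the computational efficiency of training or inference as well as model performance.
3. Study scaling laws across the algorithmic dimensions of architecture, data, objective function, and optimization method, and distill them into an efficient, stable pretraining strategy.
4. Extend the model's capabilities in long-text understanding and generation.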
Includes English-language materials
Education+
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
Machine learning+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Algorithms+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
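PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years has become the framework most commonly used in academia to implement deep learning algorithms.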
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Deep learning+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
NeurIPS+
https://neurips.cc/
ICML+
https://icml.cc/
Large models+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Performance tuning+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
Related positions
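Campus recruitment, Programming & Technology
1. Keep up with frontier techniques in the field and build AI characters that are consistent, human-like, and high in both IQ and EQ.
2. Continuously iterate on key techniques such as Memory, Planning, RAG, Tool use, and Multi-Agent to improve the Agent's dialogue management, behavioral decision-making, and environment interaction.
3. Develop efficient Agent systems, continuously optimize their architecture and performance, and drive the adoption of Agents in production applications.
4. Explore and implement a closed Agent data loop for complex scenarios, and build robust, reliable evaluation processes.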
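Campus recruitment, Programming & Technology
1. Keep up with frontier techniques in the field and explore effective and efficient post-training methods such as RLHF or RLAIF, improving the large language model's human-likeness and entertainment value as well as its overall ability in role-play, creative writing, and similar areas.
2. Take part in the development of pre-research projects, working closely with product, game design, engineering, and other teams to break down and design concrete algorithmic solutions and delivery targets.
3. Build high-quality, multi-domain data processing and analysis pipelines, including but not limited to data cleaning, data synthesis, and data mixing strategies.
4. Build robust, reliable algorithm evaluation processes that reveal the capability boundaries and underlying mechanisms of large language models.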