小米大模型算法工程师
社招全职3年以上A215198地点:北京状态:招聘
任职要求
1. 计算机相关专业,三年及以上人工智能算法领域相关工作和项目落地经验; 2. 掌握Python开发语言,具有较高水平的算法基础和工程实现能力; 3. 精通机器学习、深度学习、大语言模型基础理论和方法等领域专业知识; 4. 掌握PyTorch、TensorFlow、Megatron、DeepSpeed等框架,了解各种并行策略并具备大规模分布式训练的经验; 5. 掌握业界领先大模型的基本原理、训练和微调方法,如LLaMa、Qwen、GLM等; 6. 出色的大模型研究能力,有高质量论文或开源项目的产出者优先考虑; 7. 具备较强的自我驱动力、独立的逻辑思考能力、严谨的归纳总结能力、高效的沟通协调能力;
工作职责
1. 负责将大模型技术应用于文本内容生成等业务领域,推动大模型技术匹配及赋能目标业务场景; 2. 负责大模型预训练和微调算法的研发平台搭建,以及大模型预训练和微调语料的处理与维护等; 3. 负责基于大模型算法的开发与优化,包括大模型增量预训练、高效微调、推理优化,解决落地过程中的算法和工程技术难题; 4. 负责追踪学术界和工业界在大模型预训练、微调、强化学习等方向的前沿进展,持续进行模型框架和训练方法的优化迭代;
包括英文材料
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
DeepSpeed+
https://www.youtube.com/watch?v=pDGI668pNg0
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
相关职位
社招1年以上算法开发岗
1、参与生成式大模型能力构建;不局限于模型设计、prompt优化、预训练、模型推理加速、其他能力建设等; 2、采用最先进的并行处理和分布式学习技术,制定并执行性能优化策略,显著提升大型语言模型的训练速度和推理能力,例如跟进DeepSeek R1技术架构等,确保技术行业领先; 3、推进大模型技术在京东物流各个业务场景落地,包括不限于智能问答、智能数据分析、智能决策以及Computer Use等,助力业务流程优化,增质提效; 4、深度探索大语言模型方向,保持技术领先优势,推动京东物流在行业内树立高效、精准的大模型/多模态大模型应用标杆,并取得业务收益。
更新于 2025-06-09
社招大模型
1、探索新一代大语言模型基座架构,完成扩散模型(diffusion model)在大语言模型的重塑,突破逐个token预测的方式,实现高效的推理模式,探索全新scaling law; 2、实现大模型训练的数据清洗、合成和评估;设计和实现大模型训练的AI Infra框架。
更新于 2025-09-05