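DingTalk Large Model Algorithm Engineer
Campus recruitment | Full-time | DingTalk 2026 Fall New Graduate Recruitment | Location: Hangzhou | Status: Hiring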
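Qualifications
1. Bachelor's degree or above; majors in computer science, artificial intelligence, software engineering, or related fields are preferred.
2. Solid theoretical foundation in machine learning and deep learning, with proficiency in at least one mainstream framework such as PyTorch, JAX, or TensorFlow. Familiarity with post-training techniques such as SFT and RLHF, or relevant project, competition, or publication experience, is preferred.
3. Strong experimental design and problem-diagnosis skills, with the ability to independently analyze and resolve performance and behavior issues of large models across different scenarios. Publications at top conferences (ICML/NeurIPS/ICLR/ACL/CVPR, etc.) are preferred.
4. Good communication and teamwork skills, and a willingness to share insights and drive projects to delivery in a fast-iterating environment. Awards in competitions such as ACM-ICPC or Kaggle, or contributions to open-source large model projects, are preferred.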
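Responsibilities
1. Build text and vision-language (VL) understanding large models that hold a leading edge in key application scenarios, and research continued pre-training (CPT) and annealing training techniques in depth to create strong vertical-domain base models.
2. Explore and design reward mechanisms and reward models for vertical domains, and use advanced reinforcement learning techniques to elicit and improve the base model's knowledge integration and reasoning abilities in specialized domains.
3. Track and explore cutting-edge text and multimodal model architectures and efficient training/inference approaches. Conduct in-depth research on advanced model architectures, alignment algorithms, reinforcement learning, inference efficiency optimization, reward model design, visual reasoning, and model interpretability, producing technical results with industry impact.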
Machine Learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Deep Learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch
https://datawhalechina.github.io/thorough-pytorch/
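PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years has become the framework most commonly used in academia to implement deep learning algorithms.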
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
JAX
https://docs.jax.dev/en/latest/notebooks/thinking_in_jax.html
JAX is a library for array-oriented numerical computation, with automatic differentiation and JIT compilation to enable high-performance machine learning research.
TensorFlow
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
SFT
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
Large Models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
ICML
https://icml.cc/
NeurIPS
https://neurips.cc/
ICLR
https://iclr.cc/
CVPR
https://cvpr.thecvf.com/
Kaggle
[English] Kaggle Learn
https://www.kaggle.com/learn
Gain the skills you need to do independent data science projects.
RLHF
[English] What is RLHF?
https://aws.amazon.com/what-is/reinforcement-learning-from-human-feedback/
Reinforcement learning from human feedback (RLHF) is a machine learning (ML) technique that uses human feedback to optimize ML models to self-learn more efficiently.
https://www.ibm.com/think/topics/rlhf
Reinforcement learning from human feedback (RLHF) is a machine learning technique in which a “reward model” is trained with direct human feedback, then used to optimize the performance of an artificial intelligence agent through reinforcement learning.
ACL
https://www.aclweb.org/portal/
Computational linguistics is the scientific study of language from a computational perspective.
ICPC
https://icpc.global/
The International Collegiate Programming Contest is an algorithmic programming contest for college students.
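Related Positions
Experienced hire | 1+ years | Algorithm Development
1. Participate in building generative large model capabilities, including but not limited to model design, prompt optimization, pre-training, model inference acceleration, and other capability building.
2. Apply state-of-the-art parallel processing and distributed learning techniques, and define and execute performance optimization strategies that significantly improve the training speed and inference capability of large language models, for example by following the DeepSeek R1 technical architecture, to keep the technology industry-leading.
3. Drive adoption of large model technology across JD Logistics business scenarios, including but not limited to intelligent Q&A, intelligent data analysis, intelligent decision-making, and Computer Use, helping optimize business processes and improve quality and efficiency.
4. Explore the large language model direction in depth, maintain technical leadership, help JD Logistics set an industry benchmark for efficient and accurate large model / multimodal large model applications, and deliver business gains.
Updated 2025-06-09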
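Experienced hire | Large Models
1. Explore next-generation base architectures for large language models; rework diffusion models for use in large language models, move beyond token-by-token prediction to achieve an efficient inference mode, and explore new scaling laws.
2. Implement data cleaning, synthesis, and evaluation for large model training; design and implement the AI Infra framework for large model training.
Updated 2025-09-05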