SF Express: Large Model Algorithm Engineer
Experienced hire · Full-time · 3-5 years · Location: Shenzhen · Status: Hiring
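Requirements
1. Expert command of the core algorithms and technical principles of large model training; familiar with mainstream large model architectures (such as Transformer and MoE); hands-on development experience with agent frameworks such as LangChain, LangGraph, and AutoGen.
2. Master's degree or above in computer science, natural language processing, deep learning, operations research, or a related field, with at least 3 years of NLP research or related work experience. Research or project experience combining large models with operations research optimization, complex task planning, or intelligent decision-making is a strong plus.
3. Proficient in Pytho…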
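Responsibilities
1. Lead R&D of decision-making agents for large-scale logistics and supply chain scenarios. Covering core business areas such as network planning, resource planning, and product planning, design and train a foundational AI agent platform capable of complex logical reasoning, task planning, and decision-making.
2. Using the AutoGen or LangGraph framework, design and implement multi-agent systems that support agent-to-agent collaboration, and explore applications of large models in decision optimization. Optimize the agents' reasoning chains and interaction efficiency to improve system stability and scalability.
3. Work closely with product, business, and engineering teams to combine AI agents with existing operations research algorithms and business rules, building agent systems with think-plan-act capabilities that solve hard real-world decision problems and deliver business value once the models are deployed.
4. Track frontier developments in AI and decision intelligence (such as chain-of-thought (CoT), RAG, and agent frameworks); in light of logistics and supply chain pain points, explore innovative approaches such as LLM+OR (operations research) and push the technical boundary.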
English-language learning materials
Large models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Transformer
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
LangChain
https://python.langchain.com/docs/tutorials/
New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.
https://www.freecodecamp.org/news/beginners-guide-to-langchain/
LangChain is a popular framework for creating LLM-powered apps.
AutoGen
https://microsoft.github.io/autogen/0.2/docs/Getting-Started/
AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks.
https://www.youtube.com/watch?v=JmjxwTEJSE8
Whether you know everything there is to know about AI agents or are a complete beginner, I believe there is something to learn here.
AI agents
https://learn.microsoft.com/en-us/shows/ai-agents-for-beginners/
In this 10-lesson course we take you from concept to code while covering the fundamentals of building AI agents.
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
Development frameworks
[English] Understanding Modern Development Frameworks: A Guide for Developers and Technical Decision-makers
https://www.freecodecamp.org/news/understanding-modern-development-frameworks-guide-for-devs/
NLP
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
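Related positions
Experienced hire · 1+ years · Algorithm development
1. Help build generative large model capabilities, including but not limited to model design, prompt optimization, pre-training, inference acceleration, and other capability building.
2. Apply state-of-the-art parallel processing and distributed learning techniques; formulate and execute performance optimization strategies that significantly improve the training speed and inference capability of large language models, for example by following up on the DeepSeek R1 technical architecture, to keep the technology industry-leading.
3. Drive adoption of large model technology across JD Logistics business scenarios, including but not limited to intelligent Q&A, intelligent data analysis, intelligent decision-making, and Computer Use, to optimize business processes and improve quality and efficiency.
4. Explore the large language model direction in depth, maintain a technical lead, help JD Logistics set an industry benchmark for efficient, accurate large model and multimodal large model applications, and deliver business gains.
Updated 2025-06-09 · Beijing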
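Experienced hire · Large models
1. Explore next-generation base architectures for large language models: rework large language models around diffusion models, move beyond token-by-token prediction to achieve efficient inference, and explore new scaling laws.
2. Implement data cleaning, synthesis, and evaluation for large model training; design and implement AI infrastructure (AI Infra) frameworks for large model training.
Updated 2025-11-20 · Beijing | Shanghai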