拼多多大模型应用算法专家/工程师
社招全职1年以上技术类地点:上海状态:招聘
任职要求
1)计算机、人工智能、数学、统计学等相关专业,硕士及以上学历; 2)1年+大模型相关项目经验(如LLM预训练、LLM后训练、Prompt Engineering、RAG、Agent等); 3)熟悉Transformer架构和RL算法,理解注意力机制、位置编码、RoPE、ALiBi、PPO、DPO、GRPO等关键设计; 4)熟练掌握Python,熟悉PyTorch/DeepSpeed/HuggingFace/Megatron/Verl等训练框架; 5)具备扎实的NLP基础,熟悉BERT、T5、GPT等模型结构及其应…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1)模型应用落地:负责Prompt设计、Few-shot/Zero-shot优化、Continual Pretrain、SFT/RL、RAG链路搭建,提升模型在垂直场景的效果与稳定性,并落地业务解决方案,如AI搜索、智能问答、内容生成、对话系统等; 2)数据构建与评估:构建高质量指令数据、偏好数据、评估集,设计自动化评估指标(如BLEU、ROUGE、人工一致性、幻觉率); 3)系统协同优化:与工程团队协作,提升模型的训练效率和推理效率,包括但不限于KV-Cache、量化、投机解码等技术,以及部署链路(如vLLM、TensorRT、Triton)的优化; 4)业务效果闭环:建立A/B实验体系,跟踪模型上线效果,持续迭代优化,推动业务指标(如CTR、转化率、用户满意度)提升。
包括英文材料
学历+
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Prompt+
https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/introduction-prompt-design
A prompt is a natural language request submitted to a language model to receive a response back.
https://learn.microsoft.com/en-us/azure/ai-foundry/openai/concepts/prompt-engineering
These techniques aren't recommended for reasoning models like gpt-5 and o-series models.
https://www.youtube.com/watch?v=LWiMwhDZ9as
Learn and master the fundamentals of Prompt Engineering and LLMs with this 5-HOUR Prompt Engineering Crash Course!
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
DeepSpeed+
https://www.youtube.com/watch?v=pDGI668pNg0
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
还有更多 •••
相关职位
社招技术类
1、负责 Coding 大模型能力优化,包括 Prompt、SFT、RL、RAG、Agent 等方案设计与落地; 2、提升模型在代码补全、代码生成、代码解释、Bug 修复、单测生成、Code Review 等场景中的效果; 3、构建代码数据、偏好数据和评测集,建立面向真实研发任务的评估体系; 4、与工程团队协作,优化模型训练、推理和部署效率,推动能力在 IDE、代码平台、研发流程中的落地; 5、建立业务效果闭环,持续跟踪模型上线后的采纳率、提效收益和研发质量提升情况。
更新于 2026-04-21上海
社招1-3年J0011
1、LLM模型应用落地:参与LLM在搜索内部的应用,探索LLM的创新落地场景; 2、RAG技术研究与落地:参与RAG技术在搜索内部的应用与创新,提升快手搜索智能问答效果; 3、技术优化与创新:持续优化现有的算法技术,推动算法创新,不断业务效果和用户体验; 4、跨团队合作:与产品团队、工程团队和业务团队紧密合作,理解业务需求,将算法技术转化为实际的产品和解决方案; 5、算法评估与改进:负责对算法模型进行评估和改进,提高算法的准确性、效率和可解释性。
更新于 2026-03-31北京
社招3年以上技术类-算法
1、围绕高德核心业务场景,结合大模型技术实现端到端简化,提升业务效果。负责大模型应用的开发落地工作,包括但不限于LLM应用、Prompt工程、SFT、多模态理解、知识库构建和优化等方面; 2、负责将大模型应用到实际业务场景,包括但不限于销售流程、商家经营、内容生成等业务领域和环节; 3、能够运用多模态理解大模型,实现对非结构化服务数据(店铺装修素材、菜单、评价、商品等)的深度理解和使用。
更新于 2025-08-05北京