Pinduoduo: LLM Algorithm Application Engineer
Experienced hire | Full-time | Technical | Location: Shanghai | Status: Hiring
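Requirements
1. Solid programming skills; comfortable developing in a Linux environment; familiar with Python-centric LLM development workflows, with working knowledge of Java/C++.
2. Aware of the direction and technical foundations of new models in the industry, including but not limited to:
   - vector (embedding) methods
   - multimodality
   - model architecture
   - training dataset collection
   - instruction fine-tuning dataset generation
   - model fine-tuning
   - model alignment
   - inference acceleration
3. Familiar with open-source LLM application frameworks such as LangChain, Haystack, and LlamaIndex; aware of open-source inference frameworks such as vLLM, LMDeploy, Ollama, and llama.cpp.
4. Familiar with deep learning, with an understanding of model development and training in a PyTorch environment using frameworks such as DeepSpeed.
5. Some understanding of distributed computing.
6. Good team communication and collaboration skills; able to distill key technical points for audiences without an LLM background.

Bonus points
1. Experience using LLMs to ship applications or to solve real personal problems.
2. Experience training LLMs on distributed clusters.
3. Familiar with GPU architecture, with some experience optimizing model inference performance.
4. Contributions to the LLM community, or published academic papers in related fields.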
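Responsibilities
1. Design, develop, and ship generative language model applications that deliver a better experience in users' scenarios.
2. For deployed LLM services, investigate how different models and architectures affect service metrics.
3. Combine LLM techniques such as prompt engineering, model fine-tuning (supervised/parameter-efficient fine-tuning), function calling, and retrieval-augmented generation (RAG) backed by a vector database to build key features and achieve stable, reproducible model outputs.
4. Track the latest industry results and, based on business needs, research and introduce new LLM application scenarios for the team.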
Includes English-language materials
Linux
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, beginner-friendly, with complete examples, based on the latest Python 3.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Large language models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Java
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
LangChain
https://python.langchain.com/docs/tutorials/
New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.
https://www.freecodecamp.org/news/beginners-guide-to-langchain/
LangChain is a popular framework for creating LLM-powered apps.
LlamaIndex
https://developers.llamaindex.ai/python/framework/getting_started/starter_example/
This tutorial will show you how to get started building agents with LlamaIndex.
https://www.ibm.com/think/tutorials/llamaindex-rag
LlamaIndex is a powerful open source framework that simplifies the process of building RAG pipelines.
vLLM
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
Ollama
https://www.youtube.com/watch?v=GWB9ApTPTv4
Learn how to set up and use Ollama to build powerful AI applications locally.
https://www.youtube.com/watch?v=UtSSMs6ObqY
In this short video, I'll teach you everything you need to know to get up and running with Ollama.
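Llama
https://github.com/LlamaFamily/Llama-Chinese
The Llama Chinese community: a continuously updated collection of the latest Llama learning materials, aiming to build the best open-source ecosystem for Chinese Llama models; fully open source and available for commercial use.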
https://www.llama.com/docs/overview/
This guide provides information and resources to help you set up Llama including how to access the model, hosting, how-to and integration guides.
Deep learning
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
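PyTorch
https://datawhalechina.github.io/thorough-pytorch/
PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the framework most commonly used in academia to implement deep learning algorithms.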
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
LMDeploy
https://lmdeploy.readthedocs.io/en/latest/get_started/get_started.html
This tutorial shows the usage of LMDeploy on CUDA platform.
llama.cpp
https://blog.steelph0enix.dev/posts/llama-cpp-guide/
No LLMs were harmed during creation of this post.
https://github.com/ggml-org/llama.cpp/discussions/15396
This is a detailed guide for running the new gpt-oss models locally with the best performance using llama.cpp.
https://www.youtube.com/watch?v=EPYsP-l6z2s
In this guide, you'll learn how to run local llm models using llama.cpp.
DeepSpeed
https://www.youtube.com/watch?v=pDGI668pNg0
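Related positions
Experienced hire | Algorithm development
LLM Algorithm Application Engineer. Responsibilities and goals: 1. Focus on large language model (LLM) applications, including but not limited to core technical challenges in digital humans, intelligent customer service, conversational shopping assistance, AI agents, and document chat; continuously improve the quality of LLM applications and keep the technology competitive in the industry. 2. Continuously track and explore future developments and application trends in LLMs, multimodal AIGC, and other AI technologies, and maintain close exchange with the industry.
Updated 2025-06-15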
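Experienced hire | Algorithm development
LLM Algorithm Application Engineer. Responsibilities and goals: 1. Focus on large language model (LLM) applications, including but not limited to core technical challenges in digital humans, intelligent customer service, conversational shopping assistance, AI agents, and document chat; continuously improve the quality of LLM applications and keep the technology competitive in the industry. 2. Continuously track and explore future developments and application trends in LLMs, multimodal AIGC, and other AI technologies, and maintain close exchange with the industry.
Updated 2025-06-12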
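Experienced hire | A204639
1. Design, develop, and deploy generative AI capabilities in enterprise applications to deliver a better user experience. 2. Combine techniques such as workflows, prompt engineering, model selection, hyperparameter configuration, model fine-tuning, data retrieval, and encoding to support key product features. 3. Develop and maintain the services that use our large language model capabilities, ensuring the stability and observability of the algorithm services. 4. Track frontier trends and research and introduce new AI application scenarios for the team.
Updated 2024-05-06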