顺丰大模型后端研发工程师
社招全职5-10年地点:深圳状态:招聘
任职要求
计算机相关专业本科及以上学历,5年以上后端开发经验 - 精通Golang/Python/Java等至少一种主流编程语言,具备扎实的数据结构与算法基础 - 具有分布式系统设计经验,熟悉负载均衡、消息队列、缓存等技术,同时理解这些技术在AI系统中的应用 - 熟悉大语言模型工作原理,有LLM应用开发或Agent系统相关经验优先 - 熟悉向量数据库(如Milvus、Qdrant等)或全文检索系统(如Elasticsearch) - 具备高并发、高可用系统设计与实现经验,了解性能优化方法 - 良好的问题独立解决能力和技术文档编写能力 - 具备优秀的团队协作精神,能够在跨部门环境中有效沟通 - 熟悉Dify或者n8n开源项目优先,参与过LLM开源项目优先 - 了解推理优化相关技术优先
工作职责
- 负责Agent平台核心架构设计与实现,构建高可靠性,高性能,高可扩展性的后端系统 - 设计并实现Agent运行时引擎,包括大语言模型调用、工具集成、上下文管理等核心组件 - 构建Agent评估与监控体系,设计指标收集、性能分析与质量保障机制 - 优化Agent执行链路,提升系统吞吐量,降低延迟,减少资源消耗,提高Agent系统性能,包含不限于推理优化,Prompt优化,Agent执行流程优化等 - 设计多Agent协作机制,实现复杂任务的分解与协同处理能力 - 与算法团队紧密合作,将前沿LLM技术落地为生产级系统
包括英文材料
学历+
后端开发+
https://www.youtube.com/watch?v=tN6oJu2DqCM&list=PLWKjhJtqVAbn21gs5UnLhCQ82f923WCgM
Learn what technologies you should learn first to become a back end web developer.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
消息队列+
https://www.youtube.com/watch?v=xErwDaOc-Gs
缓存+
https://hackernoon.com/the-system-design-cheat-sheet-cache
The cache is a layer that stores a subset of data, typically the most frequently accessed or essential information, in a location quicker to access than its primary storage location.
https://www.youtube.com/watch?v=bP4BeUjNkXc
Caching strategies, Distributed Caching, Eviction Policies, Write-Through Cache and Least Recently Used (LRU) cache are all important terms when it comes to designing an efficient system with a caching layer.
https://www.youtube.com/watch?v=dGAgxozNWFE
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
Milvus+
[英文] Tutorials Overview
https://milvus.io/docs/tutorials-overview.md
This page provides a list of tutorials for you to interact with Milvus.
https://www.baeldung.com/milvus-tutorial-intro
In this tutorial, we’ll explore Milvus, a highly scalable open-source vector database.
https://www.youtube.com/watch?v=7ejr_ZzU9jw
Discover the power of Milvus, an open-source vector database revolutionizing AI applications.
https://www.youtube.com/watch?v=Yhv19le0sBw
Vector databases have been trending recently as they power modern search, recommendations, and AI-driven applications.
ElasticSearch+
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
高并发+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
高可用+
https://redis.io/blog/high-availability-architecture/
A high available architecture is when there are a number of different components, modules, or services that work together to maintain optimal performance, irrespective of peak-time loads.
https://www.ibm.com/think/topics/high-availability
High availability (HA) is a term that refers to a system’s ability to be accessible and reliable close to 100% of the time.
系统设计+
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
相关职位
社招5年以上后端开发
1、负责小红书基座大模型团队hi lab APP应用的后端技术研发和产品迭代; 2、从高可用、高性能、高并发、安全性等方面出发推进项目落地; 3、与客户端、产品、设计、测试等团队紧密合作,完成项目推进和功能交付;
社招2年以上微信读书技术
1.负责多模态内容理解、多模态生成等算法技术在视频剪辑、多模态理解和生成业务上的落地工作:; 2.参与图像、视频、语音等多模态内容理解及生成相关算法的工程化落地; 3.参与多模态业务核心功能设计、开发及维护,包括但不限于服务性能调优、技术难点攻坚、指标自动化监控、异构大模型调度等; 4.参与维护后台服务,保证线上稳定性,以及搭建自动化分析系统,提升工程效率; 5.参与新技术调研以及探索,以及项目特定方向的重点、难点开发工作。
更新于 2025-09-15