
Soul APP Large Model Application Architect
Experienced hire · Full-time · Location: Beijing · Status: Hiring
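Requirements
1. Education: Master's degree or above (AI/CS-related majors preferred)
2. LLM skills: Expert in prompt engineering, RAG, and agent frameworks (LangChain/LlamaIndex/AutoGen)
3. Hands-on AI experience: Has built and launched agent/AI applications from zero to one, and has led the end-to-end delivery of large-scale AI products
4. Engineering skills: Expert in Python/Java, able to develop complex systems
5. System architecture: Deep understanding of distributed-systems principles (CAP/BASE/Paxos/Raft/asynchrony/idempotency), with experience designing and developing high-concurrency systems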
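Responsibilities
1. Lead architecture design: Own the overall architecture of the agent platform, ensuring high availability, high performance, and scalability in production
2. Build the agent framework: Design and implement the core LLM-based agent framework, apply RAG/SFT/RL techniques, and support multi-agent collaboration
3. Tackle frontier technology: Track developments in the agent and LLM fields and introduce innovative solutions to stay technically competitive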
Includes English-language materials
Education+
Large language models+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Prompt+
https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/introduction-prompt-design
A prompt is a natural language request submitted to a language model to receive a response back.
https://learn.microsoft.com/en-us/azure/ai-foundry/openai/concepts/prompt-engineering
These techniques aren't recommended for reasoning models like gpt-5 and o-series models.
https://www.youtube.com/watch?v=LWiMwhDZ9as
Learn and master the fundamentals of Prompt Engineering and LLMs with this 5-HOUR Prompt Engineering Crash Course!
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
LangChain+
https://python.langchain.com/docs/tutorials/
New to LangChain or LLM app development in general? Read this material to quickly get up and running building your first applications.
https://www.freecodecamp.org/news/beginners-guide-to-langchain/
LangChain is a popular framework for creating LLM-powered apps.
LlamaIndex+
https://developers.llamaindex.ai/python/framework/getting_started/starter_example/
This tutorial will show you how to get started building agents with LlamaIndex.
https://www.ibm.com/think/tutorials/llamaindex-rag
LlamaIndex is a powerful open source framework that simplifies the process of building RAG pipelines.
AutoGen+
https://microsoft.github.io/autogen/0.2/docs/Getting-Started/
AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks.
https://www.youtube.com/watch?v=JmjxwTEJSE8
Whether you know everything there is to know about AI Agents or are a complete beginner, I believe there is something to learn here.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
CAP+
https://www.hellointerview.com/learn/system-design/core-concepts/cap-theorem
Master the fundamental tradeoffs between consistency and availability in distributed systems.
https://www.ibm.com/think/topics/cap-theorem
The CAP theorem says that a distributed system can deliver only two of three desired characteristics: consistency, availability and partition tolerance (the 'C,' 'A' and 'P' in CAP).
https://www.youtube.com/watch?v=BHqjEjzAicA
Paxos+
https://lamport.azurewebsites.net/pubs/paxos-simple.pdf
The Paxos algorithm for implementing a fault-tolerant distributed system has been regarded as difficult to understand, perhaps because the original presentation was Greek to many readers.
https://paxos.systems/
Paxos algorithms are a family of consensus algorithms (or protocols) that are used in distributed systems to achieve consensus in the presence of crash failures.
https://www.scylladb.com/glossary/paxos-consensus-algorithm/
Paxos is a family of protocols for solving the problem of consensus in distributed networks.
Raft+
https://raft.github.io/
Raft is a consensus algorithm that is designed to be easy to understand.
https://thesecretlivesofdata.com/raft/
Understandable Distributed Consensus
https://www.youtube.com/watch?v=ZyqAbQkpeUo
Sail into the world of distributed systems with our in-depth, Raft consensus algorithm tutorial.
High concurrency+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
System design+
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
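Related positions
Experienced hire · 5+ years · D2863
1. Develop and upgrade multiple modules of the large-model RAG and Agent frameworks, and tackle difficult engineering problems;
2. Design and develop new interaction forms, such as multimodal dialogue and human-machine dialogue, in product applications;
3. Help define large-model Ecosystem solutions, and improve the stability, performance, openness, and extensibility of the platform/product.
Updated 2025-05-16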
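Experienced hire · 5+ years · 核心本地商业-基
Lead exploratory AI application projects, including but not limited to: online system architecture design, online/offline data production pipeline design, business conversion optimization, system performance optimization, and tackling algorithm engineering challenges.
Build foundational components for large-model applications, including but not limited to: memory components, conversation analysis components, and knowledge bases.
Take on production development of large models, including but not limited to RAG, model fine-tuning, instruction data construction, prompt engineering, and model evaluation, to improve the productization of large models and advance the AI infrastructure.
Follow the evolution of AI agent technology inside and outside the company, conduct forward-looking research, explore integrating new technologies and ideas into existing systems, and drive continued innovation and competitiveness in the company's agent technology.
Updated 2025-05-08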
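Experienced hire · 3-5 years · NetEase Cloud Music
1. Lead the overall architecture design of the AI virtual companion system, designing and implementing the system architecture that supports the virtual companion experience;
2. Improve memory management: efficient storage, retrieval, and use of key events, user preferences, and conversation history;
3. RAG and content personalization: own the engineering delivery of the retrieval-augmented generation (RAG) system, combining user interests, knowledge bases, and real-time data to generate personalized interactive content, and optimize retrieval strategies to improve the chat experience;
4. Lead system performance optimization, solving low-latency response and resource scheduling problems under high concurrency.
Updated 2025-08-15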