小米高级AI系统开发工程师(大模型与RAG方向)
社招全职5年以上A18742地点:武汉状态:招聘
任职要求
1.计算机科学、人工智能或相关领域本科及以上学历,5年以上大型服务端开发经验,3年以上AI系统相关项目经验 2.有牵头大型AI工程项目经验,具备一定的团队管理或技术领导经验 3. 具备扎实的 Java 编程基础,熟悉常用的 Java 开发框架,包括不限于Spring,SpringMvc、SpringBoot、Spring Cloud,有高并发分布式系统开发经验 4. 熟悉常用数据库,包括不限于Mysql、MongoDB、ES、Redis等,熟悉常用的消息中间件 5. 熟悉python/GO开发语言,能进行一般的py…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 主导大模型系统架构设计: 负责RAG系统的整体架构设计,包括存储层、检索层、推理层与缓存层的技术选型与实现。 构建高可用、低延迟的分布式推理服务框架,支持向量数据库集成(如Milvus、Elastic)、知识库管理与多模态检索优化。 设计并实现Agent工作流编排框架,支持工具调用(MCP协议)、任务规划与自动化执行。 2.模型部署与性能优化: 负责大模型(LLM/VLM)的本地化部署、量化压缩、动态批处理与推理加速,优化GPU/CPU异构算力利用率。 3. AI服务平台开发: 基于Java/Go/Python构建高并发、可扩展的AI微服务,与现有业务系统深度集成,实现模型训练-部署-监控的全链路管理。 4.技术领导与跨团队协作: 指导中级工程师,制定技术方案,并主导技术攻关。与产品、算法、基础设施团队协作,定义需求并推动工程落地。
包括英文材料
学历+
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
开发框架+
[英文] Understanding Modern Development Frameworks: A Guide for Developers and Technical Decision-makers
https://www.freecodecamp.org/news/understanding-modern-development-frameworks-guide-for-devs/
Spring+
https://liaoxuefeng.com/books/java/spring/index.html
Spring是一个支持快速开发Java EE应用程序的框架。它提供了一系列底层容器和基础设施,并可以和大量常用的开源框架无缝集成,可以说是开发Java EE应用程序的必备。
https://spring.io/guides/gs/rest-service
https://spring.io/quickstart
Level up your Java code and explore what Spring can do for you.
Spring Cloud+
[英文] Spring Cloud Series
https://www.baeldung.com/spring-cloud-series
Learn Spring Cloud including concepts, additional libraries and examples for distributed systems.
高并发+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
MySQL+
https://juejin.cn/post/7190306988939542585
这是一篇 MySQL 通关一篇过硬核经验学习路线,包括数据库相关知识,SQL语句的使用,数据库约束,设计等。
[英文] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
MongoDB+
https://learnxinyminutes.com/mongodb/
MongoDB is a NoSQL document database for high volume data storage.
https://studio3t.com/academy/#courses
The fastest way to learn MongoDB
https://www.youtube.com/watch?v=c2M-rlkkT5o
This video will give you and introduction to MongoDB in 1 Hour. Afterwards I recommend exploring aggregation, replication, and sharding.
https://www.youtube.com/watch?v=ExcRbA7fy_A&list=PL4cUxeGkcC9h77dJ-QJlwGlZlTd4ecZOA
You'll learn how to use MongoDB (a NoSQL database) from scratch. You'll also learn how to integrate it into a simple Node.js API.
ElasticSearch+
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
还有更多 •••
相关职位
社招3年以上技术类-开发
1. 探索大模型与智能体技术在alibaba.com全球化销售、商家和商品等业务的落地应用,如在潜客挖掘、客户分层、商家培育、智能发品、经营助手、智能诊断等场景下的适配方案,用创新的思路和技术方案解决业务带来的挑战,驱动业务增长与商家体验提升。 2. 研究大模型相关的训练、优化和应用技术,包括指令微调、RLHF、工具学习、提示词/上下文工程、RAG、Agent架构等,跟踪相关领域前沿进展,进行各类方案的技术选型和工程实现,并与产品、UI/UX、测试及运维团队紧密协作,确保项目高质量交付。 3. 设计并开发高可用、高并发的分布式服务,构建微服务架构(如Spring Cloud/Dubbo),优化服务API性能与线上稳定性,负责数据库、缓存、消息队列等组件的技术选型与性能调优。
更新于 2026-02-11杭州
社招3年以上核心本地商业-业
1、负责服务零售家庭服务业务相关系统的设计与开发,参与架构演进、系统优化; 2、负责一个或多个子系统的中长期规划,承接业务需求并做好项目管理、上下游协同工作; 3、指导初中级工程师的学习成长和技术方案设计,参与团队代码Review等工作; 4、负责技术难点攻关,不断提升核心服务的稳定性和系统性能,提升运营效率。
更新于 2025-07-23上海

社招5年以上
负责技术方案设计与落地,根据业务需求设计AI系统架构,推动技术方案从PoC验证到规模化部署 1、优化模型推理效率,应用剪枝、蒸馏等技术降低计算资源消耗 2、负责机器学习/深度学习模型的开发与调优,主导数据预处理、特征工程、模型训练与评估全流程 3、协同产品、数据团队,将业务需求转化为技术实现,输出专利与技术文档 4、跟踪AI前沿技术(如多模态学习、Agent框架),探索创新应用场景
更新于 2025-10-15北京