
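Dewu (得物) [Technical Assurance] Algorithm SRE Engineer/Expert
Experienced hire | Full-time | 5+ years | Technical | Location: Shanghai | Status: Hiring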
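Requirements
1. Bachelor's degree or above in computer science or a related field, with 5+ years of operations experience on complex business systems and strong skills in system tuning, performance optimization, and incident handling; experience operating large-scale search, advertising, and recommendation engineering systems is preferred.
2. Proficient in operating common infrastructure components and middleware such as k8s, Nginx, Kafka, ES, Redis, and ZK; familiar with algorithm infra technologies for search, advertising, and recommendation inference and training.
3. Command of at least one of Python, Shell, or Golang, with the coding ability to independently write operations tools and scripts.
4. Strong awareness of production safety, with a sense of responsibility and respect for the production environment; self-driven, proactive in learning, and capable of independent thinking.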
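Responsibilities
1. Own the stability of core search and recommendation engineering services, improving it through metrics, process and policy building, degradation and disaster recovery, contingency plan design, capacity management, and monitoring/alerting optimization.
2. Efficiently meet the R&D teams' operations service needs, integrating the technical assurance platform's capabilities and service resources to provide high-quality support, and participate deeply in the design and review of major business architecture proposals.
3. Work with the cost operations department to continuously optimize technology spend by identifying efficiency metrics and landing new technologies in the business domain.
4. Own the standardization, maintenance, and management of core foundational services: establish SOPs, build automated operations tooling, standardize how team members perform changes, and ensure continuous integration and delivery of systems.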
Middleware
https://www.youtube.com/watch?v=1oWPUpMheGk
Kubernetes
https://kubernetes.io/docs/tutorials/kubernetes-basics/
This tutorial provides a walkthrough of the basics of the Kubernetes cluster orchestration system.
https://kubernetes.io/zh-cn/docs/tutorials/kubernetes-basics/
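This tutorial introduces the basics of the Kubernetes cluster orchestration system. Each module contains background on major Kubernetes features and concepts, along with an interactive online tutorial.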
https://www.youtube.com/watch?v=s_o8dwzRlu4
Hands-On Kubernetes Tutorial | Learn Kubernetes in 1 Hour - Kubernetes Course for Beginners
https://www.youtube.com/watch?v=X48VuDVv0do
Full Kubernetes Tutorial | Kubernetes Course | Hands-on course with a lot of demos
Nginx
[English] Beginner’s Guide
https://nginx.org/en/docs/beginners_guide.html
This guide gives a basic introduction to nginx and describes some simple tasks that can be done with it.
https://www.youtube.com/watch?v=9t9Mp0BGnyI
NGINX is open-source web server software used for reverse proxy, load balancing, and caching. It's important to understand, especially if you are a backend developer.
Kafka
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
Elasticsearch
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
Redis
[English] Developer Hub
https://redis.io/dev/
Get all the tutorials, learning paths, and more you need to start building—fast.
https://www.runoob.com/redis/redis-tutorial.html
REmote DIctionary Server (Redis) is a key-value store written by Salvatore Sanfilippo; it is a cross-platform, non-relational database.
https://www.youtube.com/watch?v=jgpVdJB2sKQ
In this video I will be covering Redis in depth from how to install it, what commands you can use, all the way to how to use it in a real world project.
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Bash
[English] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Go
https://www.youtube.com/watch?v=8uiZC0l4Ajw
A complete Golang tutorial, start to finish in under an hour, including a full demo of building an API in Go. No filler, just what you need to know.
Scripting
[English] Scripting language
https://en.wikipedia.org/wiki/Scripting_language
https://zhuanlan.zhihu.com/p/571097954
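A script is usually interpreted rather than compiled. Scripting languages tend to be simple, easy to learn, and easy to use, with the goal of letting programmers write programs quickly.
Related positions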

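Experienced hire | Technical
1. Lead end-to-end stability work for the core search/recommendation pipeline, managing the change cadence based on SLO/SLI and error budgets to ensure high availability and fast delivery.
2. Design and evolve full-pipeline monitoring, alerting, self-healing, and degradation systems, building automated response and retrospective mechanisms to speed up problem diagnosis and recovery.
3. Deeply optimize the performance of compute, storage, scheduling, and compilation pipelines, introducing and landing cutting-edge techniques such as JIT/AOT to support high-throughput, low-latency algorithm scenarios.
4. Operate and optimize core components such as Zookeeper, Nginx, and message queues, ensuring stability and performance in very large-scale distributed environments.
5. Drive the standardization, containerization, and cloud-native migration of non-standard services, using Kubernetes to build a scalable, automated delivery and operations system that supports gradual (canary) rollouts.
Updated 2025-08-27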
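Experienced hire | 3+ years | Technical - Operations
1. Responsible for product technical solutions, pre-sales onboarding and integration, and post-sales support and consulting for Alibaba Local Services (阿里本地生活) industry products.
2. Engage deeply with multi-role business scenarios (users, merchants, couriers, ISVs) according to the characteristics of industries such as food delivery and retail, providing targeted technical assurance services.
3. Go deep in one or more technical areas such as incident response, risk identification, monitoring and detection, and experience governance; turn these capabilities into platforms and replicate them across scenarios to solve real-world problems and improve the experience of users across all domains.
4. Building on these technical capabilities and the existing technical assurance system, design and develop a technical assurance platform that reflects industry characteristics and challenges, define stability strategies and overall plans, and continuously uncover needs, pain points, and opportunities for innovation.
Updated 2025-08-22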

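Experienced hire | Technical
1. Responsible for operating the business pipelines related to the big data platform and the algorithm platform.
2. Handle stability and related issues in the big data ecosystem, keeping clusters running efficiently, stably, and economically.
3. Stay engaged with open-source communities, identifying features useful for business scenarios and bringing them into production, or contributing internally validated features back to the community.
Updated 2023-12-26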