腾讯云计算稳定性治理后台研发工程师 -深圳/杭州
社招全职3年以上CSIG技术地点:深圳状态:招聘
任职要求
1.熟悉云计算IaaS/PaaS控制面常用架构,具备1年以上云平台控制面研发经验; 2.熟悉云计算帐号、认证、授权、计费等公共服务体系; 3.熟悉Redis、Kafka、Mysql等中间件和数据库服务,熟悉相关中间件的高可用架构设计; 4.熟练掌握Python/go等后台开发语言; 5.熟悉Prometheus/Grafana等监控工具,熟悉Gitlab CI/Github Action/ArgoCD等的使用; 6.了解Kubernetes,云原生基本理念; 7.有AI Agent开发经验,有研发效能工具研发经验者优先。 加分项 1.在同等条件下,通过腾讯云认证或取得同等资格认证的候选人,我们会优先考虑。
工作职责
1.负责稳定性治理工具体系的研发, 包括但不限于拨测、风险扫描等; 2.协助IaaS/PaaS产品研发效能提升,提升整体研发效率和交付质量,包括但不限于研发支撑、工具开发、流程和方法的优化与改进,提升研发和工程生产力和效率; 3.协助业务发现和解决实际的技术问题,提供技术支持和工程赋能,确保团队的技术能力和知识水平的提升,改善研发环境和体验。
包括英文材料
IaaS+
https://www.ibm.com/think/topics/iaas
https://www.youtube.com/watch?v=XRdmfo4M_YA
PaaS+
https://www.ibm.com/cn-zh/think/topics/paas
平台即服务 (PaaS) 是一种云计算模型,提供完整的按需云平台(硬件、软件和基础设施),用于开发、运行和管理应用程序。
https://www.ibm.com/think/topics/paas
https://www.youtube.com/watch?v=QAbqJzd0PEE
Redis+
[英文] Developer Hub
https://redis.io/dev/
Get all the tutorials, learning paths, and more you need to start building—fast.
https://www.runoob.com/redis/redis-tutorial.html
REmote DIctionary Server(Redis) 是一个由 Salvatore Sanfilippo 写的 key-value 存储系统,是跨平台的非关系型数据库。
https://www.youtube.com/watch?v=jgpVdJB2sKQ
In this video I will be covering Redis in depth from how to install it, what commands you can use, all the way to how to use it in a real world project.
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
MySQL+
https://juejin.cn/post/7190306988939542585
这是一篇 MySQL 通关一篇过硬核经验学习路线,包括数据库相关知识,SQL语句的使用,数据库约束,设计等。
[英文] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
中间件+
https://www.youtube.com/watch?v=1oWPUpMheGk
高可用+
https://redis.io/blog/high-availability-architecture/
A high available architecture is when there are a number of different components, modules, or services that work together to maintain optimal performance, irrespective of peak-time loads.
https://www.ibm.com/think/topics/high-availability
High availability (HA) is a term that refers to a system’s ability to be accessible and reliable close to 100% of the time.
系统设计+
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
Prometheus+
https://grafana.com/docs/grafana/latest/getting-started/get-started-grafana-prometheus/
Prometheus is an open source monitoring system for which Grafana provides out-of-the-box support.
https://prometheus.io/docs/tutorials/getting_started/
Prometheus is a system monitoring and alerting system.
Grafana+
CI+
https://www.ibm.com/cn-zh/think/topics/continuous-integration
持续集成 (CI) 是一种软件开发实践,开发人员在整个开发周期中会定期将新的代码和代码变更集成到中央代码存储库中。它是 DevOps 和敏捷方法的关键组成部分。
https://www.youtube.com/watch?v=42UP1fxi2SY
Kubernetes+
https://kubernetes.io/docs/tutorials/kubernetes-basics/
This tutorial provides a walkthrough of the basics of the Kubernetes cluster orchestration system.
https://kubernetes.io/zh-cn/docs/tutorials/kubernetes-basics/
本教程介绍 Kubernetes 集群编排系统的基础知识。每个模块包含关于 Kubernetes 主要特性和概念的一些背景信息,还包括一个在线教程供你学习。
https://www.youtube.com/watch?v=s_o8dwzRlu4
Hands-On Kubernetes Tutorial | Learn Kubernetes in 1 Hour - Kubernetes Course for Beginners
https://www.youtube.com/watch?v=X48VuDVv0do
Full Kubernetes Tutorial | Kubernetes Course | Hands-on course with a lot of demos
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
DevOps+
https://roadmap.sh/devops
Step by step guide for DevOps, SRE or any other Operations Role in 2025
https://zhuanlan.zhihu.com/p/562036793
DevOps中的Dev指的是Development(开发),Ops指的是Operations(运维),用一句话来说,DevOps就是打通开发运维的壁垒,实现开发运维一体化。
后端开发+
https://www.youtube.com/watch?v=tN6oJu2DqCM&list=PLWKjhJtqVAbn21gs5UnLhCQ82f923WCgM
Learn what technologies you should learn first to become a back end web developer.
GitLab+
https://docs.gitlab.com/tutorials/
Learn about GitLab fundamentals by following guided instructions.
GitHub+
[英文] GitHub Learn
https://learn.github.com/
Discover a wide range of beginner-friendly tutorials, hands-on learning, and expert-led lessons.
Argo+
https://argo-cd.readthedocs.io/en/stable/understand_the_basics/
Before effectively using Argo CD, it is necessary to understand the underlying technology that the platform is built on.
https://www.youtube.com/watch?v=MeU5_k9ssrs
The ArgoCD chapter includes building a pipeline of dynamically updating & building a new application version using GitLab downstream pipeline feature.
相关职位
社招5年以上A234232
1、负责云平台上计算型相关产品(如云主机、容器)的后台系统等核心系统研发工作; 2、负责设计并实现计算资源大池化体系,支持裸金属、虚拟机、容器等多种形态的计算资源的管理和调度,提升资源流转和使用效率; 3、负责持续改善服务质量、提高系统稳定性和可用性,增强线上产品质量,通过工具和系统上提升团队研发效率,并对重点及有难度的技术进行攻坚; 4、学习研究业界先进技术,保持技术进步,对所负责的模块范围进行技术规划并使其落地。
更新于 2024-08-12
社招4年以上IDG
-主导自动驾驶信息安全产品服务端后台的设计和研发工作,确保项目的顺利进行和高质量交付 -根据需求进行功能模块化拆解、设计文档编写、编码、单元测试,确保系统的稳定性与可扩展性 -负责后端模块及服务在可扩展、高可用、高并发、可运维等方向的技术优化 -撰写并更新系统相关文档,为团队提供完整、准确的技术支持
更新于 2024-04-16
社招5年以上D2068
1、作为公司技术中后台的项目经理,以项目BP的角色负责研发线内项目全生命周期的管理和交付,能管理和协调跨团队,跨部门的大型项目/项目集并支持业务负责人的OKR管理、战略规划、会议体系设计等工作; 2、作为数据战略方向的项目经理,可以与前台业务团队、各中后台团队进行良好的跨团队协同,保证公司技术战略的切实落地,对结果负责; 3、结合业务和团队特点,能深入洞察项目管理过程中影响结果交付、团队协作效率方面的问题,并提出专业的优化方案与机制,推进落地,促进体系完善和对团队进行赋能。
更新于 2025-02-17