腾讯微服务稳定性研发工程师
社招全职5年以上CSIG技术地点:杭州状态:招聘
任职要求
1.熟练掌握 java/go/c++ 至少一门开发语言,熟悉高并发,操作系统,分布式通信,数据库等相关技术; 2.本科以上学历,5 年以上相关工作经验; 3.熟悉高可用性设计、容错机制、限流、熔断、重试等微服务的核心稳定性技术; 4.良好的团队协作和沟通能力,能够在高压环境下快速定位问题并提供解决方案; 5.熟悉开源的中间件软件;至少熟悉 1~2 个相关开源产品(包括但不限于 SpringCloud、SpringBoot、gRPC、Dubbo、nacos、Istio、apisix、Kong、K8s、envoy),有相关开发扩展及优化经验者优先; 6.熟悉中间件,RPC框架,API网关和 K8s,性能优化和稳定性等相关技术经验者优先; 7.具有高并发相关工作经验优先,熟悉SpringCloud,有 SpringCloud 相关开发及优化经验者优先。
工作职责
1.负责腾讯云微服务产品(包括不限于微服务平台、注册配置中心、微服务框架、服务网格、云原生网关、Serverless等)的设计,核心代码开发,系统稳定性开发和性能优化等工作; 2.深入理解不同行业的系统架构和微服务应用场景,输出可复制、可扩展的产品技术方案,帮助腾讯云微服务产品在更多企业客户中产生价值; 3.稳定性架构设计与优化,构建和优化资源管理、服务发现、负载均衡、流量管理、容错机制等关键组件,确保系统的可扩展性和健壮性; 4.持续监控和改进微服务系统的可观测性,分析并解决微服务运行中的性能瓶颈和不稳定因素,并利用数据驱动方法解决系统故障、性能问题; 5.具备大规模微服务架构实践经验,尤其是在高并发、大流量系统下的微服务稳定性建设。
包括英文材料
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
高并发+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
学历+
高可用+
https://redis.io/blog/high-availability-architecture/
A high available architecture is when there are a number of different components, modules, or services that work together to maintain optimal performance, irrespective of peak-time loads.
https://www.ibm.com/think/topics/high-availability
High availability (HA) is a term that refers to a system’s ability to be accessible and reliable close to 100% of the time.
微服务+
https://learn.microsoft.com/en-us/training/modules/dotnet-microservices/
Microservice applications are composed of small, independently versioned, and scalable customer-focused services that communicate with each other by using standard protocols and well-defined interfaces.
https://microservices.io/
Microservices - also known as the microservice architecture - is an architectural style that structures an application as a collection of two or more services.
https://spring.io/microservices
Building small, self-contained, ready to run applications can bring great flexibility and added resilience to your code.
https://www.ibm.com/think/topics/microservices
Microservices, or microservices architecture, is a cloud-native architectural approach in which a single application is composed of many loosely coupled and independently deployable smaller components or services.
https://www.youtube.com/watch?v=CqCDOosvZIk
https://www.youtube.com/watch?v=hmkF77F9TLw
Learn about software system design and microservices.
中间件+
https://www.youtube.com/watch?v=1oWPUpMheGk
Spring Cloud+
[英文] Spring Cloud Series
https://www.baeldung.com/spring-cloud-series
Learn Spring Cloud including concepts, additional libraries and examples for distributed systems.
Spring Boot+
https://spring.io/guides/gs/spring-boot
his guide provides a sampling of how Spring Boot helps you accelerate application development.
https://www.youtube.com/watch?v=Nv2DERaMx-4&list=PLzUMQwCOrQTksiYqoumAQxuhPNa3HqasL
The author teaches you how to use Spring Boot from a complete beginner, to building a REST API with a real database, Dockerising it and deploying it to the cloud.
gRPC+
[英文] Introduction to gRPC
https://grpc.io/docs/what-is-grpc/introduction/
An introduction to gRPC and protocol buffers.
Dubbo+
https://dubbo.apache.org/en/overview/mannual/java-sdk/tasks/
Nacos+
https://nacos.io/docs/latest/overview/
Nacos 是 Dynamic Naming and Configuration Service 的首字母简称,一个易于构建 AI Agent 应用的动态服务发现、配置管理和AI智能体管理平台。
Istio+
https://istio.io/latest/docs/examples/microservices-istio/
This modular tutorial provides new users with hands-on experience using Istio for common microservices scenarios, one step at a time.
https://www.freecodecamp.org/news/learn-istio-manage-microservices/
In a world without Istio, one service makes direct requests to another and in case of failures, the service is responsible for handling those.
Kubernetes+
https://kubernetes.io/docs/tutorials/kubernetes-basics/
This tutorial provides a walkthrough of the basics of the Kubernetes cluster orchestration system.
https://kubernetes.io/zh-cn/docs/tutorials/kubernetes-basics/
本教程介绍 Kubernetes 集群编排系统的基础知识。每个模块包含关于 Kubernetes 主要特性和概念的一些背景信息,还包括一个在线教程供你学习。
https://www.youtube.com/watch?v=s_o8dwzRlu4
Hands-On Kubernetes Tutorial | Learn Kubernetes in 1 Hour - Kubernetes Course for Beginners
https://www.youtube.com/watch?v=X48VuDVv0do
Full Kubernetes Tutorial | Kubernetes Course | Hands-on course with a lot of demos
Envoy+
https://www.envoyproxy.io/docs/envoy/latest/start/start
This section gets you started with a very simple configuration and provides some example configurations.
https://www.youtube.com/watch?v=E-UpGmj6B9M
Envoy is a key part of a number of service meshes currently on the market, including Istio and the Kuma CNCF Sandbox project.
RPC+
https://javaguide.cn/distributed-system/rpc/rpc-intro.html
为什么要 RPC ? 因为,两个不同的服务器上的服务提供的方法不在一个内存空间,所以,需要通过网络编程才能传递方法调用所需要的参数。并且,方法调用的结果也需要通过网络编程来接收。
https://www.youtube.com/watch?v=S2osKiqQG9s
This video is part of an 8-lecture series on distributed systems, given as part of the undergraduate computer science course at the University of Cambridge.
相关职位
实习J1014
1、建设支撑快手百万微服务业务系统的智能运维平台及产品,涵盖基于优秀DevOps理念的自动化持续发布与部署平台,基于PB级海量数据的链路追踪、智能定位技术的监控平台,基于先进大数据分析、人工智能技术的稳定性保障平台; 2、负责超大规模平台和系统建设,包括技术选型、架构设计、研发工作,通过敏捷开发支持产品需求快速迭代,不断优化系统架构,支撑业务规模增长,保障服务稳定; 3、跟进业界先进理论和实践,结合公司实际业务需求,打造业界领先的智能运维体系。
更新于 2025-03-04
校招工程-后端类
1、负责公司统一服务发现系统, RPC 框架的开发与维护 2、深入业务,与业务共建云原生的为微服务治理架构,向业务提供一系列解决服务稳定性问题的治理方案,涉及流量调度、限流、熔断等,积极探索双活治理、全链路超时重试治理、强弱依赖治理、业务架构治理等新方向 3、与业务方共建多活架构,全链路超时治理、重试治理、强弱依赖治理等核心治理能力。
更新于 2025-08-20
校招J1014
1、建设支撑快手百万微服务业务系统的智能运维平台及产品,涵盖基于优秀DevOps理念的自动化持续发布与部署平台,基于PB级海量数据的链路追踪、智能定位技术的监控平台,基于先进大数据分析、人工智能技术的稳定性保障平台; 2、负责超大规模平台和系统建设,包括技术选型、架构设计、研发工作,通过敏捷开发支持产品需求快速迭代,不断优化系统架构,支撑业务规模增长,保障服务稳定; 3、跟进业界先进理论和实践,结合公司实际业务需求,打造业界领先的智能运维体系。
更新于 2025-07-30