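TCL Operations Platform Development Engineer
Experienced hire · Full-time · 5+ years · R&D / Technical · Location: Shenzhen · Status: Hiring
Requirements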
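1. Programming: expert in Golang or Python, with experience developing and optimizing high-performance systems. CMDB development: familiar with the design and implementation of CMDB systems, including resource modeling, data collection, and asset management.
2. Multi-cloud management: familiar with API integration and automated management for cloud services such as AWS, Azure, and Alibaba Cloud; experience developing a unified multi-cloud resource management platform.
3. Container technology: familiar with Docker, Kubernetes, and other container technologies; able to design highly available systems based on a microservices architecture.
4. Automated operations: familiar with automation tools such as Ansible and Terraform; able to implement infrastructure as code (IaC).
5. Databases: familiar with the design and optimization of databases such as MySQL, PostgreSQL, and MongoDB; experience with large-scale data storage and querying.
6. System operations: familiar with Linux, with a solid grasp of common network protocols (such as HTTP and TCP/IP) and strong system tuning skills.
7. Education: bachelor's degree or above in a computer-related field.
8. Work experience: 5+ years of experience in operations platform development; experience building operations platforms at large enterprises is preferred.
9. Language: good English reading skills; able to read technical documentation fluently.
Bonus points
Experience contributing to open-source projects, or a personal technical blog.
Familiarity with monitoring and log analysis tools such as Prometheus and ELK.
Hands-on DevOps experience and familiarity with CI/CD workflows.
Experience developing large-scale distributed systems.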
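Responsibilities
1. Operations platform development: design and develop the company's internal operations platforms, including the CMDB system and the multi-cloud management platform.
2. CMDB system: design the resource modeling scheme and develop core features such as data collection, asset management, and lifecycle management.
3. Multi-cloud management: integrate the APIs of cloud services such as AWS, Azure, and Alibaba Cloud, and develop cross-cloud resource management and automated deployment features.
4. System optimization: optimize platform performance and improve system stability and scalability to ensure reliable operation under high concurrency.
5. Automated operations: develop automation tools and scripts, implement infrastructure as code (IaC), and improve operations efficiency.
6. Technical support: take part in day-to-day platform maintenance and technical support, and quickly locate and resolve system issues.
7. Teamwork: work closely with product managers and the operations team to deliver projects efficiently.
Includes English-language materials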
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
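A complete tutorial for learning Golang! Under an hour from start to finish, including a full demonstration of how to build an API in Go. No filler, just what you need to know.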
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, no prior experience required, complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
AWS+
https://aws.amazon.com/
Amazon Web Services offers reliable, scalable, and inexpensive cloud computing services. Free to join, pay only for what you use.
Azure+
https://azure.microsoft.com/
Invent with purpose, realize cost savings, and make your organization more efficient with Microsoft Azure’s open and flexible cloud computing platform.
Docker+
https://www.youtube.com/watch?v=GFgJkfScVNU
Master Docker in one course; learn about images and containers on Docker Hub, running multiple containers with Docker Compose, automating workflows with Docker Compose Watch, and much more. 🐳
https://www.youtube.com/watch?v=kTp5xUtcalw
Learn how to use Docker and Kubernetes in this complete hands-on course for beginners.
Kubernetes+
https://kubernetes.io/docs/tutorials/kubernetes-basics/
This tutorial provides a walkthrough of the basics of the Kubernetes cluster orchestration system.
https://kubernetes.io/zh-cn/docs/tutorials/kubernetes-basics/
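This tutorial introduces the basics of the Kubernetes cluster orchestration system. Each module contains background information on major Kubernetes features and concepts, along with an online tutorial for you to work through.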
https://www.youtube.com/watch?v=s_o8dwzRlu4
Hands-On Kubernetes Tutorial | Learn Kubernetes in 1 Hour - Kubernetes Course for Beginners
https://www.youtube.com/watch?v=X48VuDVv0do
Full Kubernetes Tutorial | Kubernetes Course | Hands-on course with a lot of demos
Microservices+
https://learn.microsoft.com/en-us/training/modules/dotnet-microservices/
Microservice applications are composed of small, independently versioned, and scalable customer-focused services that communicate with each other by using standard protocols and well-defined interfaces.
https://microservices.io/
Microservices - also known as the microservice architecture - is an architectural style that structures an application as a collection of two or more services.
https://spring.io/microservices
Building small, self-contained, ready to run applications can bring great flexibility and added resilience to your code.
https://www.ibm.com/think/topics/microservices
Microservices, or microservices architecture, is a cloud-native architectural approach in which a single application is composed of many loosely coupled and independently deployable smaller components or services.
https://www.youtube.com/watch?v=CqCDOosvZIk
https://www.youtube.com/watch?v=hmkF77F9TLw
Learn about software system design and microservices.
System design+
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
High availability+
https://redis.io/blog/high-availability-architecture/
A highly available architecture is when there are a number of different components, modules, or services that work together to maintain optimal performance, irrespective of peak-time loads.
https://www.ibm.com/think/topics/high-availability
High availability (HA) is a term that refers to a system’s ability to be accessible and reliable close to 100% of the time.
Ansible+
https://docs.ansible.com/ansible/latest/getting_started/index.html
Ansible automates the management of remote systems and controls their desired state.
Terraform+
https://developer.hashicorp.com/terraform/tutorials
Build, change, and destroy infrastructure with Terraform. Start here to learn the basics of Terraform with your favorite cloud provider.
https://www.youtube.com/watch?v=_45W3Z8XWL4
In this video you will learn the basics of using Terraform.
MySQL+
https://juejin.cn/post/7190306988939542585
A hardcore, all-in-one MySQL learning roadmap covering database fundamentals, SQL statement usage, database constraints, design, and more.
[English] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
PostgreSQL+
[English] PostgreSQL Tutorial
https://neon.com/postgresql/tutorial
This PostgreSQL tutorial helps you quickly understand PostgreSQL.
[English] PostgreSQL Tutorial
https://www.pgtutorial.com/
This PostgreSQL tutorial will teach you about PostgreSQL from beginner to advanced.
https://www.youtube.com/watch?v=qw--VYLpxG4
It is the most advanced open source database system widely used to build back-end systems.
https://www.youtube.com/watch?v=SpfIwlAYaKk
Learn PostgreSQL, one of the world's most advanced and robust open-source relational database systems.
MongoDB+
https://learnxinyminutes.com/mongodb/
MongoDB is a NoSQL document database for high volume data storage.
https://studio3t.com/academy/#courses
The fastest way to learn MongoDB
https://www.youtube.com/watch?v=c2M-rlkkT5o
This video will give you an introduction to MongoDB in 1 hour. Afterwards I recommend exploring aggregation, replication, and sharding.
https://www.youtube.com/watch?v=ExcRbA7fy_A&list=PL4cUxeGkcC9h77dJ-QJlwGlZlTd4ecZOA
You'll learn how to use MongoDB (a NoSQL database) from scratch. You'll also learn how to integrate it into a simple Node.js API.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
HTTP+
https://developer.mozilla.org/zh-CN/docs/Web/HTTP
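The Hypertext Transfer Protocol (HTTP) is an application-layer protocol for transmitting hypermedia documents, such as HTML. It was designed for communication between web browsers and web servers, but it can also be used for other purposes.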
TCP/IP+
[English] What is TCP/IP?
https://www.techtarget.com/searchnetworking/definition/TCP-IP
TCP/IP stands for Transmission Control Protocol/Internet Protocol and is a suite of communication protocols used to interconnect network devices on the internet.
Education+
Prometheus+
https://grafana.com/docs/grafana/latest/getting-started/get-started-grafana-prometheus/
Prometheus is an open source monitoring system for which Grafana provides out-of-the-box support.
https://prometheus.io/docs/tutorials/getting_started/
Prometheus is a system monitoring and alerting system.
ELK+
https://logz.io/learn/complete-guide-elk-stack/
With millions of downloads for its various components since first being introduced, the ELK Stack is the world’s most popular log management platform.
https://www.baeldung.com/ops/elk
In this tutorial, we’ll learn about the basics of the ELK stack.
https://www.youtube.com/watch?v=jk4RoEYCZTo
Explains how to install and configure the ELK (Elasticsearch, Logstash, Kibana) Stack, a log management solution for analyzing and visualizing your data.
DevOps+
https://roadmap.sh/devops
Step by step guide for DevOps, SRE or any other Operations Role in 2025
https://zhuanlan.zhihu.com/p/562036793
In DevOps, Dev stands for Development and Ops stands for Operations. In one sentence, DevOps breaks down the barrier between development and operations and unifies the two.
CI+
https://www.ibm.com/cn-zh/think/topics/continuous-integration
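Continuous integration (CI) is a software development practice in which developers regularly integrate new code and code changes into a central code repository throughout the development cycle. It is a key component of DevOps and agile methodologies.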
https://www.youtube.com/watch?v=42UP1fxi2SY
CD+
https://www.redhat.com/zh-cn/topics/devops/what-is-ci-cd
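CI/CD stands for continuous integration and continuous delivery/deployment, and aims to streamline and accelerate the software development lifecycle.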
https://www.youtube.com/watch?v=R8_veQiYBjI&list=PLy7NrYWoggjzSIlwxeBbcgfAdYoxCIrM2
Distributed systems+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
Related positions
Experienced hire · Software development
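1. Develop the operations platform for very large-scale production k8s clusters, turning operations functions into platform features, including but not limited to deployment, upgrades, configuration, nodes, networking, permissions, and eBPF observability.
2. Deliver high-quality code on time according to the project plan and complete development tasks.
3. Write and maintain documentation to standard, and handle other project-related work.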
Updated 2025-06-16
Experienced hire · 5+ years · R&D
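I.
1. Design and develop operations management platforms, including the cloud management platform, CMDB, ticketing system, job platform, monitoring platform, and container platform.
2. Participate in requirements analysis, architecture selection, and technical design reviews for the operations platform.
3. Tackle technical challenges in projects, and lead and drive the analysis and resolution of technical failures in production systems.
4. Write, review, and inspect core module code during development to improve development efficiency and code quality.
5. Identify operations pain points, keep exploring new technical directions, improve the design and development of operations tools, and continuously improve the efficiency, quality, and cost of operations work.
II.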
Updated 2025-06-27

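Experienced hire · 3+ years
1. Front-end and back-end development for server operations management, including the server ticket delivery platform, parts management platform, and fault ticket platform.
2. Optimize and iterate on the existing operations platform and handle release deployment.
3. Wrap and develop APIs for server operations tasks.
Updated 2023-05-12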
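Experienced hire · 2+ years · Software development
1. Build, design, develop, deploy, upgrade, and maintain the unified IT operations platform, the monitoring and alerting system, and the DevOps automation platform, including but not limited to the monitoring and alerting system, logging system, capacity management, CMDB resource management, configuration center, scheduling system, workflow system, and IM service platform.
2. Operations automation tooling: grounded in SRE work, understand the requirement background and business growth, and develop automation tools and platforms to improve efficiency.
3. SRE high-availability assurance: take part in incident response and stability optimization, and design systems that strengthen operations capabilities.
4. IT cost management, stability engineering, log analysis, uncovering latent problems, helping prepare contingency plans, and project follow-up.
5. Day-to-day application operations on-call and SRE duties, including configuration, optimization, backup, and troubleshooting.
Updated 2025-08-17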