快手资深游戏运维工程师-【游戏事业部】
社招全职5年以上D10501地点:杭州状态:招聘
任职要求
1、5年以上工作经验,熟悉Linux操作系统运维,具有快速定位问题和问题解决能力; 2、计算机或相关专业本科或以上学历; 3、精通掌握Shell,熟悉Python或者Go语言者优先; 4、熟练掌握Linux操作系统的管理,具备RHCE者优先; 5、具有主流公有云维护经验; 6、熟练掌握Zabbix,Prometheus,Grafana等监控工具; 7、熟悉容器技术,有良好的Kubernetes技术能力; 8、熟练掌握CICD中需要用到的开源工具如Jekins,Jira,Ansible,SaltStack,Supervisor等工具的运用; 9、练掌握Nodejs,Nginx,Tomcat等; 10、熟练配置主流中间件,包括但不限于Kafka,Es,Etcd,Zookeeper等; 11、熟悉数据库架构及SQL; 12、有大型互联网公司自动化运维经验,了解存储、负载均衡设备配置者优先; 13、有一定的网络知识,熟悉TCP/IP等网络协议; 14、有良好的逻辑分析能力和学习能力。
工作职责
1、负责手游的技术运维工作,保障服务处于高可用的水平,包括日常的部署、升级、扩容、缩容、告警处理、故障解决、容灾演练等; 2、负责运维相关提升开发运营效率的支撑工具的开发; 3、负责跟进移动游戏业务从接入到上线全过程的技术运营解决方案; 4、解决业务上线过程及长期运营中,遇到的技术问题,发现运营风险及技术难点,优化业务流程,优化游戏架构,提升运维工作效率和服务质量; 5、与项目组建立良好的合作沟通与信任关系,对业务存在的风险问题进行定期反馈和改善跟进,保障业务稳定运行。
包括英文材料
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
学历+
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
Prometheus+
https://grafana.com/docs/grafana/latest/getting-started/get-started-grafana-prometheus/
Prometheus is an open source monitoring system for which Grafana provides out-of-the-box support.
https://prometheus.io/docs/tutorials/getting_started/
Prometheus is a system monitoring and alerting system.
Grafana+
Kubernetes+
https://kubernetes.io/docs/tutorials/kubernetes-basics/
This tutorial provides a walkthrough of the basics of the Kubernetes cluster orchestration system.
https://kubernetes.io/zh-cn/docs/tutorials/kubernetes-basics/
本教程介绍 Kubernetes 集群编排系统的基础知识。每个模块包含关于 Kubernetes 主要特性和概念的一些背景信息,还包括一个在线教程供你学习。
https://www.youtube.com/watch?v=s_o8dwzRlu4
Hands-On Kubernetes Tutorial | Learn Kubernetes in 1 Hour - Kubernetes Course for Beginners
https://www.youtube.com/watch?v=X48VuDVv0do
Full Kubernetes Tutorial | Kubernetes Course | Hands-on course with a lot of demos
Ansible+
https://docs.ansible.com/ansible/latest/getting_started/index.html
Ansible automates the management of remote systems and controls their desired state.
SaltStack+
https://www.digitalocean.com/community/tutorials/an-introduction-to-saltstack-terminology-and-concepts
Salt, or SaltStack, is a remote execution tool and configuration management system.
[英文] SaltStack Tutorial
https://www.tutorialspoint.com/saltstack/index.htm
SaltStack is an open-source configuration management and remote execution engine.
Node.js+
https://liaoxuefeng.com/books/javascript/nodejs/index.html
从本章开始,我们就正式开启JavaScript的后端开发之旅。
https://www.youtube.com/watch?v=32M1al-Y6Ag
This is an intro to Node.js. No frameworks or libraries.
https://www.youtube.com/watch?v=zb3Qk8SG5Ms&list=PL4cUxeGkcC9jsz4LDYc6kv3ymONOKxwBU
In this Node JS tutorial I'll introduce to what exactly Node is all about, why we'd use it and the technologies you'll need to be familiar with to get started.
Nginx+
[英文] Beginner’s Guide
https://nginx.org/en/docs/beginners_guide.html
This guide gives a basic introduction to nginx and describes some simple tasks that can be done with it.
https://www.youtube.com/watch?v=9t9Mp0BGnyI
NGINX is open-source web server software used for reverse proxy, load balancing, and caching. It's important to understand, especially if you are a backend developer.
Tomcat+
https://pdai.tech/md/framework/tomcat/tomcat-x-design-web-container.html
我们通过学习如何设计一个最基本的web容器来看它需要考虑什么;进而在真正学习Tomcat时,多把重点放在它的顶层设计上,而不是某一块代码上, 思路永远比具体实现重要的多。
https://www.youtube.com/watch?v=rElJIPRw5iM
This content is about Tomcat, a popular web server used by Java programmers and system administrators.
中间件+
https://www.youtube.com/watch?v=1oWPUpMheGk
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
TCP/IP+
[英文] What is TCP/IP?
https://www.techtarget.com/searchnetworking/definition/TCP-IP
TCP/IP stands for Transmission Control Protocol/Internet Protocol and is a suite of communication protocols used to interconnect network devices on the internet.
相关职位
社招5年以上D6918
1、负责服务器底层引擎开发维护,完成各种游戏类型的服务器底层功能开发; 2、负责服务器架构设计调优,查改一些瓶颈的地方; 3、根据不同类型游戏需求,设计完善服务器框架; 4、指导gameplay同学更好更高效的完成逻辑开发,并根据反馈优化框架; 5、设计搭建运维、经分等系统以及指导相关模块的对接。
更新于 2025-03-18

社招5年以上技术支持类
1、负责游戏/应用系统部署搭建、版本更新、系统监控、日常维护和故障应急响应; 2、根据业务重要等级,推动关键业务规避单点问题,并具有高可用、多机房容灾的能力; 3、推动及开发自动化运维、管理平台建设,提升运维工作效率; 4、参与系统架构的可运维性设计,管理完善监控报警平台,提升稳定性保障; 5、负责收集工作中的痛点和不足,总结经验,优化流程,持续提升运维的质量和效率。
更新于 2025-10-16
社招4年以上A126562
1、参与游戏业务的离线和实时数据仓库建设和运维; 2、参与数据ETL流程的优化并解决ETL相关技术问题; 3、参与复杂数据链路依赖和多元数据内容生态下的数据治理工作; 4、依托公司成熟的大数据方案,在业务快速落地数据解决方案。
更新于 2024-07-01