字节跳动数据库基础平台研发工程师
社招全职1年以上A146634地点:杭州状态:招聘
任职要求
1、本科及以上,计算机相关专业,熟悉Go/Java/Python语言中的一种,熟悉常用框架,1年以上工程实践经验,对代码质量有追求; 2、有大规模高并发、高可用应用的架构设计和开发运维经验,对可靠性、性能、可用性等有深刻理解; 3、掌握K8S、Etcd、Nginx、Prometheus等容器相关组件的运维管理,具备丰富的优化、排障经验,有源码级的了解更佳; 4、熟悉至少一种分布式系统(数据库、表格、缓存、消息队列、对象、块等) 。
工作职责
1、负责构建分布式控制平台,统一管理海量的线上主机以及上面的各类数据库服务; 2、负责构建分布式中间件,为数据库业务提供各种公共服务和统一的容器底座,赋能业务开发; 3、负责构建监控告警系统,实时监控各业务运行状态,提供秒级的响应报警以及安全审计保障; 4、负责运营平台、运维平台、故障诊断、Devops等各类系统开发。
包括英文材料
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
高并发+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
高可用+
https://redis.io/blog/high-availability-architecture/
A high available architecture is when there are a number of different components, modules, or services that work together to maintain optimal performance, irrespective of peak-time loads.
https://www.ibm.com/think/topics/high-availability
High availability (HA) is a term that refers to a system’s ability to be accessible and reliable close to 100% of the time.
系统设计+
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
Kubernetes+
https://kubernetes.io/docs/tutorials/kubernetes-basics/
This tutorial provides a walkthrough of the basics of the Kubernetes cluster orchestration system.
https://kubernetes.io/zh-cn/docs/tutorials/kubernetes-basics/
本教程介绍 Kubernetes 集群编排系统的基础知识。每个模块包含关于 Kubernetes 主要特性和概念的一些背景信息,还包括一个在线教程供你学习。
https://www.youtube.com/watch?v=s_o8dwzRlu4
Hands-On Kubernetes Tutorial | Learn Kubernetes in 1 Hour - Kubernetes Course for Beginners
https://www.youtube.com/watch?v=X48VuDVv0do
Full Kubernetes Tutorial | Kubernetes Course | Hands-on course with a lot of demos
Nginx+
[英文] Beginner’s Guide
https://nginx.org/en/docs/beginners_guide.html
This guide gives a basic introduction to nginx and describes some simple tasks that can be done with it.
https://www.youtube.com/watch?v=9t9Mp0BGnyI
NGINX is open-source web server software used for reverse proxy, load balancing, and caching. It's important to understand, especially if you are a backend developer.
Prometheus+
https://grafana.com/docs/grafana/latest/getting-started/get-started-grafana-prometheus/
Prometheus is an open source monitoring system for which Grafana provides out-of-the-box support.
https://prometheus.io/docs/tutorials/getting_started/
Prometheus is a system monitoring and alerting system.
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
缓存+
https://hackernoon.com/the-system-design-cheat-sheet-cache
The cache is a layer that stores a subset of data, typically the most frequently accessed or essential information, in a location quicker to access than its primary storage location.
https://www.youtube.com/watch?v=bP4BeUjNkXc
Caching strategies, Distributed Caching, Eviction Policies, Write-Through Cache and Least Recently Used (LRU) cache are all important terms when it comes to designing an efficient system with a caching layer.
https://www.youtube.com/watch?v=dGAgxozNWFE
消息队列+
https://www.youtube.com/watch?v=xErwDaOc-Gs
相关职位
社招1年以上A31800A
1、负责构建分布式控制平台,统一管理海量的线上主机以及上面的各类数据库服务; 2、负责构建分布式中间件,为数据库业务提供各种公共服务和统一的容器底座,赋能业务开发; 3、负责构建监控告警系统,实时监控各业务运行状态,提供秒级的响应报警以及安全审计保障; 4、负责运营平台、运维平台、故障诊断、Devops等各类系统开发。
更新于 2024-01-16
社招3年以上A73594A
1、负责为数据库公有云、混合云打造统一底座,为 ToB 云产品提供统一的基础设施支持,支撑云产品对外输出; 2、负责提供通用的基础服务,包括集群管理、服务发现、云配置中心、资源调度、全链路监控、故障诊断、容量管理、灰度发布等重要能力,目标是构建无人值守的管控系统; 3、研究并提供数据库云上最佳实践,协同拉齐各个云产品的规范并推动落地,提升云产品的稳定性,减少开发和试错成本; 4、研究数据库多云管理、Serverless 等新型产品形态的基础架构,赋能数据库云产品。
更新于 2023-12-12
社招3年以上A230249
1、负责对基础平台k8s部署升级,并进行性能优化,参与k8s定制和改造工作,支撑业务混部、超卖和调度优化等需求; 2、负责构建监控告警等可观测平台,实时监控各业务运行状态,提供秒级的响应报警以及安全审计保障; 3、负责运营平台、应用变更运维平台、故障诊断、Devops等各类系统开发。
更新于 2023-12-12