腾讯腾讯云-服务器性能调优专家
社招全职5年以上腾讯云技术地点:深圳状态:招聘
任职要求
1.本科及以上学历,5年以上服务器、数据中心或分布式系统性能优化经验,有大规模集群调优经验者优先; 2.深入理解服务器硬件架构(x86/ARM)、操作系统原理及内核机制(进程调度、内存管理、I/O栈)。熟悉云计算平台的服务器性能优化,或有超算中心调优经验; 3.熟练使用性能分析工具链(如FlameGraph、eBPF、perf、Prometheus)及日志分析系统(ELK Stack); 4.具备脚本开发能力(Python/Shell),熟悉至少一种系统级编程语言(C/C++/Rust); 5.极强的逻辑分析与问题拆解能力,能够从海量数据中定位性能瓶颈。优秀的沟通能力,能清晰传递技术方案并推动跨团队协作。
工作职责
1.负责服务器整体性能(CPU、内存、存储、网络、I/O等)的深度分析与瓶颈定位,提出并实施优化方案; 2.针对高并发、低延迟、高吞吐量场景(如云计算、AI训练、大数据处理等),优化服务器硬件与软件的协同性能; 3.开发自动化性能监控与诊断工具,构建性能分析模型,实现问题预测与快速定位; 4.与硬件团队、软件架构师、内核开发人员及业务部门合作,推动性能优化方案落地(如NUMA调优、CPU调度策略、内存分级管理等); 5.支持客户或业务团队解决实际生产环境中的性能瓶颈问题,提供技术指导与优化报告; 6.跟踪服务器领域技术趋势(如DPU/IPU加速、CXL内存扩展、新型存储协议),探索性能提升的创新方向。
包括英文材料
学历+
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
内核+
https://www.youtube.com/watch?v=C43VxGZ_ugU
I rummage around the Linux kernel source and try to understand what makes computers do what they do.
https://www.youtube.com/watch?v=HNIg3TXfdX8&list=PLrGN1Qi7t67V-9uXzj4VSQCffntfvn42v
Learn how to develop your very own kernel from scratch in this programming series!
https://www.youtube.com/watch?v=JDfo2Lc7iLU
Denshi goes over a simple explanation of what computer kernels are and how they work, alonside what makes the Linux kernel any special.
eBPF+
https://ebpf.io/get-started/
eBPF is a revolutionary technology that can run sandboxed programs in the Linux kernel without changing kernel source code or loading a kernel module.
Perf+
https://perfwiki.github.io/main/
perf is powerful: it can instrument CPU performance counters, tracepoints, kprobes, and uprobes (dynamic tracing).
https://www.brendangregg.com/bpf-performance-tools-book.html
This book can help you get the most out of your systems and applications, helping you improve performance, reduce costs, and solve software issues.
[英文] perf Examples
https://www.brendangregg.com/perf.html
These are some examples of using the perf Linux profiler, which has also been called Performance Counters for Linux (PCL), Linux perf events (LPE), or perf_events.
https://www.youtube.com/watch?v=M6ldFtwWup0
Prometheus+
https://grafana.com/docs/grafana/latest/getting-started/get-started-grafana-prometheus/
Prometheus is an open source monitoring system for which Grafana provides out-of-the-box support.
https://prometheus.io/docs/tutorials/getting_started/
Prometheus is a system monitoring and alerting system.
脚本+
[英文] Scripting language
https://en.wikipedia.org/wiki/Scripting_language
https://zhuanlan.zhihu.com/p/571097954
一个脚本通常是解释执行而非编译。脚本语言通常都有简单、易学、易用的特性,目的就是希望能让程序员快速完成程序的编写工作。
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Rust+
https://www.youtube.com/watch?v=BpPEoZW5IiY
In this comprehensive Rust course for beginners, you will learn about the core concepts of the language and underlying mechanisms in theory.
https://www.youtube.com/watch?v=lzKeecy4OmQ
Full Rust 101 Crash Course for beginners.
https://www.youtube.com/watch?v=rQ_J9WH6CGk
相关职位
社招5年以上云智能集团
1、设计并实现高效的AIGC工程/图像/视频处理软硬件一体化方案,参与媒体计算产品全生命周期开发。 2、负责系统性能调优,识别并解决关键瓶颈,提升稳定性与效率。 3、开发和维护底层驱动、基础软件及图像/视频SDK,确保硬件(ASIC/FPGA/GPU)与应用高效协同。
更新于 2025-09-08
社招7年以上云智能集团
1.负责服务器GPU超节点软件系统方案,主导互连软件的架构设计、研发交付、应用优化(训练及推理场景下SHMEM技术,KV Cache,共享内存,互连传输软件)等, 参与模块实现,问题攻关; 2.参与下一代数据中心服务器超节点定义、如数据面软硬件协同方案; 3. 参与行业领先的互连标准定义,以及行业生态的推动及落地; 4. 参与创新研究,发表相关技术论文,申请专利。
更新于 2025-08-01
社招5年以上云智能集团
1. 负责智能网卡的网卡驱动和RDMA驱动开发和实现; 2. 负责智能网卡在AI智算,存储等领域软硬件结合优化,创新研究; 3. 通过智能网卡的软硬件创新与优化,包括高性能网络协议的硬件卸载优化,帮助云产品基础设施持续提升技术竞争力。
更新于 2025-08-07