百度分布式存储研发工程师(J78383)
社招全职ACG地点:北京状态:招聘
任职要求
-熟练掌握至少一种下列编程语言:C/C++、Java -熟悉常用的数据结构、算法设计 -熟悉存储设备、文件系统、Linux操作系统原理 -对分布式存储系统有浓厚的兴趣,并且善于学习、乐于去挑战在云计算环境下超大规模云存储系统面临的各种挑战 -富有激情和创造力,学习能力强,良好的团队合作能力 -有开放云产品研发或使用经验优先,包括Amazon AWS、Azure、GCE、aliyun -熟悉分布式系统理论,有大规模分布式系统设计架构经验(包括Hadoop/HDFS/Openstack/Ceph/mongodb/dynamodb/aws-s3/GFS/BigTable等),其中 熟悉ceph系统优先考虑 -熟悉数据库技术,有数据库内核或者nosql数据库的开发经验优先 -熟悉操作系统内核,特别是存储设备、文件系统等部分优先
工作职责
-设计、开发和优化公有云存储系统类产品,包括但不限于对象存储、分布式块存储服务、云消息队列服务、云Cache服务、关系型数据库、冷数据存储服务、数据传输服务等等 -开发和优化大规模高性能服务软件 -为百度开放云行业客户提供分布式存储技术和产品解决方案 -联动大数据、云计算、边缘云、视频云等多团队打造整体高性能解决方案
包括英文材料
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
AWS+
https://aws.amazon.com/
Amazon Web Services offers reliable, scalable, and inexpensive cloud computing services. Free to join, pay only for what you use.
Azure+
https://azure.microsoft.com/
Invent with purpose, realize cost savings, and make your organization more efficient with Microsoft Azure’s open and flexible cloud computing platform.
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
HDFS+
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.
https://www.ibm.com/cn-zh/think/topics/hdfs
Hadoop 分布式文件系统 (HDFS) 是一种管理大型数据集的文件系统,可在商用硬件上运行。
Ceph+
https://docs.ceph.com/en/squid/start/beginners-guide/
The purpose of A Beginner’s Guide to Ceph is to make Ceph comprehensible.
https://www.youtube.com/watch?v=oEKJnHAfSiw
DynamoDB+
https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/GettingStartedDynamoDB.html
You’ll learn how to connect to, create, and manage DynamoDB tables in the following sections.
https://dynobase.dev/dynamodb-tutorials/
Collection of tutorials and articles to help you solve problems, make decisions and understand DynamoDB.
https://www.hellointerview.com/learn/system-design/deep-dives/dynamodb
DynamoDB is a fully-managed, highly scalable, key-value service provided by AWS.
https://www.scylladb.com/learn/dynamodb/introduction-to-dynamodb/
Amazon DynamoDB is a cloud-native NoSQL primarily key-value database.
https://www.youtube.com/watch?v=2k2GINpO308
In this video, I explain to you the core concepts of dynamodb and walk you through the console.
内核+
https://www.youtube.com/watch?v=C43VxGZ_ugU
I rummage around the Linux kernel source and try to understand what makes computers do what they do.
https://www.youtube.com/watch?v=HNIg3TXfdX8&list=PLrGN1Qi7t67V-9uXzj4VSQCffntfvn42v
Learn how to develop your very own kernel from scratch in this programming series!
https://www.youtube.com/watch?v=JDfo2Lc7iLU
Denshi goes over a simple explanation of what computer kernels are and how they work, alonside what makes the Linux kernel any special.
NoSQL+
https://nosql-database.org/
Everything about NoSQL Systems – Types, Benefits, and Real-World Uses
https://piaosanlang.gitbooks.io/mongodb/content/section1.1.html
NoSQL(NoSQL = Not Only SQL ),即"不仅仅是SQL",指的是非关系型的数据库。是对不同于传统的关系型数据库管理系统的统称。
https://www.youtube.com/watch?v=0buKQHokLK8
NoSQL databases can operate in multiple modes: as key-value store, document store or wide column store.
相关职位
社招核心本地商业-基
为数仓和机器学习平台提供高可用、高可靠、超大规模的HDFS文件存储和Hive MetaStore元数据存储服务,解决海量文件带来的元数据瓶颈和成本问题,同时提供行业一流的可靠性可用性指标,实现数据跨机房容灾,实现数据分层流转,尽可能降低业务的容量管理成本
更新于 2025-06-22
社招ACG
-设计、开发和优化公有云存储系统类产品,包括但不限于对象存储、分布式块存储服务、云消息队列服务、云Cache服务、关系型数据库、冷数据存储服务、数据传输服务等等 -开发和优化大规模高性能服务软件 -为百度开放云行业客户提供分布式存储技术和产品解决方案 -联动大数据、云计算、边缘云、视频云等多团队打造整体高性能解决方案
更新于 2025-06-10