快手对象存储研发工程师/专家
社招全职5年以上D7194地点:北京状态:招聘
任职要求
1、本科及以上学历,5 年以上工作经验(分布式存储、数据库、大数据等领域); 2、熟练掌握 Java/Go 至少一种,对工程质量有很高的自我要求; 3、对分布式存储的稳定性、一致性、高性能、成本优化等方向有深入理解; 4、对技术有强烈的进取心,能快速掌握最前沿的技术,喜欢挑战性的工作; 5、有较强的责任心和抗压能力,有良好的沟通能力和团队精神。 具有以下条件者优先: 1、有大规模分布式存储经验者优先,包括 AWS S3、Azure、HDFS、HBase、Spanner、Dynamodb、BigTable 等; 2、有大数据生态体系经验者优先,熟悉至少两种相关组件(如 Yarn、Spark、Flink、Kafka、Hudi)的原理、架构和应用; 3、有云服务一线厂商从业经历者优先。
工作职责
1、负责快手自研 EB 级对象存储的研发工作,为业务提供海量、安全、低成本、高可靠、高可用的对象存储服务; 2、负责对象存储核心模块与技术难题的重点攻关,包括海量元数据的管理、存储分层、功能与体验上的持续改进等; 3、吸收业界的新理论和成果,结合最新的软/硬件技术(如 GPU Direct Storage、Burst Buffer 等),灵活运用至业务实现降本增效。
包括英文材料
学历+
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
AWS+
https://aws.amazon.com/
Amazon Web Services offers reliable, scalable, and inexpensive cloud computing services. Free to join, pay only for what you use.
Azure+
https://azure.microsoft.com/
Invent with purpose, realize cost savings, and make your organization more efficient with Microsoft Azure’s open and flexible cloud computing platform.
HDFS+
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.
https://www.ibm.com/cn-zh/think/topics/hdfs
Hadoop 分布式文件系统 (HDFS) 是一种管理大型数据集的文件系统,可在商用硬件上运行。
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
Yarn+
[英文] Introduction
https://yarnpkg.com/getting-started
Yarn is an established open-source package manager used to manage dependencies in JavaScript projects.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
相关职位
社招5年以上D7194
1、负责快手自研 EB 级对象存储的研发工作,为业务提供海量、安全、低成本、高可靠、高可用的对象存储服务; 2、负责对象存储核心模块与技术难题的重点攻关,包括海量元数据的管理、存储分层、功能与体验上的持续改进等; 3、吸收业界的新理论和成果,结合最新的软/硬件技术(如 GPU Direct Storage、Burst Buffer 等),灵活运用至业务实现降本增效。
更新于 2025-02-12
社招5年以上D7194
1、负责快手自研 EB 级对象存储的研发工作,为业务提供海量、安全、低成本、高可靠、高可用的对象存储服务; 2、负责对象存储核心模块与技术难题的重点攻关,包括海量元数据的管理、存储分层、功能与体验上的持续改进等; 3、吸收业界的新理论和成果,结合最新的软/硬件技术(如 GPU Direct Storage、Burst Buffer 等),灵活运用至业务实现降本增效。
更新于 2025-02-12
社招3年以上基础后端
参与公司分布式存储产品研发工作,支撑社交、推荐、搜索、电商、广告等核心业务场景; 负责产品能力建设,针对业务发展需要推进系统演进,提供高可用、高可靠、高性价比的存储产品; 学习和吸纳业界优秀技术和理论成果,积极探索和拓展新的产品能力,持续提升产品的技术和服务水平;
更新于 2025-08-05