快手高级数据研发工程师(主站)-【数据平台】
社招全职5年以上D11761地点:北京状态:招聘
任职要求
1、计算机相关专业,本科及以上学历,5年以上互联网数据开发经验,对数据建设与业务服务有足够的热情,有良好的业务合作意识; 2、熟悉Java、Scala、Python等任意一门高级编程语言,具有扎实的计算机科学功底,扎实的编程基础和数据结构算法基础,极强的编程能力和问题解决能力; 3、精通数据仓库架构及原理,具备大型数据仓库架构设计、模型设计和处理性能调优等相关经验; 4、熟练掌握离线、实时(非必需)大数据开发技术,对AB的数据架构有一定的了解,熟悉Hive、Spark、Flink 、ClickHouse、Doris、Kafka、Flink 等相关技术原理并有相关开发经验; 5、逻辑思维清晰严谨,善于交流,有良好的团队合作精神和协调推进能力,有与产品、客户端等多方密切配合的经验和意识; 6、有用户触达、搜索、推荐或者流量数据领域工作经验者优先,有一定后端工程能力者优先。
工作职责
1、理解业务场景和需求,设计开发对应的离线与实时数据仓库模型,给出高效数据解决方案,持续保障数据质量和规范性。
包括英文材料
学历+
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Scala+
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
系统设计+
https://roadmap.sh/system-design
Everything you need to know about designing large scale systems.
https://www.youtube.com/watch?v=F2FmTdLtb_4
This complete system design tutorial covers scalability, reliability, data handling, and high-level architecture with clear explanations, real-world examples, and practical strategies.
性能调优+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
ClickHouse+
[英文] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
Doris+
https://doris.apache.org/docs/gettingStarted/what-is-apache-doris
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
相关职位
社招3-5年D8193
1、参与快手主站核心业务功能的开发、架构设计和改造优化,深入发掘和分析业务需求,撰写高质量技术文档; 2、接受高并发、海量数据的挑战,分析和发现系统的优化点,负责推动系统的架构改造和可用性的提升; 3、对现有业务系统的架构进行抽象和升级,形成新的通用平台,提升研发效率; 4、具有一定的技术前瞻性,对技术有热情,善于研究分析业内主流产品技术实现。
更新于 2025-08-19
社招3-5年D4825
1、参与快手主站核心业务功能的开发、架构设计和改造优化,深入发掘和分析业务需求,撰写高质量技术文档; 2、接受高并发、海量数据的挑战,分析和发现系统的优化点,负责推动系统的架构改造和可用性的提升; 3、对现有业务系统的架构进行抽象和升级,形成新的通用平台,提升研发效率; 4、具有一定的技术前瞻性,对技术有热情,善于研究分析业内主流产品技术实现。
更新于 2025-09-10
社招2年以上D12087
我们是一群对技术有着极致追求的后端工程师,负责建设公司级的业务平台、业务中台,致力于为所有业务的开发提效。作为公司春节等大型活动的主力研发团队,加入我们即有机会参与百万级QPS的高并发项目挑战; 1、参与快手核心产品需求的开发,深入发掘和分析业务需求,撰写技术方案和系统设计,以及相关的代码开发; 2、接受高并发、海量数据的挑战,分析和发现系统的优化点,负责推动系统的性能和可用性的提升; 3、接受中台/平台系统设计与实现复杂度的挑战,分析和发现系统的优化点,负责推动系统的合理性、可靠性、可用性的提升; 4、为团队引入创新的技术、创新的解决方案,用创新的思路解决问题。
更新于 2025-07-22