快手高级Java开发工程师(开发治理平台)-【数据平台】
社招全职3年以上D2815地点:北京状态:招聘
任职要求
1、本科及以上学历,计算机相关专业,三年以上工作经验; 2、熟悉Hive、Spark、Flink、Clickhouse、Druid等开源大数据计算和分析引擎; 3、熟悉主流的Java开源框架,对Netty、 Spring等框架有深入的了解和使用; 4、精通多线程编程,熟悉常见的缓存、消息队列等中间件,熟悉MySQL; 5、热爱技术,对代码质量和开发规范有近乎苛刻的要求; 6、有足够的耐心梳理和解决复杂而又繁多的产品研发问题,善于沟通与团队协作。
工作职责
1、主导(参与)规划和设计大数据离线和实时开发治理产品的后端技术体系以及软件架构; 2、充分利用微服务、容器化等技术构建高可用、高扩展和低耦合高内聚的数据中台服务; 3、了解业界相关技术体系,为快手数据产品研发引入创造性的技术方案,解决面临的各种复杂问题和挑战。
包括英文材料
学历+
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
ClickHouse+
[英文] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Spring+
https://liaoxuefeng.com/books/java/spring/index.html
Spring是一个支持快速开发Java EE应用程序的框架。它提供了一系列底层容器和基础设施,并可以和大量常用的开源框架无缝集成,可以说是开发Java EE应用程序的必备。
https://spring.io/guides/gs/rest-service
https://spring.io/quickstart
Level up your Java code and explore what Spring can do for you.
多线程+
https://liaoxuefeng.com/books/java/threading/basic/index.html
和单线程相比,多线程编程的特点在于:多线程经常需要读写共享数据,并且需要同步。
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
缓存+
https://hackernoon.com/the-system-design-cheat-sheet-cache
The cache is a layer that stores a subset of data, typically the most frequently accessed or essential information, in a location quicker to access than its primary storage location.
https://www.youtube.com/watch?v=bP4BeUjNkXc
Caching strategies, Distributed Caching, Eviction Policies, Write-Through Cache and Least Recently Used (LRU) cache are all important terms when it comes to designing an efficient system with a caching layer.
https://www.youtube.com/watch?v=dGAgxozNWFE
消息队列+
https://www.youtube.com/watch?v=xErwDaOc-Gs
中间件+
https://www.youtube.com/watch?v=1oWPUpMheGk
MySQL+
https://juejin.cn/post/7190306988939542585
这是一篇 MySQL 通关一篇过硬核经验学习路线,包括数据库相关知识,SQL语句的使用,数据库约束,设计等。
[英文] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
相关职位
社招5-10年D11431
1、主导(参与)规划和设计快手新一代 Data + AI 生产管治平台的后端技术体系以及软件架构,包括 离线/实时开发平台、数据安全、数据地图、大模型数据同步/任务调度等系统; 2、充分利用模型微调、提示词工程、RAG等大模型技术构建智能开发 / 运维 / 治理等生产智能化能力; 3、充分利用微服务、容器化等技术构建高可用、高扩展和低耦合高内聚的数据中台服务; 4、了解业界相关技术体系,为快手数据产品研发引入创造性的技术方案,解决面临的各种复杂问题和挑战。
更新于 2025-08-25
社招J8UXV
1、负责字节跳动大数据平台的权限、审计等安全产品规划与建设,包括态势感知、权限管理、隐私保护和访问控制等,满足安全监管需求; 2、深入理解业务场景,与业务部门深度合作,设计架构并落地产品; 3、追求极致,探索数据安全治理的前沿方向,打造业内一流的数据治理产品体系; 4、探索设计基于大数据、机器学习、AI的智能数据安全系统与保护能力。
更新于 2022-01-06
社招4年以上软硬件服务-Sa
1、参与餐饮SaaS数据平台的整体架构建设工作,包括但不限于在线多维分析引擎、数据存储引擎、实时计算引擎、平台数据治理、数据服务、数据质量、数据产品等能力设计与研发等; 2、研究美团餐饮SaaS业务的数据特点,探索带来成本大幅优化的计算、存储方案,构建下一代智能报表系统的底层基础能力与产品通用解决方案; 3、理解数据湖、大数据分析引擎或数据库引擎工作原理,熟悉Parquet、ORC、Arrow等列存储技术方案,理解Doris、ClickHouse、Hive、Presto等至少一种分析引擎的工作原理,熟悉实时计算系统Flink、Storm、Spark至少一种计算框架的工作原理; 4、精通OLAP SQL优化与业务逻辑编排,对BI分析引擎有理解者优先
更新于 2025-04-17