快手高级Java开发工程师(分析平台)-【数据平台】
社招全职1-3年J0012地点:北京 | 上海状态:招聘
任职要求
1、本科及以上学历,计算机相关专业,3年以上后端研发经验,精通 Java 编程,具有扎实的多线程及分布式系统架构设计能力; 2、具备 LLM 应用开发实战经验,熟悉 Prompt Engineering,深入理解 RAG 流程及向量数据库应用,熟悉 LangChain、LangGraph4j、Spring …
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1、深度参与公司级智能数据分析平台的研发,负责智能问答(ChatBI)、Text2SQL、智能看板解读、智能配置看板等核心智能化能力的设计与实现; 2、基于 LangChain4j/LangGraph4j等框架构建企业级 AI 应用架构,利用 RAG、Agent、 Prompt Engineering、模型微调等技术,解决数据分析场景下的语义理解与复杂推理问题; 3、负责微服务架构的设计与优化,整合大数据 OLAP 引擎(如 Doris/ClickHouse)与 AI 服务,构建高性能、低延迟的数据中台服务; 4、跟进 LLM 在数据分析领域的最新进展(如 NL2SQL 优化、多轮对话上下文管理),将创新技术转化为可落地的产品能力,提升数据分析的准确率与效率。
包括英文材料
学历+
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
ClickHouse+
[英文] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
多线程+
https://liaoxuefeng.com/books/java/threading/basic/index.html
和单线程相比,多线程编程的特点在于:多线程经常需要读写共享数据,并且需要同步。
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
还有更多 •••
相关职位
社招3年以上D2815
1、参与公司级BI系统的研发工作,涉及取数、数据可视化和多维分析等核心数据分析能力建设; 2、充分利用微服务、大数据OLAP引擎、内存加速引擎等技术构建高可用、高扩展和低耦合高内聚的数据中台服务; 3、熟悉业界分析技术体系,为快手数据产品研发引入创造性的技术方案,解决面临的各种复杂问题和挑战。
更新于 2024-10-09上海
社招1-3年J0012
1、主导(参与)规划和设计快手新一代 Data + AI 生产管治平台的后端技术体系以及软件架构,包括 离线/实时开发平台、数据安全、数据地图、大模型数据同步/任务调度等系统; 2、充分利用模型微调、提示词工程、RAG等大模型技术构建智能开发 / 运维 / 治理等生产智能化能力; 3、充分利用微服务、容器化等技术构建高可用、高扩展和低耦合高内聚的数据中台服务; 4、了解业界相关技术体系,为快手数据产品研发引入创造性的技术方案,解决面临的各种复杂问题和挑战。
更新于 2026-03-17北京
社招3年以上D2815
1、主导(参与)规划和设计大数据离线和实时开发治理产品的后端技术体系以及软件架构; 2、充分利用微服务、容器化等技术构建高可用、高扩展和低耦合高内聚的数据中台服务; 3、了解业界相关技术体系,为快手数据产品研发引入创造性的技术方案,解决面临的各种复杂问题和挑战。
更新于 2024-10-10北京