字节跳动OLAP引擎专家/架构师/负责人-向量化引擎
社招全职O3505地点:上海状态:招聘
任职要求
1、良好的 C++ 编程基础; 2、熟悉主流的 OLAP 引擎的优化原理,向量化执行、SIMD、列式存储; 3、熟悉 Teradata、Oracle、TiDB、MySQL、OceanBase 等数据库内核优先(不要求熟悉所有技术栈); 4、熟悉 Spark、Presto、Druid、Kylin、Hive、Impala 等主流大数据系统原理及源码; 5、具备大规模系统的故障诊断与性能优化能力。
工作职责
1、打造业界领先的 PB 级 OLAP 引擎,并支撑字节跳动相关产品线; 2、产品定位类似于业内的 Velox、Photon、OmniRuntime 等产品; 3、产品旨在提供一套统一的通过 C++ 实现的极致性能的向量化执行引擎; 4、该执行引擎将加速 Spark SQL 及 Presto 为代表的大数据 OLAP 引擎,并通过火山引擎公有云产品 LAS 对外提供服务。
包括英文材料
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
OLAP+
https://www.youtube.com/watch?v=iw-5kFzIdgY
OLAP (for online analytical processing) is software for performing multidimensional analysis at high speeds on large volumes of data from a data warehouse, data mart, or some other unified, centralized data store.
Teradata+
https://www.youtube.com/watch?v=2XtG8TKpV7w&list=PLMrTtbMO6mv_WKUILqw17BvvC_RAZLoeh
Oracle+
[英文] Oracle Tutorial
https://www.oracletutorial.com/
On this website, you can learn Oracle Database fast and easily.
https://www.youtube.com/watch?v=QHYuuXPdQNM&list=PL_c9BZzLwBRJ8f9-pSPbxSSG6lNgxQ4m9
TiDB+
MySQL+
https://juejin.cn/post/7190306988939542585
这是一篇 MySQL 通关一篇过硬核经验学习路线,包括数据库相关知识,SQL语句的使用,数据库约束,设计等。
[英文] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
OceanBase+
https://en.oceanbase.com/docs/common-oceanbase-database-10000000001028975
This topic describes how to deploy OceanBase Database in a demo environment, a cluster, or a container for quick hands-on experience.
内核+
https://www.youtube.com/watch?v=C43VxGZ_ugU
I rummage around the Linux kernel source and try to understand what makes computers do what they do.
https://www.youtube.com/watch?v=HNIg3TXfdX8&list=PLrGN1Qi7t67V-9uXzj4VSQCffntfvn42v
Learn how to develop your very own kernel from scratch in this programming series!
https://www.youtube.com/watch?v=JDfo2Lc7iLU
Denshi goes over a simple explanation of what computer kernels are and how they work, alonside what makes the Linux kernel any special.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Presto+
[英文] What is Presto?
https://prestodb.io/what-is-presto/
https://www.tutorialspoint.com/apache_presto/index.htm
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Impala+
[英文] Impala Tutorials
https://impala.apache.org/docs/build/html/topics/impala_tutorial.html
This section includes tutorial scenarios that demonstrate how to begin using Impala.
相关职位
社招S1549
1、打造业界领先的 PB 级 OLAP 引擎,并支撑字节跳动相关产品线; 2、产品定位类似于业内的 Velox、Photon、OmniRuntime 等产品; 3、产品旨在提供一套统一的通过 C++ 实现的极致性能的向量化执行引擎; 4、该执行引擎将加速 Spark SQL 及 Presto 为代表的大数据 OLAP 引擎,并通过火山引擎公有云产品 LAS 对外提供服务。
更新于 2022-08-17
社招3年以上技术类-开发
1、负责蚂蚁钱包科技用户增长领域相关数据资产建设,数据化指引/数据治理工作;沉淀营销、流量等业务的优质数据资产,打造用户增长领域高质量、高复用的精品指标。 2、熟悉隐私安全法律法规,制定数据风险管理领域的解决方案。规划业务大图和主要流程、针对复杂的业务问题能做提炼和拆解,帮助数据安全、合规、高效流转。 3、帮助海外钱包建设指标体系和经营分析体系,提升海外钱包客户经营效率
更新于 2025-09-01
社招技术类-开发
1、消费信贷决策研发平台的设计和落地,综合利用算法&数据&工程能力,为风控策略专家提供自动化的研发、测算、分析工具,提高决策速度和精度。 2.、基于消费信贷的业务特点,设计和落地可用性高、易用性高的决策部署平台,包括但不限于流程编排、Agent开发、计算加速、耗时优化等。 3、与风控策略专家、算法科学家和数据工程师深度合作,基于强化学习、大模型、多模态数据融合等技术,构建下一代具备自主決策能力的风控智能決策引擎。 4、为亿级用户提供服务的高可用底座,满足金融级别的可用性和百万级吞吐的高并发要求。
更新于 2025-06-19