字节跳动大数据开发工程师-交易中台
社招全职2年以上A05424地点:上海状态:招聘
任职要求
1、计算机相关专业本科以上学历,掌握计算机组成原理、计算机系统结构、计算机网络和操作系统等计算机基础理论; 2、对Hadoop生态圈的相关技术,比如MapReduce、Spark、Hive、Flink的原理掌握并且有实战经验; 3、熟练掌握SQL, 可以使用SQL进行复杂的分析, 并给予执行计划掌握并对SQL进行调优处理,熟练使用Java、Python和Shell等编程语言进行数据加工处理; 4、从事数据仓库领域2年以上,熟悉数据仓库模型设计、ETL研发,有海量数据处理经验; 5、掌握数据分层架构和维度建模方法, 了解数据指标体系建设和数据分析方法; 6、善于沟通,工作积极主动,责任心强,具备良好的团队协作能力。 额外加分项: 1、Github等开源社区贡献者; 2、具有数据挖掘和统计学相关背景; 3、具备大规模分布式服务设计能力和经验。
工作职责
1、广告各类在线业务的离线和实时数据体系建设,完成数据主题域的模型设计、实现和维护; 2、数据服务接口及产品需求研发迭代,代码回查、BUG修复及日常服务运维; 3、针对海量数据处理和查询需求,设计适应业务变化的合理的数据体系,满足多样性的需求。
包括英文材料
学历+
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
MapReduce+
https://www.youtube.com/watch?v=bcjSe0xCHbE
https://www.youtube.com/watch?v=cHGaQz0E7AU
In this video I explain the basics of Map Reduce model, an important concept for any software engineer to be aware of.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
数据分析+
[英文] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step by step guide to becoming an Data Analyst in 2025
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
相关职位
社招A139602
1、负责和参与字节跳动本地生活交易系统相关业务的设计开发与维护工作,持续为平台、商家、服务商、达人以及服务商等角色提供稳定、高效、极致体验的交易系统; 2、负责购物车、提下单、支付、订单查询与管理、售后退款、审核仲裁等交易正逆向核心交易系统的架构设计与需求研发,高效率支持酒旅、到家以及到店等业务发展; 3、负责交易中台的系统研发和架构设计,优化交易系统的稳定性、可扩展性以及安全性,持续提升亿级订单规模下交易系统的高可用、高扩展以及高性能,持续提升研发效率。
更新于 2023-12-22
社招5-7年技术
1.负责滴滴国际化外卖业务数据需求开发和相应业务数据中台基建工作; 2.参与部门数据治理体系的规范化落地,保障数据稳定性; 3.参与部门新技术探索和架构升级方案制定及落地;
更新于 2025-09-12
社招S1822
1、负责字节跳动(抖音、火山、今日头条、西瓜视频等APP)电商交易中台、订单中台相关业务的设计开发及维护工作,为用户、商家提供优质的服务体验; 2、负责购物车、提单&下单、订单查询、订单管理、订单中心、交易流程等核心交易系统架构设计和系统研发,持续进行系统架构优化,高效率的支持业务发展; 3、负责交易中台的系统研发和架构设计,保障技术先进性,优化核心交易系统的稳定性、可扩展性、安全性,持续提升亿级订单规模下,交易系统的可用性与性能及研发效率。
更新于 2022-06-07