字节跳动Java 开发专家-数据研发方向
社招全职A224433地点:上海状态:招聘
任职要求
1、精通 Java 语言,包括 JVM、多线程、高并发、网络等; 2、熟悉 Spring 等开源框架; 3、熟悉分布式、缓存、消息等原理; 4、扎实的计算机基础,对技术有热情,愿意不断尝试新技术和业务挑战; 5、熟悉 Hadoop ecosystem 常用开源框架者优先,例如 Flink/Spark 等; 6、熟悉 AI 等前沿技术者优先; 7、需具备英文沟通能力。
工作职责
1、主导业界领先的大数据研发平台的设计与研发,包括智能数据开发 & 运维平台、高性能分布式调度系统等; 2、深入理解业务场景,设计合理架构,支持包括抖音、头条等在内的全球业务; 3、探索大数据研发前沿方向,参与需求、产品讨论,确定中长期演进方向。
包括英文材料
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
JVM+
https://www.freecodecamp.org/news/jvm-tutorial-java-virtual-machine-architecture-explained-for-beginners/
https://www.youtube.com/watch?v=e2zmmkc5xI0
多线程+
https://liaoxuefeng.com/books/java/threading/basic/index.html
和单线程相比,多线程编程的特点在于:多线程经常需要读写共享数据,并且需要同步。
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
高并发+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
Spring+
https://liaoxuefeng.com/books/java/spring/index.html
Spring是一个支持快速开发Java EE应用程序的框架。它提供了一系列底层容器和基础设施,并可以和大量常用的开源框架无缝集成,可以说是开发Java EE应用程序的必备。
https://spring.io/guides/gs/rest-service
https://spring.io/quickstart
Level up your Java code and explore what Spring can do for you.
缓存+
https://hackernoon.com/the-system-design-cheat-sheet-cache
The cache is a layer that stores a subset of data, typically the most frequently accessed or essential information, in a location quicker to access than its primary storage location.
https://www.youtube.com/watch?v=bP4BeUjNkXc
Caching strategies, Distributed Caching, Eviction Policies, Write-Through Cache and Least Recently Used (LRU) cache are all important terms when it comes to designing an efficient system with a caching layer.
https://www.youtube.com/watch?v=dGAgxozNWFE
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
相关职位
社招A195773
1、参与字节跳动大数据研发治理平台设计与研发,面向ToB场景主导高可扩展技术架构设计及落地; 2、深入理解客户需求,设计合理技术方案,支持ToB客户需求实现及交付; 3、结合大模型、主动元数据等新技术,参与智能数据工程领域产品技术创新。
更新于 2024-12-25
社招I8374
1、负责字节跳动分布式数据治理平台的架构设计与产品研发工作,包括治理评估体系、框架引擎、数据服务能力及自动化工具能力的建设等; 2、深入理解业务场景,与业务部门深度合作,设计解决方案并落地,推动数据治理平台能力提升和业务价值落地; 3、深入分析用户需求和产品优化方向,打磨产品细节,持续提升服务质量和用户体验; 4、追求极致,探索数据治理前沿方向,构建业内领先的数据治理产品。
更新于 2023-03-28
社招3-5年基础后端
1、负责数据治理平台的架构设计与产品研发工作,包括治理评估体系、框架引擎、数据服务能力及自动化工具能力的建设等; 2、探索大模型技术在智能运维领域中的落地应用,提供基于大模型的智能诊断、根因定位、异常检测、故障自愈等运维能力; 3、深入分析用户需求和产品优化方向,打磨产品细节,持续提升服务质量和用户体验,以系统化、智能化能力高效支撑成本及稳定性治理; 4、追求极致,探索数据治理前沿方向,持续跟进大模型前沿技术、开源方案及其在智能运维领域的应用,构建业内领先的数据治理产品。
更新于 2025-08-22