小红书Java开发专家(数据治理平台)
社招全职3-5年基础后端地点:北京 | 上海 | 杭州状态:招聘
任职要求
1、扎实的计算机基础,熟练掌握Java语言和面向对象编程,熟悉JVM、类加载、多线程、IO、网络等基本原理; 2、熟练掌握Spring框架,熟练使用Redis、MySQL等基本组件,熟悉分布式、缓存、消息队列、微服务等架构机制; 3、熟悉大数据生态,包括Hadoop、Hive、Spark、Kafka、Starrocks、Flink等,对大数据组件有一定的开发经验并了解原理; 4、具备较强的业务和产品sense,具备良好的沟通和团队协作能力,能够主导问题分析、方案规划、推动落地的过程,积极主动,愿意不断尝试新技术和业务挑战; 5、有AIOPS相关建设经验者优先,尤其是稳定性方面的智能运维,有大型项目的落地经验,包括但不限于异常检测、日志检测、故障诊断、根因分析等
工作职责
1、负责数据治理平台的架构设计与产品研发工作,包括治理评估体系、框架引擎、数据服务能力及自动化工具能力的建设等; 2、探索大模型技术在智能运维领域中的落地应用,提供基于大模型的智能诊断、根因定位、异常检测、故障自愈等运维能力; 3、深入分析用户需求和产品优化方向,打磨产品细节,持续提升服务质量和用户体验,以系统化、智能化能力高效支撑成本及稳定性治理; 4、追求极致,探索数据治理前沿方向,持续跟进大模型前沿技术、开源方案及其在智能运维领域的应用,构建业内领先的数据治理产品。
包括英文材料
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
JVM+
https://www.freecodecamp.org/news/jvm-tutorial-java-virtual-machine-architecture-explained-for-beginners/
https://www.youtube.com/watch?v=e2zmmkc5xI0
多线程+
https://liaoxuefeng.com/books/java/threading/basic/index.html
和单线程相比,多线程编程的特点在于:多线程经常需要读写共享数据,并且需要同步。
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
Spring+
https://liaoxuefeng.com/books/java/spring/index.html
Spring是一个支持快速开发Java EE应用程序的框架。它提供了一系列底层容器和基础设施,并可以和大量常用的开源框架无缝集成,可以说是开发Java EE应用程序的必备。
https://spring.io/guides/gs/rest-service
https://spring.io/quickstart
Level up your Java code and explore what Spring can do for you.
Redis+
[英文] Developer Hub
https://redis.io/dev/
Get all the tutorials, learning paths, and more you need to start building—fast.
https://www.runoob.com/redis/redis-tutorial.html
REmote DIctionary Server(Redis) 是一个由 Salvatore Sanfilippo 写的 key-value 存储系统,是跨平台的非关系型数据库。
https://www.youtube.com/watch?v=jgpVdJB2sKQ
In this video I will be covering Redis in depth from how to install it, what commands you can use, all the way to how to use it in a real world project.
MySQL+
https://juejin.cn/post/7190306988939542585
这是一篇 MySQL 通关一篇过硬核经验学习路线,包括数据库相关知识,SQL语句的使用,数据库约束,设计等。
[英文] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
缓存+
https://hackernoon.com/the-system-design-cheat-sheet-cache
The cache is a layer that stores a subset of data, typically the most frequently accessed or essential information, in a location quicker to access than its primary storage location.
https://www.youtube.com/watch?v=bP4BeUjNkXc
Caching strategies, Distributed Caching, Eviction Policies, Write-Through Cache and Least Recently Used (LRU) cache are all important terms when it comes to designing an efficient system with a caching layer.
https://www.youtube.com/watch?v=dGAgxozNWFE
消息队列+
https://www.youtube.com/watch?v=xErwDaOc-Gs
微服务+
https://learn.microsoft.com/en-us/training/modules/dotnet-microservices/
Microservice applications are composed of small, independently versioned, and scalable customer-focused services that communicate with each other by using standard protocols and well-defined interfaces.
https://microservices.io/
Microservices - also known as the microservice architecture - is an architectural style that structures an application as a collection of two or more services.
https://spring.io/microservices
Building small, self-contained, ready to run applications can bring great flexibility and added resilience to your code.
https://www.ibm.com/think/topics/microservices
Microservices, or microservices architecture, is a cloud-native architectural approach in which a single application is composed of many loosely coupled and independently deployable smaller components or services.
https://www.youtube.com/watch?v=CqCDOosvZIk
https://www.youtube.com/watch?v=hmkF77F9TLw
Learn about software system design and microservices.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
StarRocks+
https://docs.starrocks.io/docs/quick_start/
These Quick Start guides will help you get going with a small StarRocks environment.
https://itnext.io/introduction-to-starrocks-a-new-modern-analytical-database-1db2177d26e1
Recently, I had the opportunity to explore StarRocks which is the new kid in the block when talking about massive scale databases which are able to handle petabytes of data.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
相关职位
社招I8374
1、负责字节跳动分布式数据治理平台的架构设计与产品研发工作,包括治理评估体系、框架引擎、数据服务能力及自动化工具能力的建设等; 2、深入理解业务场景,与业务部门深度合作,设计解决方案并落地,推动数据治理平台能力提升和业务价值落地; 3、深入分析用户需求和产品优化方向,打磨产品细节,持续提升服务质量和用户体验; 4、追求极致,探索数据治理前沿方向,构建业内领先的数据治理产品。
更新于 2023-03-28
社招A195773
1、参与字节跳动大数据研发治理平台设计与研发,面向ToB场景主导高可扩展技术架构设计及落地; 2、深入理解客户需求,设计合理技术方案,支持ToB客户需求实现及交付; 3、结合大模型、主动元数据等新技术,参与智能数据工程领域产品技术创新。
更新于 2024-12-25
社招3年以上技术类-开发
1、参与本地生活核心流量系统的架构设计、搭建和优化迭代,为平台的转化和用户增长提供强有力的支撑; 2、参与本地生活投放系统的业务迭代需求开发,并搭建对应的平台化和工具化能力; 3、参与运营及算法的核心业务能力的开发支持,构建大数据量、高性能、高稳定性的在线预估和排序等算法工程系统架构,构建高灵活性的运营系统。 4、参与本地生活投放和触点等业务数据治理及离线链路和平台的能力建设。
更新于 2025-10-10