小红书【2026校招】Java开发工程师-数据平台
校招全职后端开发地点:上海 | 杭州状态:招聘
任职要求
1、本科及以上学历,计算机、软件工程等相关专业优先; 2、熟练掌握Java编程语言,熟悉SQL及Hive优化; 3、有高可用系统的设计经验和能力,具备高并发、海量数据的处理能力; 4、深入理解Hadoop生态组件(HD…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1、负责生产平台Dataverse 从Data For BI 数据平台升级为Data For AI+BI 数据平台,包括打造Notebook,个人开发环境,支持代码类任务(PySpark、Scala Spark、UDF、Ray、Python等)的高效开发和调试; 2、负责生产平台Dataverse Data+AI 数据血缘的建设,从在线、近线、到离线,覆盖算法链路特征、索引、模型、词表、样本等血缘链路的建设,支持算法全链路排障、内容理解和数据治理; 3、打造DataEngineer Agent、DataScience Agent,辅助数据开发工程师、数据科学家完成日常的数据处理、分析、建模的工作; 4、负责生产平台Dataverse 日常需求迭代,稳定性保障和问题排查等工作,具体模块包括数据同步、任务开发、数据测试、数据发布、任务运维及调度系统。
包括英文材料
学历+
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
高可用+
https://redis.io/blog/high-availability-architecture/
A high available architecture is when there are a number of different components, modules, or services that work together to maintain optimal performance, irrespective of peak-time loads.
https://www.ibm.com/think/topics/high-availability
High availability (HA) is a term that refers to a system’s ability to be accessible and reliable close to 100% of the time.
高并发+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
还有更多 •••
相关职位

校招
1. 参与搜索/推荐/用户增长相关的业务迭代,负责对应的技术架构设计,并完成高质量的代码实现和单元测试; 2. 参与需求评审,并对产品方案提出自己的想法和建议; 3. 对在线系统进行极致的性能优化,解决各类潜在系统技术风险,保证系统的安全、稳定、快速运行。
更新于 2025-08-07广州
校招后端开发
1、参与公司电商广告业务的后端研发,包括不限于电商、广告、本地生活等业务场景; 2、通过代码复用、工程/架构升级等方式持续性提升个人以及团队的研发效率; 3、关注线上产品的体验和质量,优化产品性能和交互,为用户提供顺畅购买体验的产品链路。
更新于 2025-09-06上海|杭州|北京
校招
1、参与软件项目的架构设计、详细设计、开发测试工作,严格把控代码质量,确保系统稳定性及可扩展性; 2、参与系统问题定位与分析工作,快速定位和解决系统运行中的问题,优化问题解决流程; 3、遵循 RESTful API 设计规范完成接口设计开发工作,确保数据交互的高效性和安全性,实现系统高效集成。
更新于 2025-06-22广州