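NetEase Senior Data Development Engineer
Experienced hire · Full-time · 1-3 years · NetEase Youdao · Location: Beijing · Status: Hiring
Requirements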
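1. Bachelor's degree or above in computer science or a related field, with at least 1-3 years of experience in data warehousing and ETL
2. Proficient in SQL, Shell, and related technologies, with experience in large-scale data processing, ETL and job-scheduling optimization, and data warehouse modeling
3. Familiarity with big data technologies such as Spark, Flink, Hadoop, Hive, Kafka, TiDB, and data lakes is preferred
4. Command of data governance methods, with experience in data standards, data quality, and data security
5. Proficient in Java or Scala
6. Clear logical thinking, sensitivity to data, good business understanding, and strong verbal communication skills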
Responsibilities
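1. Take part in the design and development of the data warehouse for the Academic Advancement Center (升学中心), including data model design and development, data monitoring, and performance optimization
2. Build metric and tag systems tailored to the business characteristics of the Academic Advancement Center
3. Help improve and implement the quality assurance system for data warehouse development, building stable and reliable data services and safeguards
4. Research and track trends in big data technology, and explore and implement related data solutions
5. Write and maintain data warehouse documentation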
Includes English-language materials
Education+
Data warehouse+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
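The extract, transform, load steps described above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not a production pipeline: the two CSV strings, the `fact_orders` table, and the helper names `extract`, `transform`, `load`, and `run_etl` are all hypothetical, and an in-memory SQLite database stands in for the data warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical raw inputs standing in for two separate source systems.
ORDERS_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,
3,alice,4.25
"""
REFUNDS_CSV = """order_id,customer,amount
4,bob,-3.00
"""


def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: trim and lowercase names, drop rows with no amount, cast types."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # skip incomplete records
        cleaned.append(
            (int(row["order_id"]), row["customer"].strip().lower(), float(amount))
        )
    return cleaned


def load(conn, records):
    """Load: write the cleaned records into one consistent target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fact_orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO fact_orders VALUES (?, ?, ?)", records)
    conn.commit()


def run_etl():
    """Run all three steps for every source and return the target connection."""
    conn = sqlite3.connect(":memory:")  # stand-in for a real warehouse
    for source in (ORDERS_CSV, REFUNDS_CSV):
        load(conn, transform(extract(source)))
    return conn
```

Calling `run_etl()` and then querying `fact_orders` returns the three complete rows from both sources, with customer names normalized to lowercase and the row with the missing amount dropped.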
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
What is SQL? Simply put, SQL is the standard computer language for accessing and manipulating relational databases.
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
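As a small illustration of what "accessing and manipulating a relational database" looks like in practice, the sketch below runs a few SQL statements through Python's built-in `sqlite3` module. The `students` table, its columns, the sample rows, and the helper names are assumptions made up for this example; they do not come from any of the tutorials listed here.

```python
import sqlite3


def build_sample_db():
    """Create an in-memory database with a small, hypothetical students table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE students "
        "(id INTEGER PRIMARY KEY, name TEXT, class_id INTEGER, score INTEGER)"
    )
    conn.executemany(
        "INSERT INTO students (name, class_id, score) VALUES (?, ?, ?)",
        [("Ming", 1, 90), ("Hong", 1, 80), ("Jun", 2, 70)],
    )
    conn.commit()
    return conn


def average_score_by_class(conn):
    """Aggregate with GROUP BY: one (class_id, average score) row per class."""
    return conn.execute(
        "SELECT class_id, AVG(score) FROM students "
        "GROUP BY class_id ORDER BY class_id"
    ).fetchall()


def names_scoring_at_least(conn, threshold):
    """Filter with WHERE, using a bound parameter instead of string formatting."""
    rows = conn.execute(
        "SELECT name FROM students WHERE score >= ? ORDER BY name", (threshold,)
    ).fetchall()
    return [name for (name,) in rows]
```

The same `SELECT ... GROUP BY` pattern carries over to warehouse engines such as Hive or Spark SQL, although dialect details differ.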
Bash+
[English] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Spark+
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Object-oriented programming+
https://liaoxuefeng.com/books/java/oop/index.html
Object-Oriented Programming, abbreviated as OOP.
https://liaoxuefeng.com/books/python/oop/index.html
Object-Oriented Programming, abbreviated as OOP, is a way of thinking about program design.
https://www.youtube.com/watch?v=SiBw7os-_zI
Learn the basics of object-oriented programming all in one video.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
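Hadoop provides reliable, scalable application-layer computing and storage support for very large computer clusters. It allows large data sets to be processed in a distributed manner across clusters of computers using simple programming models, and it scales from a single machine to several thousand machines.
[English] Hadoop Tutorial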
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows big data to be stored and processed in a distributed environment across clusters of computers using simple programming models.
Hive+
[English] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Scala+
Related positions
Experienced hire · Technical
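1. Develop and maintain data for the company's video cloud business, providing fast, accurate, and flexible data warehouse support to the VOD and live-streaming business and the video cloud R&D team
2. Gain a deep understanding of the business logic, and design and optimize data models
3. Acquire, clean, classify, and integrate large volumes of data
4. Design and implement BI analysis, report presentation, and data product development
5. Independently investigate and handle data issues, resolving data quality and performance problems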
Updated 2025-02-13
Experienced hire · 3-5 years · NetEase Games (Interactive Entertainment)
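1. Build the middle-platform data warehouse architecture, including the design and development of subsystems for metadata management, ETL scheduling, data integration, and OLAP
2. Define and promote the data dictionary, establish sound metadata management standards, and take responsibility for data quality monitoring and data asset management
3. Build and maintain middle-platform data warehouse tables, and resolve problems that business staff encounter with warehouse system workflows, tool usage, and data processing
4. Gain a deep understanding of businesses such as NetEase Games, 藏宝阁 (Cangbaoge), and 网易大神 (NetEase Dashen), and own the interfaces between the data warehouse and other business systems
5. Based on an understanding of the data and of business requirements, organize and analyze data and build user profiles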
Updated 2025-08-04
Experienced hire · 5+ years · Software and hardware services - Charging
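1. Build offline and real-time data warehouses on Meituan's data platform, and carry out data analysis and forecasting
2. Map out business system data, design and develop data models, and produce the foundational data that supports business analysis, ensuring the data is accurate, easy to use, and timely
3. Handle development tasks for business data requests, data reports, OLAP, and ad hoc data extraction
4. Take part in technical decisions and technology selection, define process standards, and improve data quality monitoring and data governance
5. Process large volumes of IoT data and train models on it to improve the efficiency of health operations and maintenance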
Updated 2025-06-20