美团高级数据开发工程师
社招全职5年以上软硬件服务-充电宝业务部地点:北京状态:招聘
任职要求
1、本科及以上学历,计算机相关专业,5年以上开发工作经验,3年以上数据系统开发与设计经验。 2、熟悉数据仓库建模、ETL 设计开发,有数据质量与数据治理相关经验。 3、具备实际的大数据业务开发经验,熟练使用Hadoop、Hive、Spark、Flink、Doris、Kafka等大数据离线和实时计算相关框架,并深入知晓原理。 4、设计并实现过海量数据分析系统,熟悉微服务架构,熟悉数据挖掘与分布式计算。 5、精通SQL,Java、Python两种及以上,对数据结构和算法设计有较为深刻的理解。 6、扎实的算法基础,熟悉常用机器学习模型及原理,具备较好建模调优能力。 7、优秀的沟通表达能力和团队协作能力,良好的分析问题和解决问题的能力。 具备以下条件优先 1、熟悉机器学习、数据挖掘;有数据分析预测经验者; 2、在大型互联网公司有海量数据分析经验者优先 ; 3、IoT相关产业数据处理经验者优先,熟悉硬件数据分析从业者优先。
工作职责
1、基于美团的数据平台进行离线和实时数据仓库建设,数据分析以及预测。 2、梳理业务系统数据,进行数据模型设计和开发,产出支持业务分析的基础数据,保障数据的准确性、易用性、及时性。 3、负责业务的数据需求、数据报表、OLAP开发以及临时数据提取的开发任务 4、参与技术决策和技术选型,制定流程规范,完善数据质量监控和数据治理。 5、针对海量IoT数据进行数据处理和模型训练,提升健康运维的效率。
包括英文材料
学历+
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
数据治理+
https://www.ibm.com/think/topics/data-governance
Data governance is the data management discipline that focuses on the quality, security and availability of an organization’s data.
https://www.youtube.com/watch?v=uPsUjKLHLAg
Building data fabric eliminates the technological complexities of data governance so users can connect to the right data at the right time, regardless of where it resides.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Doris+
https://doris.apache.org/docs/gettingStarted/what-is-apache-doris
Kafka+
https://developer.confluent.io/what-is-apache-kafka/
https://www.youtube.com/watch?v=CU44hKLMg7k
https://www.youtube.com/watch?v=j4bqyAMMb7o&list=PLa7VYi0yPIH0KbnJQcMv5N9iW8HkZHztH
In this Apache Kafka fundamentals course, we introduce you to the basic Apache Kafka elements and APIs, as well as the broader Kafka ecosystem.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
微服务+
https://learn.microsoft.com/en-us/training/modules/dotnet-microservices/
Microservice applications are composed of small, independently versioned, and scalable customer-focused services that communicate with each other by using standard protocols and well-defined interfaces.
https://microservices.io/
Microservices - also known as the microservice architecture - is an architectural style that structures an application as a collection of two or more services.
https://spring.io/microservices
Building small, self-contained, ready to run applications can bring great flexibility and added resilience to your code.
https://www.ibm.com/think/topics/microservices
Microservices, or microservices architecture, is a cloud-native architectural approach in which a single application is composed of many loosely coupled and independently deployable smaller components or services.
https://www.youtube.com/watch?v=CqCDOosvZIk
https://www.youtube.com/watch?v=hmkF77F9TLw
Learn about software system design and microservices.
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
数据分析+
[英文] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step by step guide to becoming an Data Analyst in 2025
相关职位
社招1-3年网易有道
1. 参与升学中心数据仓库设计与研发,完成数据建模的设计和开发以及数据监控,性能优化等相关技术工作 2. 结合升学中心业务特点,进行指标/标签体系的搭建 3. 参与数仓研发质量保障体系的完善和实施,打造稳定可靠的数据服务和保障体系 4. 调研和跟进大数据技术发展趋势进行相关数据方案的探索落地 5. 编写和维护数仓文档
更新于 2025-04-03
社招技术类
1、负责公司内视频云业务数据的开发和维护,为点直播业务与视频云研发团队提供快速、准确、灵活的数据仓库支持; 2、深入理解业务逻辑,完成数据模型设计及优化工作; 3、完成海量数据的获取、清洗、分类、整合等数据处理工作; 4、设计并实现对BI分析及报表展现、数据产品开发; 5、独立完成数据问题的排查与处理,解决数据质量与性能问题;
更新于 2025-02-13
社招3-5年网易游戏(互娱)
1、负责建设中台数据仓库架构,包括元数据管理、ETL调度、数据集成、OLAP等子系统的设计和开发; 2、制定和推广数据字典,建立完善的元数据管理规范,负责数据质量监控和数据资产管理; 3、搭建和维护中台数据仓库表,解决业务人员在仓库系统流程、工具使用、数据处理等建到的问题; 4、深入了解网易游戏、藏宝阁、网易大神等业务,负责数据仓库和其它业务系统接口; 5、基于对数据的理解和业务需求,对数据进行整理、分析和用户画像搭建。
更新于 2025-08-04