顺丰YWS-大数据仓库开发工程师
社招全职3-5年地点:深圳状态:招聘
任职要求
1、计算机或相关专业,本科及以上学历; 2、熟练掌握企业级数据仓库体系架构,数据仓库模型、分层体系构建、元数据管理、数据质量监控等,具备扎实数据仓库建模理论知识; 3、精通SQL语言,如HiveSQL、SprakSQL、SQL等(HiveSQL为必须项),且有丰富的HiveSQL、SprakSQL性能调优经验; 4、熟悉Hadoop/Hive/Spar…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1、负责公司企业级大数据数据仓库的建设,包含且不限于数据接入、数据建模和数据治理等工作内容。
2、负责公司人财物数据产品的落地建设,包括以下要求:
①、分析、挖掘和引导用户需求,针对业务场景进行数据建模
②、输出可落地、可复用、可推广的数据解决方案,以数据驱动业务,助力公司降本增效战略。包括英文材料
学历+
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
性能调优+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
ElasticSearch+
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
还有更多 •••
相关职位
社招3-5年
1. 根据业务或数据产品经理的需求实现对业务指标数据的实时采集、清洗、计算,以及数据采集等支持业务的决策; 2. 不断完善和创新实时数仓架构,开发通用业务数据服务,负责系统性能优化,技术难题攻关; 3. 按时保质的完成工作,配合产品、测试、前端完成系统交付和目标达成
更新于 2025-06-05深圳
社招3-5年
1、人资、财务、采购综合相关的业数底盘模型、报表研发 2、人资、财务、采购综合数据的准确性时效性监控和优化 3、人资、财务、采购综合内部数据资产建设,存量表及新增表管理,以及日常的运维工作 4、人资、财务、采购综合相关数据分析、数据产品、数据变现等分析型工作
更新于 2025-09-03深圳
社招3-5年
1、采购综合线相关的业数底盘模型、报表研发 2、采购综合数据的准确性时效性监控和优化 3、负责采购综合内部数据资产建设,存量表及新增表管理,以及日常的运维工作;
更新于 2025-08-25深圳