
Weidian (微店) Data Warehouse (2026 Campus Recruitment) (J10981)
Campus recruitment | Full-time | Location: Hangzhou | Status: Hiring
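Requirements
1. Work experience: Bachelor's degree or above in computer science, mathematics, statistics, or a related field, with 3+ years of development experience in data warehousing or BI.
2. Strong SQL skills: Proficient in complex SQL programming, with performance-tuning experience on massive datasets; familiar with at least one big data compute engine (Hive/Spark SQL/ClickHouse).
3. Data modeling: Solid grasp of data warehouse theory (Kimball dimensional modeling), familiar with star/snowflake schemas, and hands-on experience building layered data architectures (ODS/DWD/DWS/ADS).
4. Toolchain: Familiar with…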
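Responsibilities
1. Data system development: Design and develop data warehouse models for the company's core businesses (e.g., user growth, e-commerce transactions, ad monetization), building a stable and efficient underlying data architecture.
2. ETL and data governance: Clean, process, and aggregate complex business data; resolve ETL performance issues such as data skew and job tuning; and ensure the data pipelines meet their SLA.
3. Data content development: Turn warehouse model outputs into directly usable analysis tables (e.g., wide user-behavior tables, financial summary tables), and run data exploration (ad-hoc queries) based on business needs to support operations strategy with data.
4. Data productization: Work with front-end or BI developers to design underlying data models that support agile development of visual reports (e.g., Tableau/FineBI), ensuring charts are displayed efficiently and accurately.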
Includes English-language materials
Education
Data warehouse
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
SQL
https://liaoxuefeng.com/books/sql/introduction/index.html
What is SQL? Simply put, SQL is the standard computer language for accessing and processing relational databases.
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Performance tuning
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
Big data
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hive
[English] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.