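Cainiao - Data R&D Engineer - Hangzhou
Experienced hire | Full-time | 1+ years of experience | Technology - Data | Location: Hangzhou | Status: Hiring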
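Qualifications
Job requirements:
1. At least 1 year of experience in data warehousing; familiar with data warehouse modeling methodologies, with hands-on experience in model design and ETL development; strong practical and learning ability.
2. Familiar with at least one data processing language, such as SQL, Java, Python, or Perl; comfortable working on Unix or Linux.
3. Solid technical fundamentals, good communication and teamwork skills; proactive and willing to take on challenges.
4. Proficient in large-scale database development and in at least one relational database such as Oracle, Teradata, DB2, or MySQL; able to apply SQL flexibly for ETL processing of massive data volumes.
5. Familiar with data warehousing knowledge and management skills, including but not limited to metadata management, data quality, and performance tuning.
6. Understanding of, or experience with, real-time data processing technologies such as Flink, Storm, or Spark Streaming is a plus.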
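Responsibilities
Job description:
1. Collect, store, and process Cainiao's global supply chain big data; process data on a distributed big data platform to support business management decisions.
2. Take part in the model design, development, and maintenance of Cainiao's global supply chain big data system; manage and organize EB-scale data effectively through metadata and data quality systems.
3. Take part in the R&D of Cainiao's global supply chain big data products; use data analysis and algorithms to uncover the business opportunities behind the data and explore the commercialization of big data.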
Includes English-language materials
Data Warehouse+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
ETL+
https://www.ibm.com/think/topics/etl
ETL—meaning extract, transform, load—is a data integration process that combines, cleans and organizes data from multiple sources into a single, consistent data set for storage in a data warehouse, data lake or other target system.
https://www.youtube.com/watch?v=OW5OgsLpDCQ
It explains what ETL is and what it can do for you to improve your data analysis and productivity.
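As a rough illustration of the extract, transform, load steps described in the ETL definition above, here is a minimal sketch in Python using only the standard library. The two CSV sources, the `fact_orders` table, and every column name are hypothetical and chosen purely for this example; a production pipeline on a distributed platform would look quite different.

```python
import csv
import io
import sqlite3

# Hypothetical raw inputs standing in for two separate source systems.
ORDERS_CSV = """order_id,city,amount
1, Hangzhou ,120.50
2,shanghai,80
3,Hangzhou,not_a_number
"""

REFUNDS_CSV = """order_id,city,amount
4,HANGZHOU,-20.5
"""


def extract(text):
    """Extract: parse one CSV source into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: normalize city names, cast amounts, drop unparseable rows."""
    cleaned = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount is not numeric
        city = row["city"].strip().title()
        cleaned.append((int(row["order_id"]), city, amount))
    return cleaned


def load(conn, rows):
    """Load: write the cleaned rows into a single target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fact_orders "
        "(order_id INTEGER PRIMARY KEY, city TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO fact_orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl():
    """Run extract -> transform -> load for every source into one database."""
    conn = sqlite3.connect(":memory:")
    for source in (ORDERS_CSV, REFUNDS_CSV):
        load(conn, transform(extract(source)))
    return conn


conn = run_etl()
totals = conn.execute(
    "SELECT city, SUM(amount) FROM fact_orders GROUP BY city ORDER BY city"
).fetchall()
print(totals)
```

Both sources end up in one consistent table, so the final query can aggregate across them even though the inputs spelled the city name differently.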
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
What is SQL? Put simply, SQL is the standard computer language for accessing and working with relational databases.
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, for absolute beginners, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Perl+
https://www.perl.org/learn.html
Useful links if you are interested in learning Perl
https://www.runoob.com/perl/perl-tutorial.html
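This tutorial is aimed at developers who want to learn the Perl programming language from scratch. It also goes into some modules in depth to give you a better understanding of how Perl is used.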
Unix+
[English] The UNIX® Standard
https://www.opengroup.org/membership/forums/platform/unix
https://www.youtube.com/watch?v=IrDUcdpPmdI
UNIX is an operating system which was first developed in the 1970s, and has been under constant development ever since.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Oracle+
[English] Oracle Tutorial
https://www.oracletutorial.com/
On this website, you can learn Oracle Database fast and easily.
https://www.youtube.com/watch?v=QHYuuXPdQNM&list=PL_c9BZzLwBRJ8f9-pSPbxSSG6lNgxQ4m9
Teradata+
https://www.youtube.com/watch?v=2XtG8TKpV7w&list=PLMrTtbMO6mv_WKUILqw17BvvC_RAZLoeh
MySQL+
https://juejin.cn/post/7190306988939542585
A hands-on MySQL learning roadmap covering database fundamentals, SQL statement usage, database constraints, database design, and more.
[English] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
Performance Tuning+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Apache Storm+
[English] Tutorial
https://storm.apache.org/releases/2.6.0/Tutorial.html
In this tutorial, you'll learn how to create Storm topologies and deploy them to a Storm cluster.
https://www.baeldung.com/apache-storm
This tutorial will be an introduction to Apache Storm, a distributed real-time computation system.
Spark+
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
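Related positions
Experienced hire | Technology - Data
1. Build Ant Group's international data system, using data, algorithms, and engineering capabilities to process and extract data features that support higher-level data operations decisions; 2. Take part in planning and building big data infrastructure and product technology, including data compliance, data assets, data products, and data quality and stability assurance systems.
Updated 2025-06-30
Experienced hire | 2+ years of experience | Technology - Data
1) Tackle business problems directly: define data solutions for risk management domains (privacy risk, legal risk, institutional risk, liquidity risk, market risk, internal control and operational risk, and so on), build data assets, and work with product and engineering teams to deliver product and technical capabilities that make the risk business more digital and intelligent. 2) Proactively drive the continuous iteration of security and compliance technology and product platforms, and lead the rollout and adoption of these capabilities on the business side, so that Ant's business data flows securely, compliantly, and efficiently.
Updated 2025-04-10
Experienced hire
1. Plan and build the data system for core business domains, efficiently supporting the data needs of business scenarios through data products and data services. 2. Develop a deep understanding of the business; analyze business strategies and pain points to define and deliver systematic end-to-end data solutions. 3. Own data asset development and data quality and stability management; build a shared, interconnected data platform so that data standards are more consistent and data access is more efficient.
Updated 2025-05-23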