Alibaba Data Technology and Products Department - AI Data R&D Engineer - Data Common Layer Construction / Data Semantic Layer Construction
Experienced hire | Full-time | 2+ years of experience | Technical (Data) | Location: Hangzhou | Status: Hiring
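Requirements
1. Solid technical foundation: proficient in one or more of the data-processing languages Java, Python, and SQL, with good programming habits and an engineering mindset;
2. Data processing and platform experience: familiar with mainstream big data technology stacks (e.g., Hadoop, Spark, Flink, Paimon, MaxCompute); experience with real-time/offline ETL development, lakehouse construction, or unified stream-batch projects is preferred;
3. Strong data-asset awareness: deep understanding of, or hands-on experience with, core data governance areas such as metadata management, data lineage, data standards, and quality rules; able to productize and automate governance capabilities;
…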
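Responsibilities
1. Lead the architecture design and delivery of a new Agent-oriented data semantic layer, driving the evolution of data assets from data tables toward a consumption model that agents can understand, invoke, and reason over;
2. Own end-to-end data modeling design and development delivery, building and serving business models in a unified way on a unified stream-batch architecture (Flink + Spark + Paimon);
3. Own data quality governance and pipeline stability, establishing end-to-end monitoring, alerting, and lineage tracking to ensure that key data tasks meet SLA requirements.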
Includes English-language materials
Java
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest version of Python 3.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
SQL
https://liaoxuefeng.com/books/sql/introduction/index.html
What is SQL? Simply put, SQL is the standard computer language for accessing and processing relational databases.
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Coding Standards
[English] Google Style Guides
https://google.github.io/styleguide/
Every major open-source project has its own style guide: a set of conventions (sometimes arbitrary) about how to write code for that project. It is much easier to understand a large codebase when all the code in it is in a consistent style.
Big Data
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Hadoop
https://www.runoob.com/w3cnote/hadoop-tutorial.html
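Hadoop provides reliable, scalable application-layer computing and storage support for large computer clusters. It allows large data sets to be processed in a distributed manner across clusters of computers using simple programming models, and it scales from a single machine to several thousand machines.
[English] Hadoop Tutorial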
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework for storing and processing big data in a distributed environment across clusters of computers using simple programming models.