虎鲸文娱优酷-数据研发工程师-北京/杭州
社招全职2年以上地点:北京 | 杭州状态:招聘
任职要求
1. 本科及以上,计算机相关专业,3 年以上数据研发经验; 2. 熟练掌握 Hadoop、Hive、Spark、ODPS 等大数据框架,SQL 扎实,有 Java/Python 经验; 3. 有大型数仓建设经验…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 负责优酷数据平台的架构设计和落地,建设面向多业态的数据中台; 2. 负责数据资产建设,推进数据标准化和治理,提升数据使用效率; 3. 负责各业务域的数据模型搭建和场景应用,并持续优化沉淀; 4. 用好 AI 工具提升研发效率,构建基于大模型的数据应用,赋能业务。
包括英文材料
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
还有更多 •••