百度EDAP平台研发工程师(J81458)
社招全职3年以上ACG地点:北京状态:招聘
任职要求
-计算机及相关专业,从事大数据领域相关开发3年及以上 -有大数据处理系统设计等相关开发或优化经验 -熟悉HDFS/Hive/Spark/Flink等Hadoop生态技术 -熟悉JA…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
-负责大数据平台产品开发,包括数据采集、开发、分析、运维等平台化功能 -负责大数据平台技术迭代,优化平台架构,面向大规模高并发数据处理场景提升平台性能 -深入理解智能云生态,协同适配云上产品,支持业务场景,提升产品易用性和用户体验 -深入理解项目需求,支持私有化产品功能开发与版本迭代,确保项目顺利验收
包括英文材料
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
HDFS+
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.
https://www.ibm.com/cn-zh/think/topics/hdfs
Hadoop 分布式文件系统 (HDFS) 是一种管理大型数据集的文件系统,可在商用硬件上运行。
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
还有更多 •••
相关职位
暂无相关职位