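SHEIN Senior/Staff Big Data Development Engineer, Shanghai (CRM)
Experienced hire | Full-time | Information Technology | Location: Shanghai | Status: Hiring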
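Requirements
1. Familiar with big data frameworks such as Presto, Hadoop, Spark, and Flink, with experience in large-scale data processing.
2. Optimize data storage solutions (e.g., HBase, Hive, Kafka) to improve data query and access efficiency.
3. Familiar with data warehouse implementation methodologies, and …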
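Responsibilities
1. Develop real-time and offline big data processing pipelines to meet the platform's business data needs.
2. Tune the performance of big data services to keep clusters running efficiently and smoothly, improving system stability and scalability.
3. Continuously upgrade the compute and storage architecture to better support business growth.
Work…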
Learning resources (includes English-language materials)
Presto
[English] What is Presto?
https://prestodb.io/what-is-presto/
https://www.tutorialspoint.com/apache_presto/index.htm
Hadoop
https://www.runoob.com/w3cnote/hadoop-tutorial.html
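Hadoop provides reliable, scalable application-layer computing and storage for very large computer clusters. It enables distributed processing of large data sets across clusters of computers using a simple programming model, and it scales from a single machine to several thousand machines.
[English] Hadoop Tutorial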
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework for storing and processing big data in a distributed environment across clusters of computers using simple programming models.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Big Data
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
HBase
[English] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model similar to Google's Bigtable, designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with the HBase shell.
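Related Positions
Experienced hire | Technical team | Development
1. Big data visualization configuration platform development: Design and develop the big data visualization configuration platform and independently implement its core functional modules. Optimize SQL queries for big data scenarios to improve query and processing performance. Design and implement high-performance, highly available Java backend services that keep the visualization platform running stably.
2. Technical research and innovation: Track the latest trends in big data and visualization technology and continuously improve platform features. Research and introduce new technologies to improve system performance, scalability, and user experience. Solve difficult technical problems and drive the application of technical innovation in real projects.
3. Teamwork and mentoring: Work closely with product managers, front-end developers, data analysts, and other team members to keep projects moving efficiently. Take part in discussing and designing technical solutions, offer feasible suggestions, and drive them to delivery. Share technical experience and help team members grow together.
Updated 2025-03-11 | Shanghai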