希音spark组件专家
社招全职6年以上信息技术类地点:南京 | 上海 | 深圳状态:招聘
任职要求
1.至少6年以上相关经验,有扎实的计算机编程基础,精通java/scala,熟悉jvm的原理和调优。 2.精通spark/hive/flink组件原理和内核优化,有超大规模数据计算的架构设计和优化经验。 3.掌握大数据行业趋势,熟悉K…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1.大数据新技术规划、调研、选型及推广落地。 2.负责大数据组件内核开发优化,推进组件容器化,进行组件二次开发与适配等工作。 3.日常负责大数据框架组件的性能优化,稳定性保障,异常监控及线上问题对接解决。 4.参与平台功能研发,提供业务系统化的解决方案。
包括英文材料
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Scala+
JVM+
https://www.freecodecamp.org/news/jvm-tutorial-java-virtual-machine-architecture-explained-for-beginners/
https://www.youtube.com/watch?v=e2zmmkc5xI0
性能调优+
https://goperf.dev/
The Go App Optimization Guide is a series of in-depth, technical articles for developers who want to get more performance out of their Go code without relying on guesswork or cargo cult patterns.
https://web.dev/learn/performance
This course is designed for those new to web performance, a vital aspect of the user experience.
https://www.ibm.com/think/insights/application-performance-optimization
Application performance is not just a simple concern for most organizations; it’s a critical factor in their business’s success.
https://www.oreilly.com/library/view/optimizing-java/9781492039259/
Performance tuning is an experimental science, but that doesn’t mean engineers should resort to guesswork and folklore to get the job done.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
还有更多 •••
相关职位
社招6年以上信息技术类
1.大数据新技术规划、调研、选型及推广落地。 2.负责大数据组件内核开发优化,推进组件容器化,进行组件二次开发与适配等工作。 3.日常负责大数据框架组件的性能优化,稳定性保障,异常监控及线上问题对接解决。 4.参与平台功能研发,提供业务系统化的解决方案。
更新于 2025-07-11南京|深圳
社招3-5年数据引擎
1、基于 Spark 等核心计算引擎参与公司 AGI预训练 数据采集、去重等核心链路的重构,从引擎层设计适配方案支撑 AGI 数据处理; 2、负责 Spark、Celeborn、Hive 等离线计算引擎的维护、性能优化与稳定性保障;
更新于 2026-01-12上海|北京|杭州
社招A236855A
1、迁移方案支持:协同销售、产品等角色,在导师指导下完成企业迁移上云的技术支持,包括但不限于:需求调研,云上架构设计、迁移方案制定及风险评估等; 2、技术实施与排障:在导师指导下完成产品开通部署、迁移方案实施落地、业务系统割接等动作;能够主动发现并解决迁移过程中的基础技术问题,做好技术风险管控和实施进度管理,保障业务系统顺利迁移; 3、工具与流程优化:参与迁移工具/脚本的开发与优化,沉淀迁移场景的标准化文档和自动化脚本,提升迁移效率; 4、技术输出与沉淀:整理迁移案例经验,输出技术文档、操作手册及行业解决方案,推动知识共享。
更新于 2025-02-10西安