
奇虎36025春-大数据平台研发工程师(北京)-4512(J11614)
校招全职开发类地点:北京状态:招聘
任职要求
1.本科或研究生以上学历,计算机科学、软件工程等相关专业; 2.具备扎实的计算机操作系统和网络基础知识,熟悉分布式系统的设计原理和实现方法,了解Hadoop大数据生态系统; 3.有分布式存储和计算系统的开发和维护实战经验,熟悉至少一种业界主流分布式计算产品、调度框架,如YARN、Spark、Flink、Hive、Trino、Starrocks、Iceberge等优先; 4.熟练掌握Java或C++编程语言,有用Java和C++编程语言,重视代码质量,熟悉DevOps流程者优先; 5.熟悉Linux工具、掌握内核原理、具备良好的分析问题解决问题的能力、有分布式计算产品调优者优先; 6.有较强的学习能力,有良好的团队合作精神,善于沟通和交流。
工作职责
1.参与 MapReduce/Yarn/Hive 相关大数据基础组件的二次开发; 2.负责分布式计算平台的研发建设工作,包括离线计算/实时计算/资源调度/数仓等系统的开发、优化; 3.持续推动公司大数据计算技术栈的优化和演进,提供稳定高效服务,支撑业务在大数据计算方向的各类需求,改善用户体验; 4.深入优化业务场景,针对性提供技术解决方案; 5.跟踪业界最新的分布式计算技术,并应用于实际系统中。
包括英文材料
学历+
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
Yarn+
[英文] Introduction
https://yarnpkg.com/getting-started
Yarn is an established open-source package manager used to manage dependencies in JavaScript projects.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
StarRocks+
https://docs.starrocks.io/docs/quick_start/
These Quick Start guides will help you get going with a small StarRocks environment.
https://itnext.io/introduction-to-starrocks-a-new-modern-analytical-database-1db2177d26e1
Recently, I had the opportunity to explore StarRocks which is the new kid in the block when talking about massive scale databases which are able to handle petabytes of data.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
DevOps+
https://roadmap.sh/devops
Step by step guide for DevOps, SRE or any other Operations Role in 2025
https://zhuanlan.zhihu.com/p/562036793
DevOps中的Dev指的是Development(开发),Ops指的是Operations(运维),用一句话来说,DevOps就是打通开发运维的壁垒,实现开发运维一体化。
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
内核+
https://www.youtube.com/watch?v=C43VxGZ_ugU
I rummage around the Linux kernel source and try to understand what makes computers do what they do.
https://www.youtube.com/watch?v=HNIg3TXfdX8&list=PLrGN1Qi7t67V-9uXzj4VSQCffntfvn42v
Learn how to develop your very own kernel from scratch in this programming series!
https://www.youtube.com/watch?v=JDfo2Lc7iLU
Denshi goes over a simple explanation of what computer kernels are and how they work, alonside what makes the Linux kernel any special.
相关职位

校招算法类
1.基于LLM、协同过滤、信息检索、图算法等技术优化推荐算法效果,提高产品体验与流量效率; 2.利用数据、经验和客观规律,改善生态环境,提升运营能力; 3.探索LLM等技术的应用创新,探索LLM4Rec在业务上的真实落地;
更新于 2025-08-14

校招算法类
1.基于机器学习、协同过滤、信息检索、图算法等技术优化推荐算法效果,提高产品体验与流量效率; 2.利用数据、经验和客观规律,改善生态环境,提升运营能力; 3.探索基于深度学习、NLP等技术的应用创新;
更新于 2025-04-10
实习住宿业务内容管理
1. 协助完成携程酒店平台大促营销方案的执行落地,包括但不限于, 五一、国庆、春促,保证活动顺利上线; 2. 协助进行项目管理,协调并对接资源端、设计、投放、市场等部门,推进内外部合作,实现项目落地并优化相关流程。 3. 协助团队制定各类营销活动的站内外投放策略,并根据数据表现进行策略优化并迭代,持续提升营销效率 4. 参与营销创意脑暴,协助团队将创意转化为可落地方案,整合公司内部资源,完成创新项目的上线 5. 支持营销素材的审核、材料的制作与翻译、活动配置等营销团队日常运营工作。
更新于 2024-04-02