
同程旅行大数据工程师
社招全职3年以上地点:苏州状态:招聘
任职要求
1、本科及以上学历,计算机、数学或相关专业优先; 2、3年及以上大数据开发和治理经验,有数据仓库、数据湖、数据可视化等相关项目经验优先; 3、熟悉Hadoop、Spark、Flink、Hive、HBase等大数据技术和工具,能够熟练编写和调试相应的程序; 4、熟练掌握java、Scala编程语言,熟练使用springboot等开发框架,有良好的编码习惯和代码质量意识; 5、了解Linux操作系统和Shell编程,能够独立完成服务器配置和维护等相关工作; 6、具备良好的沟通和团队协作能力,能够与业务人员、数据分析师和其他开发人员协作完成项目任务; 7、具备较强的学习能力和自我驱动能力,能够快速学习新的技术和工具,主动推动工作的进展和改进; 8、有商业化开发经验优先。
工作职责
1、负责大数据平台的开发和治理,包括数据采集、存储、处理、计算和展示等全生命周期的工作; 2、负责数据仓库和数据湖的设计和实现,提供高效、可靠、安全、可扩展的数据存储和计算能力; 3、熟悉数据架构、数据模型、数据质量和数据安全等方面的知识,能够设计和优化数据模型、ETL流程和数据治理流程; 4、熟练掌握Hadoop、Spark、Flink、Hive、HBase等大数据技术和工具,能够根据业务需求选择并使用适当的技术; 5、熟悉数据可视化和报表工具,如Tableau、PowerBI、FineBI等,能够根据开源报表系统定制化开发报表; 6、关注最新的大数据技术和行业发展趋势,参与技术选型和架构设计,推动大数据平台的技术创新和业务应用。
包括英文材料
学历+
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Scala+
Spring Boot+
https://spring.io/guides/gs/spring-boot
his guide provides a sampling of how Spring Boot helps you accelerate application development.
https://www.youtube.com/watch?v=Nv2DERaMx-4&list=PLzUMQwCOrQTksiYqoumAQxuhPNa3HqasL
The author teaches you how to use Spring Boot from a complete beginner, to building a REST API with a real database, Dockerising it and deploying it to the cloud.
开发框架+
[英文] Understanding Modern Development Frameworks: A Guide for Developers and Technical Decision-makers
https://www.freecodecamp.org/news/understanding-modern-development-frameworks-guide-for-devs/
编程规范+
[英文] Google Style Guides
https://google.github.io/styleguide/
Every major open-source project has its own style guide: a set of conventions (sometimes arbitrary) about how to write code for that project. It is much easier to understand a large codebase when all the code in it is in a consistent style.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
相关职位
社招5年以上研发类
1、通过用户分析、平台分析、内容分析等挖掘业务增长机会,通过 AB 实验、模型搭建及落地等助力科学决策与业务增长; 2、洞察业务诉求,提出增长策略,为广告、游戏等业务提供数据基建与数据科学服务;
更新于 2025-07-16
社招3年以上研发类
1、负责海量用户数据的分析和挖掘,构建用户画像体系; 2、负责用户特征分析与洞察,搭建用户价值分级模型,以及在广告场景/个性化推荐等场景的落地应用,助力业务转化效果提升; 3、负责应用链路效果分析,洞察优化点,持续深入挖掘数据策略应用机会。
更新于 2024-04-30
社招5-7年研发类
1、负责海量用户行为数据与内容数据挖掘,构建高质量的用户画像体系,包括用户基础属性、行业兴趣偏好等; 2、负责用户行为分析与预测,搭建人群优选、用户价值分级等算法模型,应用于广告场景/个性化推荐等场景,助力业务效果提升; 3、负责人群应用链路效果分析,洞察优化点,持续深入挖掘数据策略应用机会并落地实践。