理想汽车数据开发实习生
实习兼职数据开发地点:北京状态:招聘
任职要求
1. 实习期保证6个月以上,全职到岗,计算机、大数据等相关专业大学本科及以上学历在读; 2. 熟悉SQL/Java/python等至少一种开发语言; 3. 了解hadoop/hive/spark/storm/flink等原理; 4. 了解数据库、范式,了解数据仓库,有使用数据库和编程语言解决实际问题的经历; 5. 有较好的学习能力,沟通表达、积极主动; 6. 对数据价值探索充满热情,能快速分析和理解问题; 加分项: 7. 热爱互联网和新技术,具有较强的快速学习能力; 8. 同时具备多种数仓(实时数仓、离线数仓、流批一体数仓)开发经验; 9. 掌握向量库和图数据库的应用开发者优先。
工作职责
1. 参与车辆数据平台数据仓库及数据应用服务的研发工作; 2. 参与实时、离线、流批一体数据仓库的建设,数据方案设计,模型开发,指标体系的开发和数据治理; 3. 支持业务团队的数据分析工作,负责面向业务的统计报表,数据提取等工作; 4. 参与解决项目中的问题和技术难题,线上疑难问题排查和解决; 5. 理解数据仓库架构,在项目实施的过程中,发现并解决各种维度/粒度的数据问题。
包括英文材料
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
学历+
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Apache Storm+
[英文] Tutorial
https://storm.apache.org/releases/2.6.0/Tutorial.html
In this tutorial, you'll learn how to create Storm topologies and deploy them to a Storm cluster.
https://www.baeldung.com/apache-storm
This tutorial will be an introduction to Apache Storm, a distributed real-time computation system.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
数据仓库+
https://www.youtube.com/watch?v=9GVqKuTVANE
From Zero to Data Warehouse Hero: A Full SQL Project Walkthrough and Real Industry Experience!
https://www.youtube.com/watch?v=k4tK2ttdSDg
相关职位
实习车辆控制
1. 参与车控算法的RD和开发项目,负责车控算法数仓建立、数据集成的工作; 2. 设计和开发高效、可扩展的ETL数据管道,优化数据清洗、转换和加载流程; 3. 参与数据仓库(如Hive、ClickHouse)、实时数仓(如Flink、Kafka)的架构设计与开发; 4. 对接业务需求,开发数据服务接口,为数据分析、机器学习等场景提供高质量数据支持; 5. 解决大数据集群的性能瓶颈,调优Hadoop/Spark/Flink等框架的资源利用率与计算效率。
社招软件研发
作为蔚来汽车整车应用软件中心数据算法组的数据开发实习生,你的职责包括: 1. 基于智能网联汽车大数据,建立功能与业务分析框架,定量分析用户行为,推进整车应用软件的持续更新。 2. 参与团队平台化数字化建设,提供数据建模与开发支持。 3. 与产品经理、数据科学家深度合作,打造车-云-算法闭环生态。
更新于 2023-06-28

实习金山世游
岗位职责: 1、使用 Go 语言构建产品化的云原生高性能数据平台,支持日均上亿条的流式数据处理任务; 2、使用 Python 开发工具链和自动化脚本,提升数据开发、分析和运营的工作效率; 3、应用 FinOps 方法论,基于数据分析,解决云服务的成本分析和优化问题,建立资源使用效能评估体系; 4、与数据分析师协同工作,为复杂的分析场景和模型,提供可以落地的解决方案,提高分析工作的效率; 5、在排除故障和数据修复等领域,提出有建设性的解决方案,构建数据质量监控体系。
更新于 2025-08-12