DiDi Maps Division - Big Data R&D Intern
Internship / Part-time | Technical | Location: Beijing | Status: Hiring
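Requirements
1. Education: bachelor's degree or above in computer science, mathematics, statistics, traffic engineering, or a related field;
2. Programming: familiar with Python and comfortable with basic data analysis tools (such as Pandas, NumPy, and Matplotlib);
3. Data fundamentals: familiar with SQL and able to perform basic data queries and processing;
4. Analytical skills: a basic grounding in statistics and familiarity with common data analysis methods;
5. Learning ability: interested in traffic safety and big data analysis, and willing to learn actively and get up to speed quickly;
6. Collaboration: good communication and teamwork skills, and able to support tasks that span multiple teams;
7. Availability: able to commit to an internship of at least 4 months, working at least 4 days per week.
Preferred qualifications
1. Experience with big data processing (such as Hadoop or Spark) or familiarity with related tools;
2. Basic business understanding of, or research experience in, traffic safety and mobility;
3. Foundational knowledge of machine learning or related project experience;
4. A technical blog, open-source projects, or competition experience.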
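Responsibilities
1. Assist with big data analysis related to traffic safety, supporting safety operations and strategy optimization;
2. Using traffic big data, research and mine safety-risk features, analyze user behavior, and provide data support;
3. Help build traffic-safety data monitoring and analysis tools that enable business teams to carry out risk alerting and intervention;
4. Write data analysis reports that present findings and propose improvements;
5. Assist the team with data cleaning, organization, visualization, and related work;
6. Track and study cutting-edge techniques in traffic safety and data analysis, and explore new methods and tools.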
Includes English-language materials
Education
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Data Analysis
[English] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step-by-step guide to becoming a Data Analyst in 2025
Pandas
[English] 10 minutes to pandas
https://pandas.pydata.org/docs/user_guide/10min.html
This is a short introduction to pandas, geared mainly for new users.
[English] Cookbook - pandas
https://pandas.pydata.org/docs/user_guide/cookbook.html#cookbook
This is a repository for short and sweet examples and links for useful pandas recipes.
https://www.kaggle.com/learn/pandas
Solve short hands-on challenges to perfect your data manipulation skills.
https://www.youtube.com/watch?v=2uvysYbKdjM
I'm super excited for this one. We're doing another complete Python Pandas tutorial walkthrough.
https://www.youtube.com/watch?v=Mdq1WWSdUtw
Filtering, Joins, Indexing, Data Cleaning, Visualizations
NumPy
https://numpy.org/doc/stable/user/absolute_beginners.html
NumPy (Numerical Python) is an open source Python library that’s widely used in science and engineering.
[English] NumPy - Learn
https://numpy.org/learn/
Below is a curated collection of educational resources, both for self-learning and teaching others, developed by NumPy contributors and vetted by the community.
https://www.kaggle.com/code/themlphdstudent/learn-numpy-numpy-50-exercises-and-solution
This kernel uses exercises of NumPy from the Machine Learning Plus webpage
https://www.youtube.com/watch?v=KHoEbRH46Zk
If you've heard of Pandas and NumPy, you may think one is simply a superset of the other.
https://www.youtube.com/watch?v=QUT1VHiLmmI
Learn the basics of the NumPy library in this tutorial for beginners.
https://www.youtube.com/watch?v=VXU4LSAQDSc
This video serves as an introduction to the NumPy Python library.
Matplotlib
https://matplotlib.org/stable/tutorials/index.html
This page contains a few tutorials for using Matplotlib.
https://www.youtube.com/watch?v=c9vhHUGdav0
This video serves as an introduction to the Matplotlib Python library.
https://www.youtube.com/watch?v=OZOOLe2imFo
In this video we do a complete Matplotlib crash course in Python.
SQL
https://liaoxuefeng.com/books/sql/introduction/index.html
What is SQL? Simply put, SQL is the standard computer language for accessing and processing relational databases.
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Hadoop
https://www.runoob.com/w3cnote/hadoop-tutorial.html
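Hadoop provides reliable, scalable application-layer computing and storage support for large computer clusters. It allows distributed processing of large data sets across clusters of computers using simple programming models, and it scales from a single machine to several thousand machines.
[English] Hadoop Tutorial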
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework for storing and processing big data in a distributed environment across clusters of computers using simple programming models.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Machine Learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
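Related positions
Internship | Technical
Take part in search engine optimization work for DiDi's international business, focusing on mobility scenarios. Combining large-model deep learning techniques with geographic information domain knowledge, explore how large models can be applied to text understanding in geographic information retrieval through in-depth query parsing and fusion of geographic knowledge. This includes, but is not limited to:
1. Using the understanding and generalization abilities of large models to unify tasks such as structured query understanding, error correction and rewriting, and query intent analysis;
2. Search result quality evaluation: using large models to score the quality of search results;
3. Relevance modeling: computing relevance in a unified way across multiple categories of recalled results;
4. Exploring large-model-assisted optimization and iteration of semantic recall and retrieval ranking models.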
Updated 2025-07-23