顺丰大数据挖掘与分析工程师实习生
社招全职1年以下地点:深圳状态:招聘
任职要求
1.统计学/数学/计算机相关专业,本科及以上学历,有数据挖掘或数据分析项目或工作经验,有NLP经验优先; 2.熟练使用SQL/Python/等看开发语言,了解并掌握Hadoop/Hive/Spark等大数据相关技术技术,熟悉Tensorflow/Pytorch等框架优先; 3.熟悉数据分析,数据挖掘,机器学习等相关技术,能熟练使用聚类、回归、分类等算法及调优,可针对具体业务有效建模并应用实践; 4.具备良好的逻辑思维能力和数据敏感度,能够从海量数据中发现有价值的规律,并结合业务发掘数据价值;
工作职责
1.参与大规模企业数据的清洗、分析和挖掘,保证基础数据质量;结合内外部数据,构建企业画像,打造企业级数据底盘能力; 2.支持业务数据的定制化开发和挖掘,针对不同业务如:销售支持,客户保障,数据应用等,将业务需求抽象成数据挖掘逻辑; 3.参与搭建企业级数据服务能力,利用文本挖掘、自然语言处理、机器学习等方法,建模分析和解决企业在供应链、拓客、经营效率及经营风险上面临的实际问题,助力企业客户的经营发展;
包括英文材料
学历+
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
数据分析+
[英文] Data Analyst Roadmap
https://roadmap.sh/data-analyst
Step by step guide to becoming an Data Analyst in 2025
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
SQL+
https://liaoxuefeng.com/books/sql/introduction/index.html
什么是SQL?简单地说,SQL就是访问和处理关系数据库的计算机标准语言。
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
相关职位
社招1年以下
1.参与大规模企业数据的清洗、分析和挖掘,保证基础数据质量;结合内外部数据,构建企业画像,打造企业级数据底盘能力; 2.支持业务数据的定制化开发和挖掘,针对不同业务如:销售支持,客户保障,数据应用等,将业务需求抽象成数据挖掘逻辑; 3.参与搭建企业级数据服务能力,利用文本挖掘、自然语言处理、机器学习等方法,建模分析和解决企业在供应链、拓客、经营效率及经营风险上面临的实际问题,助力企业客户的经营发展;
更新于 2024-07-23
校招研发类
1、深入理解业务、产品的方向和需求,构建公司数据分析与数据挖掘体系,针对复杂的业务问题,规划、设计、实现基于数据挖掘的解决方案,充分实现数据的价值; 2、分析和研究数据与实际业务的关联关系,利用数据挖掘的先进技术,针对具体业务需求场景,进行建模分析; 3、基于海量用户行为数据和其他数据,开发设计面向常规算法不能解决问题的可扩展机器学习算法,并以实际业务应用为导向研发创新方法,产生创新应用; 4、为产品运营提供数据分析支持,包括网站数据分析、产品用户分析、行业分析等。
更新于 2025-08-18
实习研发类
1、深入理解业务、产品的方向和需求,构建公司数据分析与数据挖掘体系,针对复杂的业务问题,规划、设计、实现基于数据挖掘的解决方案,充分实现数据的价值; 2、分析和研究数据与实际业务的关联关系,利用数据挖掘的先进技术,针对具体业务需求场景,进行建模分析; 3、基于海量用户行为数据和其他数据,开发设计面向常规算法不能解决问题的可扩展机器学习算法,并以实际业务应用为导向研发创新方法,产生创新应用; 4、为产品运营提供数据分析支持,包括网站数据分析、产品用户分析、行业分析等。
更新于 2025-03-28