小米数据理解研发工程师实习生
实习兼职地点:武汉状态:招聘
任职要求
1.熟练掌握Java/python编程语言,熟悉Linux开发环境; 2.了解scala编程语言,熟悉spark/hadoop大数据并行计算框架; 3.扎实的编程能力,熟悉常用数据挖掘算法和数据结构; 4.对数据有一定的敏感度,能快速定位数据问题; 5.有强烈的上进心和自我驱动,学习适应能力强,乐观自信,能挑战自我不断追求卓越。
工作职责
1.负责小爱视频/音乐/电台/导航等垂域的数据理解工作,包括数据挖掘、清洗、审核、融合等; 2.负责相关业务的数据清洗融合流程的建设、维护、策略优化等; 3.和语义理解/基础数据爬取侧进行密切沟通,能够分析定位数据问题并提出合理解决方案; 4.负责各内容领域的数据问题分析和总结,并能提供建议和帮助改善数据处理流程和效果; 5.熟悉大模型优先,能利用大模型做数据挖掘。
包括英文材料
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Scala+
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
大数据+
https://www.youtube.com/watch?v=bAyrObl7TYE
https://www.youtube.com/watch?v=H4bf_uuMC-g
With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
相关职位
实习网易游戏(雷火)
你可以: - 创造和开发世界一流的游戏,倩女幽魂、逆水寒、天谕……下一款大作正等待着你; - 成为最具创造力团队的一员,和国内最顶级的游戏研发团队一起工作,解决各种挑战性问题; - 构建真实的虚拟世界,你的代码将为无数玩家带来梦幻般的体验; - 参与世界一流的游戏引擎技术研发,从客户端到服务端,跨越广阔的技术领域。
更新于 2025-07-11
实习网易游戏(雷火)
你可以: 1、成为最具创造力团队的一员,和国内外最顶级的游戏研发团队一起工作,解决各种挑战性问题; 2、构建真实的虚拟世界,与策划一起探讨游戏玩法并实现,你的代码将为无数玩家带来梦幻般的体验; 3、参与项目业务系统的服务端开发和维护; 4、参与项目工具的开发与维护。
更新于 2025-07-15
实习网易游戏(雷火)
你可以: 1、成为最具创造力团队的一员,和国内外最顶级的游戏研发团队一起工作,解决各种挑战性问题; 2、构建真实的虚拟世界,与策划一起探讨游戏玩法并实现,你的代码将为无数玩家带来梦幻般的体验; 3、参与项目业务系统的服务端开发和维护; 4、参与项目工具的开发与维护。
更新于 2025-07-25