百度算法实习生(J80417)
实习兼职ACG地点:上海状态:招聘
任职要求
-研究生在读,计算机、数学、数据挖掘、统计学相关专业,可实习6个月以上 -熟练应用python、shell、mysql; 有深度学习框架(PyTorch/TensorFlow/PaddlePaddle等)使用经验 -至少熟悉一种大数据处理技术,如MapReduce、Hive、Spark等 -对数据敏感,能从数据中发现问题、解决问题 -了解常用机器学习算法及应用场景,有半监督学习或多目标学习经验更佳 -有良好的逻辑思维能力、沟通协调能力
工作职责
-深入理解用户行为,完成数据挖掘需求,包括但不限于海量数据分析、特征挖掘等工作 -优化现有线上算法,如维护和升级已有的行为预测模型 -维护数据平台数据更新、数据监控等 -其他数据、算法相关工作
包括英文材料
数据挖掘+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
MySQL+
https://juejin.cn/post/7190306988939542585
这是一篇 MySQL 通关一篇过硬核经验学习路线,包括数据库相关知识,SQL语句的使用,数据库约束,设计等。
[英文] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PaddlePaddle+
https://learnopencv.com/paddlepaddle/
PaddlePaddle (PArallel Distributed Deep LEarning) is an open-source deep learning framework released by Baidu in 2016.
https://www.paddlepaddle.org.cn/tutorials
本课程采用飞桨特色的「横纵式」 教学法,从易到难,学习难度逐层递进,并结合图形和案例进行讲解,力求让刚接触深度学习的读者可以快速理解。
MapReduce+
https://www.youtube.com/watch?v=bcjSe0xCHbE
https://www.youtube.com/watch?v=cHGaQz0E7AU
In this video I explain the basics of Map Reduce model, an important concept for any software engineer to be aware of.
Hive+
[英文] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
相关职位
实习A13881A
ByteIntern:面向2026届毕业生(2025年9月-2026年8月期间毕业),为符合岗位要求的同学提供转正机会。 1、对生物领域的现有模型进行开发和改进,包括但不限于: 1)蛋白质结构、功能、相互作用预测; 2)蛋白质-配体相互作用预测; 3)小分子性质建模等; 2、协助团队基于模型开发完成公开生物数据的收集、清理、分析和录入; 3、构建符合业务需求的数据集合并维护。
更新于 2025-04-15
实习
1. 针对智能驾驶的各个场景,协助进行VLM的prompt优化 2. 对于VLM表现不佳的场景,能够参与到实际项目中,配合并协助做一些数据收集和处理,以及VLM模型微调方面的工作 3. 对前沿技术感兴趣,能够阅读相关领域文献,保持对前沿技术的敏感性
更新于 2025-06-06
实习
1. 辅助完成AI在电动汽车智造领域的关键技术预研; 2. 参与汽车工厂侧算法项目预研,包括但不局限于基于图像的异常检测等项目,不断完善和优化算法,以技术驱动公司业务成长,体现技术价值;
更新于 2025-04-08