百度java/python 研发工程师实习生(J85515)
实习兼职MEG地点:北京状态:招聘
任职要求
-熟悉C++/Java/Php/Python/React的一种或多种 -熟悉常用的数据结构和算法 -了解流式计算等相关技术,有Spark/Hadoop/flink/Clickhouse等大数据处理经验优先经验者优先 -具备优秀的逻辑思维能力,对解决挑战性问题充满热情,善于解决问题和分析问题 -有强烈的上进心和求知欲,善于学习新事物,渴望用技术改变未来 -良好的团队合作精神,较强的沟通能力和学习能力
工作职责
-负责构建大数据分析平台以及数据分析和挖掘工作 -参与大数据处理引擎的架构设计和优化,支撑PB级数据的实时流式计算、秒级OLAP、Spark查询引擎系统的研发 -参与海量数据的建模、存储、查询和分析体系搭建 -对现有系统的不足进行分析,找到目前系统的瓶颈,提高系统性能
包括英文材料
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
React+
[英文] Quick Start - React
https://react.dev/learn
This page will give you an introduction to 80% of the React concepts that you will use on a daily basis.
https://www.youtube.com/watch?v=SqcY0GlETPk
Master React 18 with TypeScript! ⚛️ Build amazing front-end apps with this beginner-friendly tutorial.
https://www.youtube.com/watch?v=x4rFhThSX04
Learn modern React basics in the most interactive, hands-on way possible in the full course for beginners.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
ClickHouse+
[英文] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
相关职位
实习
1.负责小爱视频/音乐/电台/导航等垂域的数据理解工作,包括数据挖掘、清洗、审核、融合等; 2.负责相关业务的数据清洗融合流程的建设、维护、策略优化等; 3.和语义理解/基础数据爬取侧进行密切沟通,能够分析定位数据问题并提出合理解决方案; 4.负责各内容领域的数据问题分析和总结,并能提供建议和帮助改善数据处理流程和效果; 5.熟悉大模型优先,能利用大模型做数据挖掘。
更新于 2025-08-04
实习D11434
1、参与快手大数据体系的设计与建设,通过数据仓库、元数据、数据管理等体系,管理和建设几千P的数据; 2、利用Hive,Spark等组件处理百亿级数据,通过对数据的建设和应用理解,支持商业化业务的数据需求; 3、基于onedata的建模思路进行商业化数仓的建模实践; 4、 利用OLAP技术建设秒级别快速查询和分析平台,来支持商业化客户的数据洞察需求。
更新于 2024-10-18
实习
1.负责相关业务服务器端的研发工作,包括需求沟通、功能设计与开发等; 2.负责相关业务服务器相关的高并发架构设计、线上维护、性能调优等; 3.和产品/测试/运营进行密切沟通,能够根据需求提出合理技术方案; 4.负责软件开发过程中的问题分析和总结,提供建议和帮助改善研发流程。
更新于 2025-08-21