百度自动驾驶资深数据开发工程师(J94281)
社招全职5年以上IDG地点:北京状态:招聘
任职要求
-计算机相关专业,本科及以上学历,5年及以上工作经验,有数据开发技术leader经验优先 -熟练掌握Python、C++、Shell、Go等至少一种编程语言,熟悉主流Python服务框架或GO服务框架 -熟练运用容器化技术(如Docker、Kubernetes),对云计算平台的资源管理与调配有实际操作经验 -熟练Mysql、MongoDB、Redis、Doris、Clickhouse等数据库相关知识及使用场景,了解消息队列的原理和使用 -对大数据系统存储技术有一定了解,具备Elasticsearch、分布式文件存储(如HDFS/NFS)和对象存储(如S3/MinIO)使用开发实践经验 -熟悉自动驾驶数据闭环、模型训练、模型评测流程者优先
工作职责
-负责优化设计自动驾驶数据流水线,构建高可用、易扩展、低延迟的服务架构 -负责自动驾驶模型迭代相关的数据仓库、数据处理等方向的技术规划与开发工作 -负责设计开发用户端SDK、API支持自动驾驶数据高效、稳定、高并发低时延地读写 -负责设计及实现合理的数据生命周期管理策略,保证满足业务数据需求和存储成本控制需求
包括英文材料
学历+
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
Docker+
https://www.youtube.com/watch?v=GFgJkfScVNU
Master Docker in one course; learn about images and containers on Docker Hub, running multiple containers with Docker Compose, automating workflows with Docker Compose Watch, and much more. 🐳
https://www.youtube.com/watch?v=kTp5xUtcalw
Learn how to use Docker and Kubernetes in this complete hand-on course for beginners.
Kubernetes+
https://kubernetes.io/docs/tutorials/kubernetes-basics/
This tutorial provides a walkthrough of the basics of the Kubernetes cluster orchestration system.
https://kubernetes.io/zh-cn/docs/tutorials/kubernetes-basics/
本教程介绍 Kubernetes 集群编排系统的基础知识。每个模块包含关于 Kubernetes 主要特性和概念的一些背景信息,还包括一个在线教程供你学习。
https://www.youtube.com/watch?v=s_o8dwzRlu4
Hands-On Kubernetes Tutorial | Learn Kubernetes in 1 Hour - Kubernetes Course for Beginners
https://www.youtube.com/watch?v=X48VuDVv0do
Full Kubernetes Tutorial | Kubernetes Course | Hands-on course with a lot of demos
MySQL+
https://juejin.cn/post/7190306988939542585
这是一篇 MySQL 通关一篇过硬核经验学习路线,包括数据库相关知识,SQL语句的使用,数据库约束,设计等。
[英文] MySQL Tutorial
https://www.mysqltutorial.org/
your go-to resource for mastering MySQL in a fast, easy, and enjoyable way.
https://www.youtube.com/watch?v=5OdVJbNCSso
MySQL SQL tutorial for beginners
https://www.youtube.com/watch?v=7S_tz1z_5bA
This beginner-friendly course teaches you SQL from scratch.
MongoDB+
https://learnxinyminutes.com/mongodb/
MongoDB is a NoSQL document database for high volume data storage.
https://studio3t.com/academy/#courses
The fastest way to learn MongoDB
https://www.youtube.com/watch?v=c2M-rlkkT5o
This video will give you and introduction to MongoDB in 1 Hour. Afterwards I recommend exploring aggregation, replication, and sharding.
https://www.youtube.com/watch?v=ExcRbA7fy_A&list=PL4cUxeGkcC9h77dJ-QJlwGlZlTd4ecZOA
You'll learn how to use MongoDB (a NoSQL database) from scratch. You'll also learn how to integrate it into a simple Node.js API.
Redis+
[英文] Developer Hub
https://redis.io/dev/
Get all the tutorials, learning paths, and more you need to start building—fast.
https://www.runoob.com/redis/redis-tutorial.html
REmote DIctionary Server(Redis) 是一个由 Salvatore Sanfilippo 写的 key-value 存储系统,是跨平台的非关系型数据库。
https://www.youtube.com/watch?v=jgpVdJB2sKQ
In this video I will be covering Redis in depth from how to install it, what commands you can use, all the way to how to use it in a real world project.
Doris+
https://doris.apache.org/docs/gettingStarted/what-is-apache-doris
ClickHouse+
[英文] Advanced Tutorial
https://clickhouse.com/docs/tutorial
Learn how to ingest and query data in ClickHouse using the New York City taxi example dataset.
https://www.youtube.com/watch?v=FtoWGT7kS-c
ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.
https://www.youtube.com/watch?v=Rhe-kUyrFUE&list=PL0Z2YDlm0b3gcY5R_MUo4fT5bPqUQ66ep
消息队列+
https://www.youtube.com/watch?v=xErwDaOc-Gs
ElasticSearch+
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
HDFS+
https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.
https://www.ibm.com/cn-zh/think/topics/hdfs
Hadoop 分布式文件系统 (HDFS) 是一种管理大型数据集的文件系统,可在商用硬件上运行。
S3+
https://aws.amazon.com/s3/getting-started/
You can use Amazon S3 to store and retrieve any amount of data at any time, from anywhere.
https://www.youtube.com/watch?v=tfU0JEZjcsg
Amazon S3 is the oldest and one of the most popular services on AWS.
自动驾驶+
https://www.youtube.com/watch?v=_q4WUxgwDeg&list=PL05umP7R6ij321zzKXK6XCQXAaaYjQbzr
Lecture: Self-Driving Cars (Prof. Andreas Geiger, University of Tübingen)
https://www.youtube.com/watch?v=NkI9ia2cLhc&list=PLB0Tybl0UNfYoJE7ZwsBQoDIG4YN9ptyY
You will learn to make a self-driving car simulation by implementing every component one by one. I will teach you how to implement the car driving mechanics, how to define the environment, how to simulate some sensors, how to detect collisions and how to make the car control itself using a neural network.
相关职位
社招5年以上
1.深挖数据价值,构建和维护车端信号数据仓库体系和数据指标体系,为算法和数据闭环提供框架支持; 2.参与构建批流统一的数据分析平台,支持百亿级自动驾驶感知和全栈数据的快速定位和分析; 3.参与平台架构规划,负责前沿技术的跟踪研究,工具链的选型测试,解决、攻克数据平台的核心技术难题; 4.建立监控和反馈指标,持续优化改进产品的架构及性能,保证PB级数仓的数据质量和平台稳定性。
更新于 2025-05-14

社招5年以上技术
工作职责: 1、负责自动驾驶数据平台的核心架构设计、开发与运维,主导数据接入、解析、治理、挖掘及数据集管理等关键工具链的技术方案与实施。 2、设计并开发高可靠性、高效率的数据加工与自动化Pipeline,服务于模型训练与迭代。 3、保障平台在大规模数据与高并发场景下的稳定性、性能与成本优化。 4、深入理解自动驾驶多模态数据(传感器数据、真值系统、高精地图),主导数据标签体系设计与深度解析,为算法迭代提供核心数据支持。 5、作为技术核心,跨部门协同算法、仿真、测试团队,驱动数据闭环的落地与自动驾驶模型的高效迭代
更新于 2025-09-08
社招5年以上IDG
-负责自动驾驶出行业务-订单引擎或萝卜安全堡垒系统设计和开发工作 -负责实现自动驾驶全无人模式运营落地,保障大规模全无人、自动化24h运营效率和安全 -通过业务数据分析挖掘潜在线上问题和可优化点,推动落地,助力业务效果提升 -通过稳定性能力建设,保障所负责方向架构合理性,打造高效稳定、可扩展性强的系统支撑能力 -根据需求文档进行相关产品的开发,撰写开发文档,保质保量按时完成开发任务 -参与国际化业务架构设计及开发工作 -负责架构稳定性、性能优化、扩展性技术研发 -攻克高性能、高并发、高可用性等各种不同技术场景下的技术挑战,持续进行架构打磨优化
更新于 2025-06-12