
文远知行标注应用开发工程师
社招全职地点:广州状态:招聘
任职要求
本科及以上学位,计算机相关专业,有互联网公司工作经验 熟悉NodeJS/Python/C++/Go中的至少一种技术语言, 熟悉一种以上脚本语言,如Shell/Python/JS等 熟悉TCP/IP、HTTP协议、进程间通讯编程,熟悉 Linux,熟悉 Docker 熟练掌握数据库、缓存、消息队列的原理及应用 深刻理解计算机原理,有良好的数据结构和算法基础 学习能力强,对产品有强烈的责任心,具备良好的沟通能力和优秀的团队协作能力 有服务治理,稳定性提升,容量规划等高可用、高并发、分布式系统的实际经验为加分项 有 git 开源项目维护经验加分
工作职责
公司介绍: 文远知行(WeRide)成立于 2017 年,是全球领先的 L4 级自动驾驶科技公司,致力于“以无人驾驶改变人类出行”,已在全球超过 25 个城市开展自动驾驶研发、测试及运营,累积自动驾驶里程超1600万公里,应用场景覆盖智慧出行、智慧货运和智慧环卫,形成自动驾驶出租车、自动驾驶小巴、自动驾驶货运车、自动驾驶环卫车、高阶智能驾驶等五大产品矩阵,提供网约车、随需公交、同城货运、智能环卫、高阶智能驾驶解决方案等多种服务。 凭借“1个平台+3大场景+5大产品”的多元商业化战略,文远知行商业营收居同类自动驾驶企业之首,已与多家全球顶级主机厂和一级供应商达成战略合作伙伴关系,包括雷诺日产三菱联盟、宇通集团、博世、广汽集团等,不断为人类出行提供更多新选择。 文远知行目前的团队既有来自谷歌、微软、亚马逊、苹果、百度、滴滴的高阶工程师,也不乏刚从顶尖高校毕业的青年才俊。来到我们当中,你会发现这是一个专业、专注、有趣、有料的队伍。大家为了一个激动人心而富有挑战的目标走到一起,互相激励、脑力碰撞,为实现产品落地、创造社会价值、推进行业技术而努力。对有抱负的工程师,还有什么比这更有意义的呢?我们虚位以待,真诚期待技术过硬、志趣相投的小伙伴加入我们! 更多信息请访问:http://www.weride.ai,或关注官方微信号:文远知行WeRide Data团队介绍: “数据团队可以说是自动驾驶系统的血液,给自动驾驶各个模块提供养分,同时将它们紧密结合在一起。” 数据组的是打通自动驾驶数据闭环链路,包括数据采集、数据上传、数据平台搭建、数据索引、数据标注、模型训练等多个方面,打造一个完整的闭环,加快自动驾驶算法的迭代速度。同时,数据组也开发了多个数据分析和处理的工具链,全方位打造企业级的大数据平台。 岗位职责: 独立完成公司标注应用服务相关开发工作 确定接口协议,独立完成接口文档的编写并组织评审 深入解析代码, 提升代码执行效率, 加强代码兼容性 深入了解产品经理业务需求,能把业务需求转化为可落地研发技术方案
包括英文材料
学历+
JavaScript+
https://developer.mozilla.org/zh-CN/docs/Learn_web_development/Core/Scripting
[英文] Learn JavaScript
https://learnjavascript.online/
The easiest way to learn & practice modern JavaScript
[英文] Learn JavaScript
https://web.dev/learn/javascript
https://www.youtube.com/watch?v=zuKbR4Q428o
Write bulletproof JavaScript code with unit testing!
Node.js+
https://liaoxuefeng.com/books/javascript/nodejs/index.html
从本章开始,我们就正式开启JavaScript的后端开发之旅。
https://www.youtube.com/watch?v=32M1al-Y6Ag
This is an intro to Node.js. No frameworks or libraries.
https://www.youtube.com/watch?v=zb3Qk8SG5Ms&list=PL4cUxeGkcC9jsz4LDYc6kv3ymONOKxwBU
In this Node JS tutorial I'll introduce to what exactly Node is all about, why we'd use it and the technologies you'll need to be familiar with to get started.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
脚本+
[英文] Scripting language
https://en.wikipedia.org/wiki/Scripting_language
https://zhuanlan.zhihu.com/p/571097954
一个脚本通常是解释执行而非编译。脚本语言通常都有简单、易学、易用的特性,目的就是希望能让程序员快速完成程序的编写工作。
Bash+
[英文] The Bash Guide
https://guide.bash.academy/
A quality-driven guide through the shell's many features.
https://www.youtube.com/watch?v=tK9Oc6AEnR4
Understanding how to use bash scripting will enhance your productivity by automating tasks, streamlining processes, and making your workflow more efficient.
TCP/IP+
[英文] What is TCP/IP?
https://www.techtarget.com/searchnetworking/definition/TCP-IP
TCP/IP stands for Transmission Control Protocol/Internet Protocol and is a suite of communication protocols used to interconnect network devices on the internet.
HTTP+
https://developer.mozilla.org/zh-CN/docs/Web/HTTP
超文本传输协议(HTTP)是一个用于传输超媒体文档(例如 HTML)的应用层协议。它是为 Web 浏览器与 Web 服务器之间的通信而设计的,但也可以用于其他目的。
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
Docker+
https://www.youtube.com/watch?v=GFgJkfScVNU
Master Docker in one course; learn about images and containers on Docker Hub, running multiple containers with Docker Compose, automating workflows with Docker Compose Watch, and much more. 🐳
https://www.youtube.com/watch?v=kTp5xUtcalw
Learn how to use Docker and Kubernetes in this complete hand-on course for beginners.
缓存+
https://hackernoon.com/the-system-design-cheat-sheet-cache
The cache is a layer that stores a subset of data, typically the most frequently accessed or essential information, in a location quicker to access than its primary storage location.
https://www.youtube.com/watch?v=bP4BeUjNkXc
Caching strategies, Distributed Caching, Eviction Policies, Write-Through Cache and Least Recently Used (LRU) cache are all important terms when it comes to designing an efficient system with a caching layer.
https://www.youtube.com/watch?v=dGAgxozNWFE
消息队列+
https://www.youtube.com/watch?v=xErwDaOc-Gs
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
服务治理+
https://cloudnativecn.com/blog/istio-traffic-management-series-service-management-concept-theory/
通过阅读本文读者可以初步理解 Istio 流量治理的概念和相关知识框架。
https://juejin.cn/post/6844904006033080334
服务治理主要包括服务发现、负载均衡、限流、熔断、超时、重试、服务追踪等。我们今天要讲的,就是服务发现的内容。
高可用+
https://redis.io/blog/high-availability-architecture/
A high available architecture is when there are a number of different components, modules, or services that work together to maintain optimal performance, irrespective of peak-time loads.
https://www.ibm.com/think/topics/high-availability
High availability (HA) is a term that refers to a system’s ability to be accessible and reliable close to 100% of the time.
高并发+
https://www.baeldung.com/concurrency-principles-patterns
In this tutorial, we’ll discuss some of the design principles and patterns that have been established over time to build highly concurrent applications.
https://www.baeldung.com/java-concurrency
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
https://www.oreilly.com/library/view/concurrency-in-go/9781491941294/
You’ll understand how Go chooses to model concurrency, what issues arise from this model, and how you can compose primitives within this model to solve problems.
https://www.oreilly.com/library/view/modern-concurrency-in/9781098165406/
With this book, you'll explore the transformative world of Java 21's key feature: virtual threads.
https://www.youtube.com/watch?v=qyM8Pi1KiiM
https://www.youtube.com/watch?v=wEsPL50Uiyo
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
Git+
https://www.youtube.com/watch?v=rH3zE7VlIMs
Learn Git from start to finished in this full course written by ThePrimeagen.
相关职位
实习淘天集团日常实习
1. 参与企业办公场景智能体架构的探索,协助实现AI Agent系统的功能模块开发; 2. 在导师指导下学习RAG、知识图谱等技术,参与基础推理框架的搭建与测试; 3. 接触大模型基础调优技术,学习通过数据标注与效果评估提升系统性能; 4. 配合业务团队完成需求分析,参与具体业务场景的功能实现与效果验证; 5. 协助优化智能体交互体验,参与系统基础能力的评估测试; 6. 学习Agent领域前沿技术(如ReAct/COT等),参与技术调研与demo实现。
更新于 2025-05-06
校招钉钉2026届秋
1. 构建会议AI算法工程体系(评测系统、监控系统、标注系统、AI Agent架构) 2. 会议AI基础模型的接入和优化(模型调研、训练和推理,算法工程建设) 3. AI产品的功能完善,智能化提升(AI听记、会议AI助理等线上产品的开发) 4. AI解决方案的设计和应用落地(找到具有客户价值的落地场景,真正为客户解决问题)
更新于 2025-08-27

社招1年以上研发
1、负责AI数据工程系统的建设,包括数据采集、清洗、合成、标注等环节; 2、开发检索增强生成(RAG)系统,结合向量检索与全文检索能力,提升LLM在行业场景中的回答准确性; 3、设计并实现AI Agent系统,包括任务规划、工具调用、决策执行等核心能力,赋能业务场景的智能化应用; 4、优化检索算法,设计混合检索策略(关键词+语义+...),解决长尾查询、模糊意图匹配等挑战;参与检索增强、Tool Use、Agent等方向的研发工作; 5、完成典型RAG场景下的链路构建与部署优化,包括通过内容抽取、文本切片进行知识库构建,优化检索、重排序、生成模型的部署方式,提升推理效率。
更新于 2025-10-11