
WeRide (文远知行) Senior Analytics Infra Engineer
Experienced hire · Full-time · 3+ years of experience · Location: Guangzhou · Status: Hiring
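Requirements
Job Description
1. WeRide has a large volume of autonomous-driving test data. As a data engineer, you will build key datasets and data access tools and, together with our partner teams, draw insights into algorithm performance from road-test data, drive decisions with data, and accelerate the progress of autonomous-driving technology;
2. WeRide is committed to agile team collaboration and results-oriented performance evaluation, and values creative problem solving, embracing change, and continuous learning and improvement.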
Individual Responsibility
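1. Design, develop, and release efficient, robust data pipelines that give our partner teams intuitive analytical data;
2. Develop and maintain web-based data products that make massive volumes of autonomous-driving test data easier to search and access;
3. Help other engineers and data scientists obtain the data and statistics they need;
4. Diagnose and resolve problems in existing pipelines, and design and develop their successors;
5. Team collaboration, code…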
Job Responsibilities
None
Includes English-language materials
Algorithms
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
SQL
https://liaoxuefeng.com/books/sql/introduction/index.html
What is SQL? Simply put, SQL is the standard computer language for accessing and processing relational databases.
https://sqlbolt.com/
Learn SQL with simple, interactive exercises.
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Redshift
https://aws.amazon.com/awstv/watch/d67f8f62aca/
This video demonstrates how to quickly set up an Amazon Redshift data warehouse and analyze data using Query Editor v2.
https://aws.amazon.com/redshift/getting-started/
Find resources to get started with Amazon Redshift, a cloud data warehouse.
Hive
[English] Hive Tutorial
https://www.tutorialspoint.com/hive/index.htm
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
https://www.youtube.com/watch?v=D4HqQ8-Ja9Y
Presto
[English] What is Presto?
https://prestodb.io/what-is-presto/
https://www.tutorialspoint.com/apache_presto/index.htm
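Hadoop
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop provides reliable, scalable application-layer computing and storage support for very large computer clusters. It allows large datasets to be processed in a distributed fashion across clusters of computers using simple programming models, and it scales from a single machine to thousands of machines.
[English] Hadoop Tutorial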
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.