logo of tesla

特斯拉(高级/资深)可靠性工程师 (Senior/Staff) Site Reliability Engineer, Fleetnet

社招全职软件平台地点:上海状态:招聘

任职要求


Must
5+ years building and maintaining SaaS infrastructure with a healthy mix of….
Expert skills with Linux, networking, storage and virtualization automation with tools like Kubernetes, Terraform, Ansible, Chef et aliq.
Setting up and supporting CI/CD.
Proficiency in a high-level language like Python, Go, Ruby and/or Java.
Scaling through data-driven capacity planning, within both physical data centers and Cloud infrastructure (AWS, GCP or Azure) nice to have.
Troubleshooting and full-cycle incident response (mitigation, correction, prevention).
Strong belief in spreading (& acquiring) knowl…
登录查看完整任职要求
微信扫码,1秒登录

工作职责


THE ROLE
We're the small, expert team creating the next-generation server-side infrastructure to support the manufacturing and functionality of fleets of Tesla products, and we're looking for seasoned SREs with domain expertise in one or more of: containers, public clouds and cloud-native apps.
Today, Tesla owners rely on our services to safely and securely summon their cars with a tap on their mobile phones -- a feature enabled by one of the many over-the-air updates we've delivered to the Tesla vehicle fleet. Tesla engineering relies on our data and analytics platform to make Tesla products better and safer. And, when an owner needs assistance, Tesla service and support rely our applications to understand and respond to the situation. Tomorrow, we will apply fleet learning to dispatch and deliver real-time road conditions to millions of autonomous vehicles and manage distributed energy generation & storage at grid scale.
Join us and you will work alongside world-class software and data engineers on some of the newest and most challenging IoT, manufacturing and service engineering problems in the world today. The platform you help us build and automate will be used daily by millions of Tesla owners (and tens of thousands of Tesla employees) to improve and enhance the functionality of our cars, chargers, and batteries worldwide.

RESPONSIBILITIES
Design and write software that enables rapid prototyping by development teams, while ensuring the highest levels of reliability and availability.
Work directly with our factory firmware team to provide highly available factory-facing services.
Drive the migration of large-scale, distributed fleet applications towards cloud-native microservices.
Influence architectural decisions with focus on security, scalability and high-performance.
Automate the build and deployment of infrastructure using Docker, Kubernetes & other orchestration technologies in a hybrid-cloud environment.
Setup and maintain monitoring, metrics & reporting systems for fine-grained observability and actionable alerting.
包括英文材料
SaaS+
Linux+
Kubernetes+
Terraform+
Ansible+
CI+
还有更多 •••
相关职位

logo of xiaohongshu
社招3年以上机器学习平台

【业务介绍】 我们是小红书内稠密类模型(LLM/MLLM/SD/CV/NLP)统一的AI平台QuickSilver,负责调度公司内所有稠密类模型训练与推理资源,基于自建的训推引擎,为公司所有AI算法同学迭代业务模型提供端到端一站式AI服务;包括数据管理,模型管理,模型训练、压缩、推理、部署,服务管理,资源调度等一系列能力。 工作职责: 1、负责稠密类模型训练推理开发平台的架构设计和核心功能研发 2、设计和实现大模型训练部署流程,包括模型fine-tuning、推理服务化等 3、构建云原生架构,设计高可用、高性能的微服务体系 4、优化平台性能,提升系统稳定性和可扩展性

北京|上海|深圳
logo of xiaohongshu
社招3年以上机器学习平台

1、负责模型训练平台核心功能开发和架构设计,包括传统CN/NLP/SD/LLM等多场景支持 2、负责大模型后训练工具平台化建设,包括后预训练、微调、对齐等技术落地 3、设计和实现高性能分布式训练系统,打造端到端训练解决方案 4、优化训练调度和资源管理,提升集群利用率和训练效率 5、开发模型训练监控诊断工具,建设可观测性体系

北京|上海|深圳
logo of didi
社招5年以上技术

关于我们: 滴滴国际化Fintech业务,是滴滴国际化战略的重要组成板块。近年来,滴滴Fintech在拉美地区积极探索和开展电子支付、信贷、信用卡、商户收单等业务,为当地用户带来更便捷、优质、更高性价比的金融服务。我们诚挚邀请真诚、可靠、勇于挑战的您和我们一起,携手并肩,拥抱金融出海的浪潮,和滴滴Fintech一起快速成长。 职位描述: 1、参与并完成风控平台基建研发,包括决策引擎、特征平台、核身、模型、名单、图数据库、监控平台、Databus等多个方向 2、建设提效工具,提升风控研发流程的效率。 3、积极跟其他团队沟通和配合,推动项目进展,讨论并提出有建设性的意见。

更新于 2025-09-30北京
logo of didi
社招5年以上技术

滴滴国际化Fintech业务,是滴滴国际化战略的重要组成板块。其支付业务,已经覆盖了全球十多个国家,在中国互联网公司出海中出类拔萃。 自2021年开始,滴滴Fintech在拉美地区大力发展电子支付和信贷业务。短短2年时间,其个人信贷业务,已经在墨西哥的Fintech玩家中位于第一梯队;其电子钱包业务也在巴西的Fintech玩家中也名列前茅,实现快速增长。此外,滴滴Fintech还在拉美地区积极探索和开展信用卡、商户收单等业务,为当地用户带来更便捷、优质、更高性价比的金融服务,实现多个从0到1的突破。 我们诚挚邀请真诚、 可靠、勇于挑战的您和我们一起,携手并肩,拥抱金融出海的浪潮。和滴滴Fintech一起,实现从0到1,从1到100的快速成长 职位描述: 1、参与并完成风控平台基建研发,包括决策引擎、特征平台、核身、模型、名单、图数据库、监控平台、Databus等多个方向 2、建设提效工具,提升风控研发流程的效率。 3、积极跟其他团队沟通和配合,推动项目进展,讨论并提出有建设性的意见。

更新于 2025-03-19上海