京东国际产研 高可用/AI infra研发岗
社招全职3年以上软件开发岗地点:北京状态:招聘
任职要求
1.具有 3 年以上 AI 基础架构、分布式系统、高性能计算(HPC)或大型云平台开发经验; 2.精通 Python,具备扎实的数据结构与算法功底,编码风格严谨; 3.加分项:深入理解 PyTorch/Megatron-LM/DeepSpeed 的底层实现源码及运行机制; 4.加分项:精通 GPU/NPU …
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1.参与京东跨境电商AI Infra技术架构体系建设,制定和推进架构规范的落实; 2.负责复杂技术项目的顶层方案设计,完成关键技术问题判断和事情的拆解; 3.调度系统优化:参与智算操作系统研发,优化 Kubernetes 或 Ray 的 GPU 资源调度能力; 4.高可用性保障:解决GPU集群的故障恢复(Fault Tolerance)与弹性容错(弹性 Checkpoint); 5.效能与可观测性:构建集群效能评估模型,精准度量算力资源利用率(MFU/HFU); 6.跟踪行业趋势和技术前沿,根据业务实际需求,为团队引入新技术和新方案;
包括英文材料
分布式系统+
https://www.distributedsystemscourse.com/
The home page of a free online class in distributed systems.
https://www.youtube.com/watch?v=7VbL89mKK3M&list=PLOE1GTZ5ouRPbpTnrZ3Wqjamfwn_Q5Y9A
HPC+
https://www.ibm.com/think/topics/hpc
HPC is a technology that uses clusters of powerful processors that work in parallel to process massive, multidimensional data sets and solve complex problems at extremely high speeds.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
还有更多 •••