快手资深游戏AI Infra工程师-【游戏事业部】
社招全职5年以上D8039地点:杭州状态:招聘
任职要求
1.5年以上相关开发经验,熟悉Go/C++/Python/Java等至少两门语言; 2.有TensorFlow、Pytorch使用或者优化经验,有机器学习平台开发经验优先; 3.了解GPU、CUDA、神经网络、RAG等技术,有大模型工程优化经验优先; 4.对AI底层硬软件基础设施,AI工程化和大模型推理优化感兴趣; 5. 了解transformer的结构,了解模型训练、微调和推理的过程和原理; 6. 有以下一项或多项技术能力者优先: 1) 有GPU、NPU、ARM、或其他AI芯片等平台上AI落地实践经验; 2)熟悉并行计算和分布式计算技术,有 MPI、OpenMP、CUDA 等编程经验,或有PTX、汇编优化方法; 3) 熟悉模型训练框架如Deepspeed、Megatron,pytorch等; 4) 熟悉TensorRT-LLM、vLLM、sglang等任一推理框架,有实际操作经验; 5) 熟悉多层次的软件优化经验,如模型剪枝/量化/压缩、分布式优化(PD分离)、存储优化、kv-cache优化、动态Batching、IO优化、算子/指令集优化等。
工作职责
1.负责AI平台架构设计和AI工程化技术实现; 2.通过AI基础设施和软硬件协同优化来提升公司AI模型训练和推理的效率; 3. 负责云侧或端侧大模型和小模型推理服务开发、性能优化、上线部署等工作。
包括英文材料
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
Megatron+
https://www.youtube.com/watch?v=hc0u4avAkuM
TensorRT+
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
vLLM+
https://www.newline.co/@zaoyang/ultimate-guide-to-vllm--aad8b65d
vLLM is a framework designed to make large language models faster, more efficient, and better suited for production environments.
https://www.youtube.com/watch?v=Ju2FrqIrdx0
vLLM is a cutting-edge serving engine designed for large language models (LLMs), offering unparalleled performance and efficiency for AI-driven applications.
缓存+
https://hackernoon.com/the-system-design-cheat-sheet-cache
The cache is a layer that stores a subset of data, typically the most frequently accessed or essential information, in a location quicker to access than its primary storage location.
https://www.youtube.com/watch?v=bP4BeUjNkXc
Caching strategies, Distributed Caching, Eviction Policies, Write-Through Cache and Least Recently Used (LRU) cache are all important terms when it comes to designing an efficient system with a caching layer.
https://www.youtube.com/watch?v=dGAgxozNWFE
相关职位
社招网易游戏(互娱)
- 参与强化学习、模仿学习、进化算法的落地工作,包括但不限于智能体、平衡性测试等; - 基于强化学习、模仿学习等AI技术为游戏产品打造更强力、更多样、更拟人的AI机器人; - 参与开发强化学习训练和部署平台。
更新于 2025-06-05
社招3-5年网易游戏(互娱)
- 参与强化学习、模仿学习、进化算法的落地工作,包括但不限于智能体、平衡性测试等; - 基于强化学习、模仿学习等AI技术为游戏产品打造更强力、更多样、更拟人的AI机器人; - 参与开发强化学习训练和部署平台
更新于 2025-08-24
社招网易伏羲
1. 参与网易伏羲游戏AI产品的质量保障,包括AIGC、自然语言、用户画像、强化学习等算法测试和性能测试; 2.对产品需求进行分析、评审,提前规避有问题需求,设计测试点、测试用例;积极发现和推动解决项目中的进度、流程、质量风险,并能对项目提出合理建议; 3. 挖掘研发和测试过程中的痛点,通过开发测试平台、工具的等方式来提升研发的质量和效率; 4. 对前沿测试技术有较高的敏感度,思考AI在测试领域的应用,能够引入并应用到工作中。
更新于 2025-04-03
社招3年以上网易游戏(雷火)
我们正在寻找一位资深的游戏AI与动作系统开发工程师,加入我们的游戏开发团队。您将负责塑造游戏中角色的刺激与反应行为,以及动作系统的设计和实现。您的工作将直接影响游戏的动态表现和玩家的沉浸感,为玩家创造一个充满活力和挑战性的游戏体验。 主要职责: 1)设计和实现游戏中角色的刺激与反应系统,确保AI角色的行为具有多样性和不可预测性。 2)开发和优化游戏动作系统,包括角色动画、运动控制和物理交互。 3)与游戏设计师紧密合作,将创意转化为可实现的AI行为和游戏机制。 4)优化AI和动作系统的性能,确保游戏在不同平台上的流畅运行。 5)跟踪最新的游戏AI技术和动作系统开发趋势,将其应用于项目中。
更新于 2025-06-03