虎牙AI Infra 工程师(推理集群方向)
社招全职MJ004366地点:广州状态:招聘
任职要求
* 计算机相关专业,本科及以上学历 * 熟练掌握至少一门语言:C++ / Python / Go * 熟悉 Linux 系统及网络基础 核心能力: 熟悉 AI 推理框架或引擎,如: * PyTorch / TensorFlow * ONNX Runtime * 熟悉 GPU 架构及 CUDA 编程,了解显存管理和并行计算 * 有大规模分布式系统或集群经验(如 Kubernetes) * 熟悉模型部署流程(训练 → 导出 → 推理服务) 加分项 * 有 LLM…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
职位概述:负责公司大规模 AI 推理集群的设计、建设与优化,支撑模型在线服务(LLM / CV / 推荐等)的高性能、低延迟与高可用运行。你将深度参与从模型部署到系统调优的全链路基础设施建设。 * 负责 AI 推理集群(GPU/CPU)的架构设计与落地,包括资源调度、服务部署、弹性扩缩容等 * 搭建和维护模型推理服务框架(如 SGlang、TensorRT、vLLM 等) * 优化推理性能(延迟、吞吐、成本),包括: * 模型量化(INT8/FP16/FP8) * Kernel 优化 / CUDA 调优 * Batch 策略 / KV Cache 优化(LLM场景) * 构建高可用推理服务体系(灰度发布、A/B、自动回滚) * 设计和实现推理调度系统(多模型、多租户、优先级控制) * 与算法团队协作,将模型高效部署上线并持续优化 * 构建监控与观测体系(QPS、Latency、GPU 利用率等) * 推进推理成本优化(算力利用率、Spot实例、混部等)
包括英文材料
学历+
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
学习Golang的完整教程!从开始到结束不到一个小时,包括如何在Go中构建API的完整演示。没有多余的内容,只有你需要知道的知识。
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
还有更多 •••