平头哥平头哥-边缘AI芯片推理框架开发工程师-上海
社招全职5年以上技术-芯片地点:上海状态:招聘
任职要求
1. 电子工程,计算机等相关专业硕士及以上学历。 2. 具备3年以上AI推理优化相关工作经验,深刻理解并行计算和CUDA编程,熟悉TensorRT和TensorRT-LLM的模型部署和优化。 3. 熟练掌握Python、C/C++编程语言,掌握AI Vibe Coding…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1. 负责AI模型以及大模型(LLM、VLM、VLA) 的部署和推理优化,结合AI软硬件特性实现优化计算和推理效率优化,包括但不限于多模型部署、KV Cache 管理、 Inflight Batching 等技术。 2. 在边缘AI上适配SOTA开源框架和SOTA模型,分析解决适配过程中发现的功能、性能与精度问题,为客户提供问题支持和解决方案。 3. 深入挖掘边缘AI软件栈和系统性能瓶颈,提出软硬件的加速解决方案。
包括英文材料
学历+
CUDA+
https://developer.nvidia.com/blog/even-easier-introduction-cuda/
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA.
https://www.youtube.com/watch?v=86FAWCzIe_4
Lean how to program with Nvidia CUDA and leverage GPUs for high-performance computing and deep learning.
TensorRT+
https://docs.nvidia.com/deeplearning/tensorrt/latest/getting-started/quick-start-guide.html
This TensorRT Quick Start Guide is a starting point for developers who want to try out the TensorRT SDK; specifically, it demonstrates how to quickly construct an application to run inference on a TensorRT engine.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
还有更多 •••