英伟达Software Engineer, LLM Inference

社招全职2025-09-29地点：上海状态：招聘

扫码手机上打开

任职要求

• Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)
• 3+ years of relevant software development experience.
• Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
• Strong curiosity about artificial intellig…

登录查看完整任职要求

微信扫码，1秒登录

工作职责

NVIDIA has continuously reinvented itself over two decades. NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world.This is our life’s work — to amplify human imagination and intelligence. AI becomes more and more important in Auto Driving and AI City. NVIDIA is at the forefront of the Auto Driving and AI City revolution and providing powerful solutions for them. All these solutions are based on GPU-accelerated libraries, such as CUDA, TensorRT and V/LLM inference framework etc. Now, we are now looking for an LLM inference framework developer engineer based in Shanghai.
What you’ll be doing
:• Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance
• Performance analysis, optimization and tuning
• Closely follow academic developments in the field of artificial intelligence and feature update
• Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams

📮 投递简历 ✨AI模拟面试

难度：

包括英文材料

C+

PyTorch+

相关职位

Compute Architecture Software Engineer

社招

• You will develop and optimize software solutions to accelerate LLM inference using GPU technology. • Collaborate closely with a world-class team of engineers to implement and refine GPU-based algorithms. • Analyze and determine the most effective methods to improve performance, ensuring seamless execution across diverse computing environments. • Engage in both individual and team projects, contributing to NVIDIA's mission of leading the AI revolution. • Work in an empowering and inclusive environment to successfully implement groundbreaking AI solutions.

更新于 2025-09-25上海

Principal Software Engineer

社招Software

- Keep up to date with and utilize the latest developments in LLM system optimization.- Take the lead in designing innovative system optimization solutions for internal LLM workloads.- Optimize LLM inference workloads through innovative kernel, algorithm, scheduling, and parallelization technologies.- Continuously develop and maintain internal LLM inference infrastructure.- Discover new LLM system optimization needs and innovations.

更新于 2025-10-17北京

Senior Software Engineer

社招Software

- Keep up to date with and utilize the latest developments in LLM system optimization.- Discover/solve impactful technical problems, advance state-of-the-art LLM technologies, and translate ideas into production.- Optimize LLM inference workloads through innovative kernel, algorithm, scheduling, and parallelization technologies.- Continuously maintain internal LLM inference infrastructure.

更新于 2025-10-17北京

Senior Software Test Development Engineer - Deep Learning

社招

We are looking for a Software Test development engineer in NVIDIA’s AI SWQA team.The position is in NVIDIA AI Software Quality Assurance team that defines, develops and performs tests to validate robustness and measure the performance of NVIDIA‘s AI software and GPU Infrastructure for autonomous driving, healthcare, speech recognition, natural language processing, and a wide variety of other AI scenarios. This team collaborates with multiple AI product teams to develop new products; derive and improve complex test plans; and improve our workflow processes for a diverse range of GPU computing platforms. You should grow with being in the critical path supporting developers working for billion-dollar business lines as well as intimately understanding the values of responsiveness, thoroughness and teamwork. You should constantly foster and implement efficiency improvements across your domain. Join the team which is building software which will be used by the entire world! What you’ll be doing: • Work closely with global cross-functional teams to understand the test requirements and take ownership of product quality. • Plan/design/execute/report/automate test plan/test case/test reports. • Manage bug lifecycle and co-work with inter-groups to drive for solutions. • Automate test cases and assist in the architecture, crafting and implementing of test frameworks. • In-house repro and verify customer issues/fixes.

更新于 2025-11-05上海