logo of nvidia

英伟达Developer Technology Engineer - AI

社招全职地点:上海 | 北京 | 深圳状态:招聘

任职要求


• A degree from university in an engineering or computer science related discipline (BS; MS or PhD preferred).
• 2+ working experience is required.
• Strong knowledge of C/C++ and/or Fortran.
• Deep knowledge of software design, programming techniques, and algorithms.
• Expert knowledge of LLM training/inference optimization, including but not limited to development and optimization experience in distributed tr…
登录查看完整任职要求
微信扫码,1秒登录

工作职责


• Working directly with key application developers (especially LLM) to understand the current and future problems they are solving, creating and optimizing core parallel algorithms and data structures to provide the best solutions using GPUs, through both library development and direct contribution to the applications. This includes training and inference optimization for large language models, directly contributing to frameworks such as Megatron and TRTLLM, SGLang, vLLM...
• Collaborating closely with the architecture, research, libraries, tools, and system software teams at NVIDIA to influence the design of next-generation architectures, software platforms, and programming models, including by investigating impact on application performance and developer productivity.
• Engaging in deep optimization of high-performance operators, involving but not limited to CUDA deep optimization, instruction and compiler optimization. These optimizations will directly support customers or be integrated into products like cuDNN, cuBLAS, and CUTLASS...
• Some travel is required for conferences and for on-site visits with developers.
包括英文材料
C+
Fortran+
还有更多 •••
相关职位

logo of nvidia
社招

• Study and develop cutting-edge techniques in deep learning, graphs, machine learning, and data analytics, and perform in-depth analysis and optimization to ensure the best possible performance on current- and next-generation GPU architectures. • Work directly with key customers to understand the current and future problems they are solving and provide the best AI solutions using GPUs. • Collaborate closely with the architecture, research, libraries, tools, and system software teams at NVIDIA to influence the design of next-generation architectures, software platforms, and programming models.

更新于 2025-07-08北京|上海|深圳
logo of nvidia
社招

• Understand the responsibilities associated with embodied AI and strive to enhance them. • Develop on frameworks like IsaacSim and Isaac Lab, ensuring flawless performance. • Profile and investigate the performance of optimized code together with our internal team. • Discuss your approach and results with NVIDIA engineers to continuously improve processes. • Optimize GPU-based physics simulator performance for world-class results. • Collaborate closely with architecture, research, libraries, tools, and system software teams to invent and develop next-generation architectures, software platforms, and programming models.

更新于 2025-11-14上海|北京
logo of nvidia
社招

N/A

更新于 2025-10-10北京|上海
logo of nvidia
社招

• Study and develop cutting-edge techniques in CUDA programming, profiling, optimization. Application domains include deep learning, graphic, machine learning, and data analytics, and perform in-depth analysis and optimization to ensure the best possible performance on current- and next-generation GPU architectures. • Work directly with key customers to understand the current and future problems they are solving and provide the best AI solutions using GPUs. • Collaborate closely with the architecture, research, libraries, tools, and system software teams at NVIDIA to influence the design of next-generation architectures, software platforms, and programming models.

更新于 2025-11-11北京|上海