英伟达Computer Architecture Intern - LLM, 2026
任职要求
• Proven experience in software engineering, particularly in GPU programming and LLM inference. • Strong proficiency in programming languages such as Python, C++, and CUDA. • A solid understanding of deep learning frameworks and techniques. • Outstanding problem-solving skills and the ability to work collaboratively in a tea…
工作职责
• Develop and refine software solutions to expedite LLM SW stack (could be within inference/post train or pre-train phase) by harnessing the power of GPU technology. • Collaborate closely with a world-class team of engineers to implement and refine GPU-based algorithms. • Analyze and determine the most effective methods to improve performance, ensuring seamless execution across diverse computing environments. • Engage in both individual and team projects, contributing to NVIDIA's mission of leading the AI revolution. • Work in an empowering and inclusive environment to successfully implement groundbreaking AI solutions.
NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an intern deep learning system performance architect to join our AI performance modelling, analysis and optimization efforts. In this position, you will have a chance to work on DL performance modelling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads. You will make your contributions to our dynamic technology focused company. What you’ll be doing: • Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products. • Develop analytical models for the state of the art deep learning networks and algorithm to innovate processor and system architectures design for performance and efficiency. • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uni-processor and multiprocessor configurations. • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.
We are now looking for a GeForce/ProViz Performance Engineer Intern! This position offers the chance to create a significant impact in a dynamic, technology focused company. As a member of the Performance Lab team, you will reach firsthand GPUs and optimize performance from designing stage till whole product lifetime, architectures to extend the state of the art in Gaming, Professional Visualization, Cloud Gaming, Data Center efficiency and performance. What you’ll be doing: • Identify, run graphics, studio and WinAI benchmarks across servers, PCs, workstations and laptops. • Compose competitive analysis reports for internal and external customers to position NVIDIA products appropriately using their evaluation. • Develop and maintain automation scripts for games/studio/WinAI performance and system monitoring data collection on Windows and Linux to speed up providing business and engineering insights. • Develop, implement and maintain tools to improve testing efficiency.
NVIDIA is leading company of AI computing. At NVIDIA, our employees are passionate about AI, HPC , VISUAL, GAMING. Our Solution Architect team is more focusing to bring NVIDIA new technology into difference industries. We help to design the architecture of AI computing platform, analyze the AI and HPC applications to deliver our value to customers. This role will be instrumental in leveraging NVIDIA's cutting-edge technologies to optimize open-source and proprietary large models, create AI workflows, and support our customers in implementing advanced AI solutions. What you’ll be doing: • Drive the implementation and deployment of NVIDIA Inference Microservice (NIM) solutions • Use NVIDIA NIM Factory Pipeline to package optimized models (including LLM, VLM, Retriever, CV, OCR, etc.) into containers providing standardized API access • Refine NIM tools for the community, help the community to build their performant NIMs • Design and implement agentic AI tailored to customer business scenarios using NIMs • Deliver technical projects, demos and customer support tasks • Provide technical support and guidance to customers, facilitating the adoption and implementation of NVIDIA technologies and products • Collaborate with cross-functional teams to enhance and expand our AI solutions