NVIDIA System Software Engineer Intern, Systems Infrastructure, Summer 2026
Job Requirements
NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI and enabled the next era of computing. NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to solve, that matter to the world, and that only we can tackle. This is our life’s work, to amplify human imagination and intelligence, and expand what is possible. Make the choice to join us today.

The NVIDIA Infrastructure Group is seeking world-class programmers to design, implement, and debug the next generation of large-scale, general-purpose graphics and computing chips. In this role, you will help build the core verification infrastructure that drives the development of our GPU and Tegra chips. This strongly object-oriented C++ and Python infrastructure encompasses several extensive applications that allow us to ef…

Job Responsibilities
N/A
• Help design, develop, and improve scalable infrastructure to support the next generation of AI applications, including copilots and agentic tools.
• Drive improvements in architecture, performance, and reliability, enabling teams to bring to bear LLMs and advanced agent frameworks at scale.
• Stay informed of the latest advancements in AI infrastructure and contribute to continuous innovation.
• Build internal profiling and analysis tools for performance/power analysis of real-world applications on systems ranging from small to large scale.
• Build infrastructure or services for data visualization, mining, and management.
• Work with our users to build their performance/power models on top of our tools for next-generation hardware design.
A key part of NVIDIA's strength is our unique, advanced development tools and environments, which enable our incredible pace of delivering new technology to market. We are looking for hard-working and creative people who are passionate about joining a dynamic agile software team with high production quality standards and who can help across our infrastructure. The roles below offer the opportunity to play a critical part in every stage of the development of GPU technology, to learn and improve the daily workflows of the world’s top chip designers, and to apply machine learning and deep learning to every part of our chip development pipeline. All of our roles require excellent interpersonal skills and the flexibility and adaptability to work in a dynamic environment with different frameworks and requirements.

What you’ll be doing:
• Develop the user interface and front-end application for comprehensive workflows for the development of new graphics chips.
• Work on backend and frontend design and development of proprietary web applications for hardware development.
• Interact directly with end users.
• Analyze performance bottlenecks in the workflow and application.
• Build infrastructure and microservices to support the hardware development teams.
• Assist in designing, building, and maintaining a scalable and reliable cloud infrastructure.
• Collaborate with developers, operations, and security teams to ensure that the infrastructure is performing optimally and securely.
• Monitoring and alarm systems for our cloud infrastructure, applications, and services.
• Monitor system performance, identify and resolve issues proactively, and troubleshoot incidents when they arise.
• Develop and implement automation tools to streamline processes and improve operational efficiency.
• Participate in the development of disaster recovery and business continuity plans.
• Document infrastructure and processes to ensure knowledge transfer and institutional memory.
• Stay up-to-date with emerging trends and technologies in cloud-native computing and SRE practices.