英伟达System Software Engineer Intern, AI Infrastructure, Summer 2026
任职要求
• Pursuing a Bachelor's degree or higher in Computer Science or other related field. • Experience with Python (required) and JavaScript, • Knowledge of software engineering principles, OOP/functional programming, and writing high-performance, maintainable code. • Practical experience in AI, machine learning, or agent frameworks (e.g., LangChain, OpenAI Functions). • Exposure to microservices, web apps, or databases (SQL/NoSQL), containers (Docker), Kubernetes, or CI/CD pipe…
工作职责
• Help design, develop, and improve scalable infrastructure to support the next generation of AI applications, including copilots and agentic tools. • Drive improvements in architecture, performance, and reliability, enabling teams to bring to bear LLMs and advanced agent frameworks at scale. • Stay informed of the latest advancements in AI infrastructure and contribute to continuous innovation.
• Build internal profiling/analysis tools for real world application perf/power analysis at system from small to large scale. • Build infrastructure or services for data visualization/mining and management. • Work with our users to build their perf/power models on top of our tools for next generation HW design.
• Implement the leading AI technologies to streamline Perflab workflows, optimizing test costs and improving efficiency. • Work with NVIDIA's global experts to tackle intriguing challenges and drive impactful project developments. • Use PerfLab's advanced infrastructure to compose, build, and maintain software systems, tools, and AI agents, providing immense value to our diverse customers.
We are now looking for a Performance Engineer Intern to support our growing investments in perf testing of various company datacenter products and applications. Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world, all while striving to deliver the highest possible performance of our products.You will be part of global Performance Lab team, improving our capacity to expertly and accurately benchmark state-of-the-art datacenter applications and products. We also work to develop infrastructures and solutions that enhance the team’s ability to gather data through automation and designing efficient processes for testing a wide variety of applications and hardware. The data that we collect drives marketing/sales collaterals as well as engineering studies for future products. You will have the opportunity to work with multi-functional teams and in a dynamic environment where multiple projects will be active at once and priorities may shift frequently. What you’ll be doing: • Benchmark, profile, and analyze the performance of AI workloads specifically tailored for large-scale LLM training and inference, as well as High-Performance Computing (HPC) on NVIDIA supercomputers and distributed systems. • Aggregate and produce written reports with the testing data for internal sales, marketing, SW, and HW teams. • Develop Python scripts to automate the testing of various applications. • Collaborate with internal teams to debug and improve performance issues. • Assist with the development of tools and processes that improve our ability to perform automated testing. • Setup and configure systems with appropriate hardware and software to run benchmarks.