英伟达GPU Power Analysis Intern - 2026
任职要求
• Pursuing MS or PhD in related fields. • Basic understanding of concepts of energy consumption, estimation, and low power design. • Familiarity with Verilog and ASIC design principles, including knowledge of logic cells. • Good verbal/written English and interpersonal skills; much collaboration with design teams is expected. • Strong coding skills, preferably in Python, C++. • Ability to formulate and analyze algorithms, and comment on their tim…
工作职责
• Use internally developed tools and industry standard pre-silicon gate-level and RTL power analysis tools, to help improve product power efficiency. • Develop and share best practices for performing pre-silicon power analysis, Enhance internal power tools and automate best practices • Perform comparative power analysis, to spot trends and anomalies, that warrant more scrutiny. • Interact with architects and RTL designers to help them interpret their power data and identify power bugs; drive them to implement fixes. • Select and run a wide variety of workloads for power analysis, Collaborate with performance and architecture teams to validate performance of the workloads • Prototype a new architectural feature in Verilog and analyze power.
• Work proactively on GPU/SOC feature/IP bring-up and characterization including creating validation, tuning and optimization methodologies, develop proper test plan and test cases. • Perform system level use case analysis/profiling and feature return-on-investment investigations, prototype and validation, design methods and control policies to bring the new technology into production and transcend product goals. • Collaborate across System Architecture, DFT, ASIC, SW/FW, platform, validation, and production teams throughout the product life cycle on system-level architecture, design, productization, debugging, and deployment for complex silicon designs to improve quality, safety, and manufacturability. • Design tools/script to automate product definitions, data collection, test case execution, and results analysis. Provide detailed data analysis of functionality, performance, and latency. • Hands on actions on silicon bring-up, validation, and debug; Coordinate product level feature deployment to achieve high product quality at aggressive schedule.
• Build internal profiling/analysis tools for real world application perf/power analysis at system from small to large scale. • Build infrastructure or services for data visualization/mining and management. • Work with our users to build their perf/power models on top of our tools for next generation HW design.
NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an intern deep learning system performance architect to join our AI performance modelling, analysis and optimization efforts. In this position, you will have a chance to work on DL performance modelling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads. You will make your contributions to our dynamic technology focused company. What you’ll be doing: • Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products. • Develop analytical models for the state of the art deep learning networks and algorithm to innovate processor and system architectures design for performance and efficiency. • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uni-processor and multiprocessor configurations. • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.
An exciting internship opportunity to make an immediate contribution to AMD's next generation of technology innovations awaits you! We have a multifaceted, high-energy work environment filled with a diverse group of employees, and we provide outstanding opportunities for developing your career. During your internship, our programs provide the opportunity to collaborate with AMD leaders, receive one-on-one mentorship, attend amazing networking events, and much more. Being part of AMD means receiving hands-on experience that will give you a competitive edge. Together We Advance your career! JOB DETAILS: Location: Shanghai Onsite/Hybrid: This role requires the student to work full time (40 hours a week), either in a hybrid or onsite work structure throughout the duration of the co-op/intern term. Duration: January 1, 2026 - June 30, 2026 WHAT YOU WILL BE DOING: We are seeking highly motivated AI/ML Engineering Intern to join our AMD Research team. In this role: You will develop machine learning models to optimize GPU power/performance tradeoffs using real-world silicon data. We will train you to deploy models via MLOps pipelines for AMD’s internal tools. Your responsibility will include analyzing hardware telemetry data (power, thermal, clocks) to identify efficiency bottlenecks. You will collaborate with hardware engineers to validate models on next-gen AMD GPUs. Learning Outcomes: Master GPU-accelerated ML workflows. Gain hands-on experience with industrial-scale MLOps.