logo of nvidia

英伟达Performance Engineer Intern, Deep Learning and HPC - 2025

实习兼职地点:上海状态:招聘

任职要求


• Currently pursuing a Bachelor's degree (or higher) in Computer Science, Electrical Engineering, or a related field.
• Experienced in programming and debugging with scripting languages such as Python or Unix shell.
• Strong data analysis skills and the ability to summarize findings in a written report
• Hands-on experience with Linux based systems. Familiarity using a container platform such as Docker or Singularity. Experience with compiling and running software from source code.
• Fast and self-learning capabilities with strong analytical and problem-solving skills.
• Good English verbal and written interpersonal skills to improve collaboration with coworkers

Ways to stand out from the crowd:
• Background with GPU/CPU benchmarking
• Familiar with ML/DL techniques, algorithms and frameworks like TensorFlow or PyTorch.
• Experience in AI model development, training, evaluation and deployment on Cloud, Cluster or on-premises. Familiar with cloud provisioning and scheduling tools (Kubernetes, SLURM).
• Exposure to testing automation for various applications.
We have some of the most forward thinking and hardworking people in the world working for us and our best-in-class engineering teams are rapidly growing. We are building a team that will help shape the future of data center computing. If you are passionate about new technologies, care about improving efficiency and quality, and want to be at the forefront of AI & HPC & Gaming, we would love for you to join us.

工作职责


We are now looking for a Performance Engineer Intern to support our growing investments in perf testing of various company datacenter products and applications. Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world, all while striving to deliver the highest possible performance of our products.You will be part of global Performance Lab team, improving our capacity to expertly and accurately benchmark state-of-the-art datacenter applications and products. We also work to develop new scripts that enhance the team’s ability to gather data through automation and designing efficient processes for testing a wide variety of applications and hardware. The data that we collect drives marketing/sales collaterals as well as engineering studies for current and future products. You will have the opportunity to work with multi-functional teams and in a dynamic environment where multiple projects will be active at once and priorities may shift frequently.
What you’ll be doing:
• Benchmark, profile, and analyze the performance of AI workloads specifically tailored for large-scale LLM training and inference, as well as High-Performance Computing (HPC) on NVIDIA supercomputers and distributed systems.
• Aggregate and produce written and visual reports with the testing data for internal sales, marketing, SW, and HW teams
• Setup and configure systems with appropriate hardware and software to run benchmarks
• Collaborate with internal teams to debug and improve performance issues
• Develop Python scripts to automate the testing of various applications
• Assist with the development of tools and processes that improve our ability to perform automated testing
包括英文材料
Python+
Unix+
Bash+
Linux+
Docker+
TensorFlow+
PyTorch+
Kubernetes+
Slurm+
HPC+
相关职位

logo of apple
实习Machine

As a Machine Learning (ML) Engineer, you will be entrusted with the critical role of innovating and applying state-of-the-art research in ML to tackle complex data problems. The solutions you develop will significantly impact future Apple products and the broader ML development ecosystem. You will work with a multidisciplinary team to actively participate in the data-model co-design and co-development practice. Your responsibilities will extend to the design and development of a comprehensive data curation framework. You will also create robust model evaluation pipelines, integral to the continuous improvement and assessment of ML models. Additionally, your role will entail an in-depth analysis of collected data to underscore its influence on model performance. Furthermore, you will have the opportunity to showcase your groundbreaking research work by publishing and presenting at premier academic venues. Your work may span a variety of topics, including but not limited to: * Designing and implementing semi-supervised, self-supervised representation learning techniques for maximizing the power of both limited labeled data and large-scale unlabeled data. * Developing evaluation protocols centered on the end-to-end user experience, with a focus on anticipating potential failure modes, edge cases, and anomalies. * Employing data selection techniques such as novelty detection, active learning, and core-set selection for diverse data types like images, 3D models, natural language, and audio. * Uncovering patterns in data, setting performance targets, and leveraging modern statistical and ML-based methods to model data distributions. This will aid in reducing redundancy and addressing out-of-distribution samples.

更新于 2025-07-29
logo of anker
实习

About Us Anker Innovations is a technology company dedicated to creating industry-leading smart devices for entertainment, travel, and smart homes. At the forefront of AI innovation, we develop reliable, high-quality AI applications to enhance the quality of care and provide exceptional user experiences. We are seeking driven individuals who are passionate about technology to help build cutting-edge, consumer-facing solutions. Job Summary We are seeking an AI Intern with a focus on JD-AI (Joint Deep AI) Engineering to join our dynamic team. In this role, you will assist in the design, development, and deployment of deep learning models for edge IoT devices. You will collaborate with AI engineers to optimize models and applications for real-time image and video processing on embedded systems. This internship provides a unique opportunity to gain hands-on experience with cutting-edge AI technologies in the rapidly evolving field of smart devices and IoT. Key Responsibilities - Assist in the development, testing, and optimization of AI models for tasks such as object detection, segmentation, tracking, and action recognition. - Contribute to the enhancement of model performance for improved customer satisfaction and operational efficiency. - Help implement AI model compression techniques to deploy models effectively on IoT embedded platforms. - Collaborate with an agile development team to meet project requirements and deadlines. - Write clean, maintainable, and scalable code with a focus on performance and extensibility. - Support the maintenance of organized technical documentation throughout the project lifecycle.

更新于 2025-01-06
logo of nvidia
实习

We are looking for a Generative AI Intern Engineer to join the NVIDIA Developer Technology group (Devtech) and work with a team of experienced engineers on innovative uses of AI for games and content creation. The Devtech team works with NVIDIA researchers and leading game developers to bring cutting edge AI research from across NVIDIA and the industry to gamers and 3D professionals in high performance packages such as real-time inferenced graphics, physics and animations. What you’ll be doing: • Research and implement innovative generative AI algorithms for game engines and authoring tools, including real-time neural graphics, physics based animation and diffusion models. • Develop neural graphics, animation and physics models and maintain open-source projects for both game-making and user runtimes. Integrate them into mainstream game engines and DCC tools. • Use various optimization techniques, such as tensor fusion and quantization, to fit the AI models onto user devices and maximize the performance of inference for real-time gaming. • Collaborate with game developers on optimizations and improvements for specific GenAI applications. • Interact closely with the architecture and driver teams at NVIDIA in ensuring the best possible experience on current generation hardware, and on determining trends and features for next generation architectures.

更新于 2025-09-26
logo of apple
实习Students

• This role will involve in Software and Hardware R&D from initial phase, working on different wireless technologies(including 5G, LTE, NFC, GNSS technology, WiFi/Bluetooth, UWB) from design/prototype phase till finalization and post production/launch to end users. • You will work with our excellent Apple Engineering team on innovative project, in which you will be deeply involved into Product design, feature investigation, evaluation them in real world. • This includes to study and qualify relevant features at system level, from functional and performance perspective. Run focused experiments to understand and identify root cause of any issues and raise proposal of innovative ideas to influence the design from user experience point of view. • Assist to build up comprehensive database with various data from live network and real user scenarios • Work closely with Experienced Field Design Engineers and multi-functional teams to deep dive the data • Develop models based on Large-Scale data to predict potential issues, to support decision-making processes across teams

更新于 2025-10-15