英伟达Senior Software Test Development Engineer - Deep Learning
任职要求
• BS or higher degree in CS/EE/CE or equivalent. • 5+ years of software quality assurance or test automation background with knowledge of test infrastructure and strong analysis skills. • Scripting language (Python, Bash) knowledge and UNIX/Linux experience. • Good Python software development or test development experience. • Good user/development experience of virtualization like VM & Docker container & k8s • Excellent English written and oral communication skills. • Multiple GPUs P2P workload developing/testing • Able to juggle conflicting/changing priorities and maintain a positive attitude while experiencing challenging and dynamic schedules. • Proven success in leveraging AI (development) tools to significantly improve eff…
工作职责
We are looking for a Software Test development engineer in NVIDIA’s AI SWQA team.The position is in NVIDIA AI Software Quality Assurance team that defines, develops and performs tests to validate robustness and measure the performance of NVIDIA‘s AI software and GPU Infrastructure for autonomous driving, healthcare, speech recognition, natural language processing, and a wide variety of other AI scenarios. This team collaborates with multiple AI product teams to develop new products; derive and improve complex test plans; and improve our workflow processes for a diverse range of GPU computing platforms. You should grow with being in the critical path supporting developers working for billion-dollar business lines as well as intimately understanding the values of responsiveness, thoroughness and teamwork. You should constantly foster and implement efficiency improvements across your domain. Join the team which is building software which will be used by the entire world! What you’ll be doing: • Work closely with global cross-functional teams to understand the test requirements and take ownership of product quality. • Plan/design/execute/report/automate test plan/test case/test reports. • Manage bug lifecycle and co-work with inter-groups to drive for solutions. • Automate test cases and assist in the architecture, crafting and implementing of test frameworks. • In-house repro and verify customer issues/fixes.
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. We are looking for a Senior Software Design Engineer to join our Ethernet Switch SONiC Network OS team. As a Design Engineer at NVIDIA, you will work closely with NVIDIA’s development and architecture teams and gain deep understanding of NVIDIA products and technologies. What you’ll be doing: • Be part of NVIDIA SONiC R&D team, contribute code to SONiC worldwide community. • Design and implement robust, maintainable, and efficient automation test suite. • Work with experienced teams which are well known in the SONiC community. • Senior software developer as part of a networking oriented software development team. • Portable networking software implementation in an embedded environment (Ethernet switch/router product line) • Full development cycle methodology (from design to final delivery to customer in high scale data centers)
NVIDIA is now looking for LLM Train Framework Engineers for the Megatron Core team. Megatron Core is open-source, scalable, and cloud-native frameworks built for researchers and developers working on Large Language Models (LLM) and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, alignment, customization, evaluation, deployment, and tooling to optimize performance and user experience. Build on Megatron Core Framework's capabilities by inventing advanced distributed training algorithms and model optimizations. Collaborate with partners to implement optimized solutions. What you’ll be doing: • Build and develop open source Megatron Core. • Address extensive AI training and inference obstacles, covering the entire model lifecycle including orchestration, data pre-processing, conducting model training and tuning, and deploying models. • Work at the intersection of AI applications, libraries, frameworks, and the entire software stack. • Spearhead advancements in model architectures, distributed training strategies, and model parallel approaches. • Enhance the pace of foundation model training and optimization through mixed precision formulas and advanced NVIDIA GPU structures. • Performance tuning and optimizations of deep learning framework and software components. • Research, prototype, and develop robust and scalable AI tools and pipelines.
Maturing and productizing new features entails a diversity of activities, including:• Architecting new designs to enable new functions or to improve performance. • Leading engineering efforts to develop, tune, and verify algorithms and software using fundamental physics, control systems, planning algorithms, and/or vehicle dynamics. • Debugging and addressing different issues identified in simulations and in test drives. • Collaborating with our globally distributed team to enhance the software architecture, improving development processes and tooling • Defining and verifying product requirements through detailed analysis, simulation, in-car testing, and benchmarking existing products. • Maturing prototype software to production quality.
We are looking for a Software Test development engineer in NVIDIA’s AI SWQA team.The position is in NVIDIA AI Software Quality Assurance team that defines, develops and performs tests to validate robustness and measure the performance of NVIDIA‘s AI software and GPU Infrastructure for autonomous driving, healthcare, speech recognition, natural language processing, and a wide variety of other AI scenarios. This team collaborates with multiple AI product teams to develop new products; derive and improve complex test plans; and improve our workflow processes for a diverse range of GPU computing platforms. You should grow with being in the critical path supporting developers working for billion-dollar business lines as well as intimately understanding the values of responsiveness, thoroughness and teamwork. You should constantly foster and implement efficiency improvements across your domain. Join the team which is building software which will be used by the entire world! What you’ll be doing: • Work closely with global cross-functional teams to understand the test requirements and take ownership of product quality. • Plan/design/execute/report/automate test plan/test case/test reports. • Manage bug lifecycle and co-work with inter-groups to drive for solutions. • Automate test cases and assist in the architecture, crafting and implementing of test frameworks. • In-house repro and verify customer issues/fixes.