NVIDIA Senior Networking Software Engineer
Requirements
• Bachelor's degree or above in Computer Science, Computer Engineering, or a related discipline, or equivalent experience
• 5+ years of practical experience
• Strong C/C++ programming skills
• Experience using C/C++ in Linux (user and/or kernel mode)
• Good networking knowledge, including familiarity with L2 bridging, L3 routing, and protocols such as VLAN, STP, OSPF, BGP, and PIM
• Advantage: knowledge of RDMA, DPDK, NCCL, DeepEP, SONiC, and network virtualization
Ways to stand out from the crowd:
• Demonstrated ability to work in a cross-functional team environment
• Self-motivated, with the ability to work with little definition and supervision while multitasking and prioritizing across a number of projects and initiatives
Responsibilities
NVIDIA is seeking an experienced Software Engineer with a strong background in networking and virtualization technologies. This full-time position requires close collaboration with other NVIDIA SW and SW architecture teams, pre-sales teams, and various industry-leading architecture and R&D teams to develop leading SW solutions and stacks. The role requires strong knowledge of server/cloud networking and virtualization technologies and top SW development skills. A kernel and DPDK background is strongly preferred.
What you'll be doing:
• Work with the R&D teams of NVIDIA's top customers, becoming familiar with their applications and needs.
• Build cutting-edge POC prototypes that are later productized and delivered.
• Work closely with various development and architecture teams and take a major part in SW development.
NVIDIA Networking is a world-leading, fast-growing company that supports the most powerful supercomputers and the largest data centers in the world. We make outstanding artificial intelligence happen with NVIDIA GPUs that accelerate the computing platform and with networking solutions based on InfiniBand, Ethernet, or RoCE (RDMA over Converged Ethernet). We believe in our people and products and seek excellent people to join us!
The Networking Solutions Architects team is looking for a hardworking, keen software networking engineer to join the team and support the Spectrum-X networking platform, a revolutionary solution for building multi-tenant, hyperscale AI clouds with Ethernet. As a Networking Solutions Architect you will have a real impact on the business while working closely with our customers, marketing, and R&D teams.
What you'll be doing:
• Work as a customer technical specialist to address customer requirements and technical challenges during the pre-sales activities of the Spectrum-X solution.
• Run and own proof-of-concept activities, introducing our products and integrating them into new and existing accounts.
• Support numerous levels of software running on NVIDIA's Ethernet switches and BlueField SmartNICs.
• Debug networking and performance issues and provide solutions to customers.
• Work closely with our R&D teams to solve customer issues.
• Participate in building the SW product roadmap by providing customer product requirements and feedback to engineering and marketing teams.
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. We are looking for a Senior Software Design Engineer to join our Ethernet Switch SONiC Network OS team. As a Design Engineer at NVIDIA, you will work closely with NVIDIA's development and architecture teams and gain a deep understanding of NVIDIA products and technologies.
What you'll be doing:
• Be part of the NVIDIA SONiC R&D team and contribute code to the worldwide SONiC community.
• Design and implement robust, maintainable, and efficient automation test suites.
• Work with experienced teams that are well known in the SONiC community.
• Serve as a senior software developer in a networking-oriented software development team.
• Implement portable networking software in an embedded environment (Ethernet switch/router product line).
• Follow a full development cycle methodology (from design to final delivery to customers in high-scale data centers).
• Contribute to design reviews and product feature requirements across the whole Ethernet/NIC/DPU/Switch portfolio.
• Design and build setup topologies, with an emphasis on emulating large-scale, complex customer environments.
• Collaborate closely with multi-functional teams, including hardware engineers, software developers, and domain experts, to deliver optimized solutions that meet the demanding requirements of AI workloads.
• Design tests and mentor the test automation team in implementing them.
• Generate comprehensive test reports during release execution, assist with reproducing and debugging complex customer use cases, determine the root cause of issues, and act as the engineering PIC for the full verification cycle of customer use cases.
• Complete end-to-end test scenarios in different scopes (regression, performance, functional, and scale), report testing progress, and provide summary reports of testing activity.
• Profile, benchmark, and analyze deep learning models to identify areas for optimization and improvement in performance, efficiency, and accuracy, with a strong emphasis on networking aspects.
• Provide insights and recommendations based on the analysis of large-scale training results, focusing specifically on networking bottlenecks and optimizations, to improve model outcomes and achieve business objectives.
A key part of NVIDIA's strength is our sophisticated analysis and debugging tools, which empower NVIDIA engineers to improve the performance and power efficiency of our products and the applications running on them. We are looking for forward-thinking, hard-working, and creative people to join a multifaceted software team with high standards! This software engineering role involves developing tools for AI researchers and SW/HW teams running AI workloads in GPU clusters.
As a member of the software development team, you will work with users from different departments, such as architecture teams and software teams. Our work gives users intuitive, rich, and accurate insight into the workload and the system, and empowers them to find opportunities in software and hardware, build high-level models to propose and deliver the best hardware and software to our customers, and debug tricky failures and issues to help improve the performance and efficiency of the system.
What you'll be doing:
• Build internal profiling and analysis tools for AI workloads at large scale.
• Build debugging tools for commonly encountered problems, such as memory or networking issues.
• Create benchmarking and simulation technologies for AI systems or GPU clusters.
• Partner with HW architects to propose new features or improve existing features using real-world use cases.