英伟达Senior Software Engineer, Enterprise AI Software
任职要求
• A history of using advanced programming skills to build distributed compute systems, backend services, microservices, and cloud technologies. • Experience productionizing and deploying LLM models. • Effective experience working with multi-functional teams, principals, and architects across organizational boundaries. • Mentorship and the ability to grow teams and team members. • Deep technical expertise in distributed containerized applications using Docker, Kubernetes, Helm Charts. • Passion for building scalable and performant microservice applications. • Excellent interpersonal skills and the flexibility to lead multi-functional efforts. • Proven experience debugging and analyzing the performance of distributed microservices or cloud systems. • A degree in Computer Science, Computer Engineering, or a relate…
工作职责
• Design, build, and optimize containerized inference execution for LLM applications, ensuring efficiency and scalability. These applications may run in container orchestration platforms like Kubernetes to enable scalable and robust deployment. • Ensure the performance and scalability of NIMs through comprehensive performance measurement and optimization. • Apply container expertise to create and optimize the basic building blocks of NIMs, influencing the development of many models and related products within NVIDIA. • Collaborate, brainstorm, and improve the designs of inference solutions and APIs with a broad team of software engineers, researchers, SREs, and product management. • Mentor and collaborate with team members and other teams to foster growth and development. Demonstrate a history of learning and enhancing both personal skills and those of colleagues.
TCS(Tencent Cloud-native Suite) is a Cloud-Native Platform for Enterprise Cloud-Native Transformation supports on-premise bare metal or third-party IaaS deployments. Product Solutions Architect (PDSA) is a key pre-sales technical position in Tencent Cloud - TCS product team. Our PDSAs are experienced solution architects with professional knowledge and industry insight and are, ultimately, the pivotal role in supporting the global sales team by providing cloud-native solutions, striving to meet product sales targets, and leading key projects for customer onboarding. Job Responsibilities: ● Acting as a subject matter expert on Tencent Cloud - TCS products (container, microservice, message queue), providing training and enablement to the account and Solution Architect (SA) teams. ● Supporting regional events and marketing programs to promote Tencent TCS solutions to potential customers. ● Delivering optimized solutions and advocating best practices to customers by utilizing the full capabilities of Tencent TCS products. ● Gathering and analyzing customer feedback to enhance product offerings, improve market competitiveness and help build the product roadmap continually. ● Monitoring key industry trends and technological shifts, offering trusted advice to customers for optimizing their IT infrastructure and improving their user experience. ● Expanding the product ecosystem through collaboration with global channels, industrial partners and third-party Independent Software Vendors (ISVs). Creating position papers, and aligning advocacy efforts with Tencent Cloud’s core business policies across multiple departments (business, security, legal, and government affairs). ● Reporting to the Head of the product architect team, this role requires a dynamic, curious, and technology-driven candidate who thrives in challenging environments and can navigate ambiguity with strategic insight.
Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing! An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join the team and see how we can make a lasting impact on the world.NVIDIA is hiring senior software engineers in its Infrastructure, Planning and Process Team (IPP), to accelerate AI adoption across various engineering workflows within the company. IPP is a global organization within NVIDIA. The group works with various other teams within NVIDIA such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure and software development workflow needs. As a senior engineer on AI Workflow, you will create and establish tools and software solutions that leverage Large Language Models and agentic AI to automate end to end software engineering workflows and enhance the productivity of engineers across NVIDIA. What you’ll be doing: • Develop and implement solutions throughout software development lifecycles to improve developer efficiency, accelerate feedback loops, and boost release reliability • Experience designing, developing, and deploying AI agents to automate software development workflows and processes. • Continuously measure and report on the impact of AI interventions, showing progress in metrics such as cycle time, change failure rate, and mean time to recovery (MTTR). • Build and deploy predictive models to identify high-risk commits, forecast potential build failures, and flag changes that have a high probability of failures. • Research emerging AI technologies and engineering best practices to continuously evolve our development ecosystem and maintain a competitive edge.
NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are hiring Sr. Software Engineer who will help build simulators for our DGX Server platforms. Simulations play a significant role in building scalable systems at Speed of Light! You will work with world class engineering teams across HW and SW. What you’ll be doing: • Contribute to architect and develop simulation platform for next-gen NVIDIA DGX platforms. • Build, integrate and enhance simulator components with new HW features and write supporting technical documents. • Bring full SW stack up on DGX Simulator; work closely with hardware modeling, kernel & platform driver teams distributed globally. • Improve performance, fix bugs across user and kernel stack, and automate execution flow.
• Define the end-to-end technical architecture for the NIM Factory, from container build systems and CI/CD to Kubernetes deployment patterns and runtime optimization. • Drive technical strategy and roadmap, making high-impact decisions on frameworks, technologies, and standards that empower dozens of engineering teams. • Architect and influence the design of workflow orchestration systems that underpin the NIM factory. • Coach and mentor senior engineers across the organization, fostering a culture of technical excellence, innovation, and knowledge sharing. • Champion best practices in software development, including API design, automation, observability, and secure supply chain management. • Collaborate with leadership across research, backend, SRE, and product to align technical vision with product goals and influence technical roadmaps.