logo of nvidia

英伟达Software Architect, Enterprise AI Software

社招全职地点:上海状态:招聘

任职要求


• 12+ years of experience designing and building large-scale, production distributed systems.
• Proven track record in a technical leadership or architect role, setting technical direction while staying hands-on with implementation.
• Deep architectural expertise in cloud-native technologies, including Kubernetes, containers, and microservices.
• Exceptional ability to coach, teach, and influence senior engineers; a passion for raising the technical bar of the entire organization.
• Strong foundation in modern software development practices, with proficiency in languages like Python for building tooling and services.
• Experience architecting solutions for GPU-accelerated or other high-performance computing workloads.
• Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to diverse audiences and drive consensus.
• A degree in Computer Science, Computer Engineering, or a related field (BS or MS) or equivalent experience.

Ways to stand out from the crowd:
• Hands-on with LLM inference stacks (Triton Inference Server, TensorRT-LLM, vLLM, FasterTransformer, KServe).
• Experience optimizing large-model serving (KV cache sharding/paging, tensor/sequence parallelism, speculative decoding, dynamic batching).
• Experience architecting next-generation container build systems or CI/CD platforms at scale.
• Background with workflow orchestration engines (e.g., Temporal, Airflow) for complex, distributed processes.
• Expertise in designing multi-tenant, multi-cluster, or edge/air-gapped deployment architectures.
We are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and creative people in the world working for us. If you're creative and autonomous with a real passion for technology, we want to hear from you.

工作职责


• Define the end-to-end technical architecture for the NIM Factory, from container build systems and CI/CD to Kubernetes deployment patterns and runtime optimization.
• Drive technical strategy and roadmap, making high-impact decisions on frameworks, technologies, and standards that empower dozens of engineering teams.
• Architect and influence the design of workflow orchestration systems that underpin the NIM factory.
• Coach and mentor senior engineers across the organization, fostering a culture of technical excellence, innovation, and knowledge sharing.
• Champion best practices in software development, including API design, automation, observability, and secure supply chain management.
• Collaborate with leadership across research, backend, SRE, and product to align technical vision with product goals and influence technical roadmaps.
包括英文材料
Kubernetes+
Python+
大模型+
TensorRT+
vLLM+
缓存+
CI+
CD+
Airflow+
相关职位

logo of amd
社招 Sales /

The Software Architect position will work at AMD RCOm Lab in Nanjing to support enterprise customers, server OEM/ODM partners of AMD as well as ISV users that build their AI solutions on AMD Instinct / Radeon GPU products. He will need to provide support to our partners and end customers via e-mail, phone call, wechat and often on site with medium travel frequencies. Excellent customer skills with a proven ability to work with demanding customers. Excellent verbal and written communication skills in English and be able to communicate in English fluently. Ability to take ownership of issues and ensure they are resolved in a timely manner. Provide technical training to customers as required. Proactive, self-direction and highly disciplined. Oversee experience is a plus. The Role: providing front-line support and working with different functional teams to solve technical issues of AMD Instinct GPU and/or Radeon GPU on AI applications including LLM and RAG etc.

更新于 2025-09-01
logo of amazon
社招Solution

Would you like to join one of the fastest-growing teams within Amazon Web Services (AWS) and shape the future of human-AI collaboration? Join us in helping customers across all industries to maximize the value and benefits of AWS Generative AI services while pioneering innovative AI agent solutions that transform how we work. As a Technical Account Manager (TAM) specializing in Generative AI in AWS Enterprise Support, you will play a crucial role in two transformative missions: guiding customers' AI initiatives across AWS's comprehensive AI/ML portfolio, and spearheading the development of collaborative AI agents that revolutionize TAM operations. This is not a sales role, but rather a unique opportunity to serve as both an AI transformation advisor and innovation leader, working with organizations ranging from start-ups to Fortune 500 enterprises. Within the Enterprise Support team, TAMs focusing on Generative AI contribute significantly to ensuring the success of key enterprise customers in their AI journey while also driving internal innovation. As a strategic expert, you'll architect AI solutions, guide technology adoption, and pioneer new ways for humans and AI to work together. This support extends from strategic planning to implementation guidance, while simultaneously developing internal AI agent solutions that showcase the art of possible in human-AI collaboration. Every day will bring new and exciting challenges on the job while you: - Act as a strategic advisor for customers' Generative AI initiatives and internal AI agent innovation - Drive the development and implementation of collaborative AI agents within the TAM organization - Lead technical discussions around AWS AI services including Bedrock, Claude, and Amazon Q. - Make recommendations on AI architecture, security, cost optimization, and operational excellence - Champion internal AI agent success stories to inspire customer innovation - Complete analysis and present periodic reviews of AI workload performance - Guide customers in developing responsible AI practices while ensuring security and compliance - Foster an ecosystem where AI and humans progress together through knowledge sharing - Work with AWS AI/ML service teams to advocate for customer needs - Participate in customer requested meetings (onsite or via phone) - Work directly with Amazon Web Service engineers to ensure rapid resolution of AI-related issues - Available in non-business hours to handle urgent issues Location: Beijing, Shanghai, Guangzhou, Shenzhen, Hong Kong (HK visa sponsorship available)

更新于 2025-07-22
logo of amazon
社招Solution

Every day will bring new and exciting challenges on the job while you: - Act as a strategic advisor for customers' Generative AI initiatives and internal AI agent innovation - Drive the development and implementation of collaborative AI agents within the TAM organization - Lead technical discussions around AWS AI services including Bedrock, Claude, and Amazon Q. - Make recommendations on AI architecture, security, cost optimization, and operational excellence - Champion internal AI agent success stories to inspire customer innovation - Complete analysis and present periodic reviews of AI workload performance - Guide customers in developing responsible AI practices while ensuring security and compliance - Foster an ecosystem where AI and humans progress together through knowledge sharing - Work with AWS AI/ML service teams to advocate for customer needs - Participate in customer requested meetings (onsite or via phone) - Work directly with Amazon Web Service engineers to ensure rapid resolution of AI-related issues - Available in non-business hours to handle urgent issues ------------------------------------------------

更新于 2025-07-16
logo of nvidia
社招

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are hiring Sr. Software Engineer who will help build simulators for our DGX Server platforms. Simulations play a significant role in building scalable systems at Speed of Light! You will work with world class engineering teams across HW and SW. What you’ll be doing: • Contribute to architect and develop simulation platform for next-gen NVIDIA DGX platforms. • Build, integrate and enhance simulator components with new HW features and write supporting technical documents. • Bring full SW stack up on DGX Simulator; work closely with hardware modeling, kernel & platform driver teams distributed globally. • Improve performance, fix bugs across user and kernel stack, and automate execution flow.

更新于 2025-09-22