logo of microsoft

微软Sr Cloud Solution Architect -- Cloud & AI Apps

社招全职Customer Success地点:上海状态:招聘

任职要求


• Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, or related field AND 8+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting• OR equivalent experience

Preferred:

• Master’s degree or above in Computer Science, Artificial Intelligence, or related fields (PhD preferred).


• Research background in Machine Vision, NLP, or related AI domains.


• 5+ years of professional experience in AI development and applied research.


• Strong knowledge of Transformer architectures, Generative AI (GenAI), and Agentic AI.


• Hands-on experience with Retrieval-Augmented Generation (RAG) and agent-based AI applications.


• Proficiency in programming with Python and Java.


• Familiarity with containerization (Docker), Kubernetes (K8s), and cloud-native environments.


• Proven…
登录查看完整任职要求
微信扫码,1秒登录

工作职责


• Engage with customer IT and business leaders to understand their application, data, and AI priorities, and design secure, scalable solutions that drive business value and customer satisfaction.
• Lead technical engagements across architecture design, Proof of Concepts (POCs), and Minimum Viable Products (MVPs) to accelerate adoption of Azure AI, App Services, GitHub, and data platforms.
• Own the end-to-end technical delivery results, ensuring completeness and accuracy of consumption and customer success plans in collaboration with the CSAM.
• Drive next best actions and generate incremental pipeline from each engagement, aligning with Unified Enterprise Support (ES) priorities.
• Deliver repeatable intellectual property (IP) and contribute to centralized IP development to accelerate deployment and achieve targeted outcomes.
• Provide delivery oversight and escalation support for key Factory engagements across AI and App Innovation projects.
• Lead the health, resiliency, security, and optimization of mission-critical workloads, ensuring readiness for production-scale AI use cases.
• Act as the Voice of the Customer by sharing insights and feedback with engineering teams to influence product improvements and remove adoption blockers.
• Support customer skilling through technical workshops, readiness activities, and recommendations that ensure solution performance, maintainability, and reliability.
• Maintain deep technical expertise and stay current with Azure, AI, GitHub, and cloud-native development trends, while contributing to internal and external technical communities.
• Be accredited and certified to deliver with advanced and expert-level proficiency in priority workloads including Azure AI Foundry, AKS, App Service, Cosmos DB, Azure SQL, PostgreSQL, APIM, and GitHub.
• Demonstrate a growth mindset by continuously aligning your skills to customer needs, contributing to knowledge sharing, and mentoring others to accelerate customer outcomes.
包括英文材料
NLP+
Transformer+
RAG+
AI agent+
还有更多 •••
相关职位

logo of amazon
社招Solution

*Hiring location: Beijing, Shanghai, Guangzhou, Shenzhen, Hong Kong(visa sponsorship provided) Would you like to join one of the fastest-growing teams within Amazon Web Services (AWS) and help shape the future of GPU optimization and high-performance computing? Join us in helping customers across all industries to maximize the performance and efficiency of their GPU workloads on AWS while pioneering innovative optimization solutions. As a Senior Technical Account Manager (Sr. TAM) specializing in GPU Optimization in AWS Enterprise Support, you will play a crucial role in two key missions: guiding customers' GPU acceleration initiatives across AWS's comprehensive compute portfolio, and spearheading the development of optimization strategies that revolutionize customer workload performance. Key Job Responsibilities - Build and maintain long-term technical relationships with enterprise customers, focusing on GPU performance optimization and resource allocation efficiency on AWS cloud or similar cloud services. - Analyze customers’ current architecture, models, data pipelines, and deployment patterns; create a GPU bottleneck map and measurable KPIs (e.g., GPU utilization, throughput, P95/P99 latency, cost per unit). - Design and optimize GPU resource usage on EC2/EKS/SageMaker or equivalent cloud compute, container, and ML services; implement node pool tiering, Karpenter/Cluster Autoscaler tuning, auto scaling, and cost governance (Savings Plans/RI/Spot/ODCR or equivalent). - Drive GPU partitioning and multi-tenant resource sharing strategies to reduce idle resources and increase overall cluster utilization. - Guide customers in PyTorch/TensorFlow performance tuning (DataLoader optimization, mixed precision, gradient accumulation, operator fusion, torch.compile) and inference acceleration (ONNX, TensorRT, CUDA Graphs, model compression). - Build GPU observability and monitoring systems (nvidia-smi, CloudWatch or equivalent monitoring tools, profilers, distributed communication metrics) to align capacity planning with SLOs. - Ensure compatibility across GPU drivers, CUDA, container runtimes, and frameworks; standardize change management and rollback processes. - Collaborate with cloud provider internal teams and external partners (NVIDIA, ISVs) to resolve cross-domain complex issues and deliver repeatable optimization solutions. ------------------------------------------------------

更新于 2025-08-18广州|上海|北京
logo of amazon
社招Solution

Every day will bring new and exciting challenges on the job while you: - Act as a strategic advisor for customers' Generative AI initiatives and internal AI agent innovation - Drive the development and implementation of collaborative AI agents within the TAM organization - Lead technical discussions around AWS AI services including Bedrock, Claude, and Amazon Q. - Make recommendations on AI architecture, security, cost optimization, and operational excellence - Champion internal AI agent success stories to inspire customer innovation - Complete analysis and present periodic reviews of AI workload performance - Guide customers in developing responsible AI practices while ensuring security and compliance - Foster an ecosystem where AI and humans progress together through knowledge sharing - Work with AWS AI/ML service teams to advocate for customer needs - Participate in customer requested meetings (onsite or via phone) - Work directly with Amazon Web Service engineers to ensure rapid resolution of AI-related issues - Available in non-business hours to handle urgent issues ------------------------------------------------

更新于 2025-07-16北京
logo of nvidia
社招

NVIDIA is the world leader in computer graphics, PC gaming, and accelerated computing. Today, we are tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of edge computers and robotics that can understand the world. Doing what is never been done before takes vision, innovation, and the world’s best talent. At NVIDIA, our employees are passionate about accelerated computing. We're united in our quest to transform the way accelerated computing are used for work and play. Our technology impacts the large language model in daily copilot, visual experience in video game development, film production, space exploration, medicine, computational finance and automotive design. And we've only scratched the surface of what we can accomplish when we apply our technology to it. We need passionate, hard-­‐working and creative people to help us seek some of these outstanding opportunities.We are now looking for a System & Network Solution Architect to join the NVIDIA China Solution Architect team. In this role, you will engage and support design-in projects with major China OEM customers, focusing on integrating NVIDIA’s world-class networking portfolio (ConnectX, BlueField, and Spectrum Switches). As a Solution Architect, you will act as the technical bridge between NVIDIA engineering and our OEM partners. You will guide customers through the integration of next-generation networking into their server and storage platforms, ensuring seamless compatibility, performance optimization, and successful mass production. What you’ll be doing: • Lead OEM Design-in & Integration: Work closely with OEM customers to integrate NVIDIA networking products (e.g., CX8/CX9, CX6 Dx, BF3 DPUs) and Switch platforms (Spectrum-4/6 Blackbox & Whitebox) into their server lineups. • Architecture & Customization: Understand customer requirements to provide system-level architectural guidance. Lead technical discussions on system topology, thermal/mechanical constraints, firmware customization, and sideband management (NC-SI, PLDM). • System Bring-up & Support: Support customers during the bring-up phase of new server designs. Diagnose complex system-level issues involving PCIe, BIOS/BMC, firmware, and OS/Driver interactions. • Performance Optimization: Guide customers in optimizing network performance for AI, HPC, and Cloud workloads, ensuring the best integration of NVIDIA NICs and DPUs within their specific hardware environments. • Crisis Management: Handle in-depth hands-on engagement with customers to resolve critical technical blockers during the NPI (New Product Introduction) and production phases. • Cross-Functional Leadership: Collaborate with NVIDIA worldwide hardware, firmware, software, and product teams to drive customer requirements and resolve issues. Act as the technical advocate for the customer within NVIDIA.

更新于 2026-01-13北京
logo of nvidia
社招

NVIDIA networking designs and manufactures high-performance networking equipment that enable the most powerful super computers in the largest data centers in the world. With a distributed collection of NVIDIA GPUs inter-connected by networking solutions such as InfiniBand, Ethernet, or RoCE (RDMA over Converged Ethernet) we make powerful ML/AI platforms possible. We are seeking motivated, personable, and independent individuals to join our team!We seek experienced software embedded engineers to help support our groundbreaking, innovative technologies that make AI workloads in large clusters even more performant. As a networking Sr. Solutions Architect at NVIDIA you will have agency and palpable effects on the business, and work closely with customers and R&D teams. What you’ll be doing: • Support networking technologies such as Spectrum-X and work with customers on their technical challenges and requirements using said technologies during pre-sales activities • Develop proof-of-concept materials for innovative technologies for use by early adopters • Gain customers’ trust and understand their needs to help design and deploy groundbreaking NVIDIA networking platforms to run AI and HPC workloads • Address sophisticated and highly visible customer issues • Work closely with R&D teams to develop new features for customers • Help with product requirements alongside engineering and product marketing

更新于 2025-06-15北京