Amazon Senior Technical Account Manager - GPU (Beijing, Shanghai, Guangzhou, Shenzhen)
Qualifications
Basic Qualifications
- 5+ years in cloud technical support, solutions architecture, or customer success management, with at least 3 years of hands-on experience in GPU/accelerated computing platforms.
- In-depth understanding of GPU instance families (e.g., AWS G/P/H series) or similar offerings from other cloud providers, AMI/driver/CUDA/container compatibility management, and cloud storage/network performance tuning (e.g., S3 I/O, EBS/Instance Store equivalents, preprocessing pipelines).
- Proficient in scheduling GPU workloads with EKS or equivalent Kubernetes-based orchestration services, including node pool tiering, resource quotas, elastic scaling, and auto-recovery strategies.
- Experienced in multi-GPU/multi-node distributed computing (NCCL, topology awareness, tensor parallelism, pipeline parallelism) with expertise in communication optimization for large-scale AI training and inference.
- Skilled in PyTorch/TensorFlow performance analysis and optimization, including DataLoader tuning, mixed precision, operator fusion, and inference acceleration toolchains (ONNX, TensorRT, CUDA Graphs).
- Experienced in cost and capacity governance, familiar with Savings Plans, RI, ODCR, Spot, Capacity Blocks, and right-sizing strategies or their equivalents in other cloud platforms.
- Demonstrated cross-functional communication and influence skills, capable of driving technical solutions with data and business objectives.
Preferred Qualifications
- AWS Solutions Architect Professional, Machine Learning Specialty, or DevOps Professional cert…
Job Responsibilities
*Hiring location: Beijing, Shanghai, Guangzhou, Shenzhen, Hong Kong (visa sponsorship provided)

Would you like to join one of the fastest-growing teams within Amazon Web Services (AWS) and help shape the future of GPU optimization and high-performance computing? Join us in helping customers across all industries to maximize the performance and efficiency of their GPU workloads on AWS while pioneering innovative optimization solutions.

As a Senior Technical Account Manager (Sr. TAM) specializing in GPU Optimization in AWS Enterprise Support, you will play a crucial role in two key missions: guiding customers' GPU acceleration initiatives across AWS's comprehensive compute portfolio, and spearheading the development of optimization strategies that revolutionize customer workload performance.

Key Job Responsibilities
- Build and maintain long-term technical relationships with enterprise customers, focusing on GPU performance optimization and resource allocation efficiency on AWS cloud or similar cloud services.
- Analyze customers' current architecture, models, data pipelines, and deployment patterns; create a GPU bottleneck map and measurable KPIs (e.g., GPU utilization, throughput, P95/P99 latency, cost per unit).
- Design and optimize GPU resource usage on EC2/EKS/SageMaker or equivalent cloud compute, container, and ML services; implement node pool tiering, Karpenter/Cluster Autoscaler tuning, auto scaling, and cost governance (Savings Plans/RI/Spot/ODCR or equivalent).
- Drive GPU partitioning and multi-tenant resource sharing strategies to reduce idle resources and increase overall cluster utilization.
- Guide customers in PyTorch/TensorFlow performance tuning (DataLoader optimization, mixed precision, gradient accumulation, operator fusion, torch.compile) and inference acceleration (ONNX, TensorRT, CUDA Graphs, model compression).
- Build GPU observability and monitoring systems (nvidia-smi, CloudWatch or equivalent monitoring tools, profilers, distributed communication metrics) to align capacity planning with SLOs.
- Ensure compatibility across GPU drivers, CUDA, container runtimes, and frameworks; standardize change management and rollback processes.
- Collaborate with cloud provider internal teams and external partners (NVIDIA, ISVs) to resolve cross-domain complex issues and deliver repeatable optimization solutions.
------------------------------------------------------
Qualifications:
• 8+ years' experience in product and program management roles driving cross-team collaboration and high-impact projects.
• Customer-centric and business-driven. Eager to help business teams win against the competition. 3+ years' experience in customer-facing program management or technical account manager roles.
• Knowledge of data center GPU architecture and operations. Familiarity with AI workloads and the software stack.
• You possess a technology background that enables you to understand the complexities of cloud architecture.
• You possess and exemplify maturity, judgment, negotiation/influence skills, analytical skills, and leadership skills.
• You have a demonstrated ability to think broadly and strategically. You are comfortable engaging senior leaders within OCI (from engineering to business) to drive results for core business initiatives.
• You work well in ambiguity and can work with your team to dive into a problem and create a solution.
• Fluency in English and Mandarin.

Preferred Qualifications
• Knowledge of technical program management for cloud infrastructure projects.
• Ability to communicate technical issues clearly to a diverse audience.
• Ability to build and maintain healthy rapport with customers.
• A proactive, problem-solving mindset.

Why Join Us?
• Be at the forefront of driving Oracle's cloud growth strategy in APAC.
• Shape the direction and success of Oracle's largest opportunities outside the US.
• Collaborate with an elite team in a fast-paced, growth-oriented environment.
• Take on a strategic and highly impactful role visible to OCI leadership.
• Gain exposure to cutting-edge cloud and AI infrastructure technologies.
• Work with the Sales, BD, and CPM teams to introduce NVIDIA technologies into assigned accounts and grow business accordingly.
• Serve as the primary technical authority on CPU technologies for NVIDIA's Chinese CSP customers, providing expert consultation on CPU selection, architecture design, and integration with NVIDIA's AI infrastructure (including Grace/Vera CPUs and NVL72 platforms).
• Lead CPU-focused technical engagements with CSPs, collaborating with their R&D, infrastructure, and AI teams to understand workload requirements (e.g., AI data preprocessing, HPC, distributed computing) and design optimized CPU-GPU integrated solutions.
• Drive CPU performance optimization for CSP workloads, conducting in-depth analysis of bottlenecks, implementing tuning strategies (including SIMD instruction set optimization and low-level intrinsics), and delivering reference implementations to unlock full platform potential.
• Act as a liaison between CSP customers and NVIDIA's global engineering, product, and R&D teams, advocating for customer-specific CPU requirements, providing feedback on product roadmaps, and ensuring alignment with NVIDIA's technical strategy and export compliance guidelines.
• Lead technical workshops, training sessions, and proof-of-concept (PoC) projects for CSPs, demonstrating the value of NVIDIA's CPU-integrated solutions and enabling customer teams to effectively leverage these technologies.
• Monitor industry trends in CPU technology, data center architectures, and CSP workload evolution, providing strategic insights to internal teams to enhance NVIDIA's CPU-related products and solutions for the Chinese market.
• Mentor junior technical team members, share CPU expertise, and drive best practices in CSP technical engagement and solution delivery.
Context: Independent account management. Manages the market-to-cash cycle, either for a small client or for a specific product, OR contributes to sales at bigger customers / strategic sales. Operates within practices and procedures, but may deviate as long as the end results meet standards of acceptability.