
NVIDIA CPU Performance Developer Technology Engineer

Experienced hire | Full-time | Location: Shanghai | Beijing | Shenzhen | Status: Hiring

Requirements


• BS, MS, or PhD in Computer Science, Computer Engineering, or a related field.
• 5+ years of relevant experience in performance engineering or CPU optimization.
• Strong programming proficiency in C/C++ and/or Python, with a deep understanding of algorithms and software architecture.
• Solid grasp of CPU microarchitecture, performance analysis tools, and optimization methodologies.
• Proven track record of CPU benchmarking and bottleneck-driven performance tuning.
• Excellent communication and organizational skills, wit…

Responsibilities


• Collaborate with developers, researchers, and framework maintainers across industries to identify and resolve performance challenges in diverse workloads such as AI, data analytics, simulation, and numerical computing.
• Profile, analyze, and optimize CPU performance from application-level algorithms down to low-level microarchitecture.
• Contribute to open-source frameworks, key software stacks, reference implementations, and performance libraries to unlock full CPU potential.
• Work closely with NVIDIA’s architecture, research, libraries, tools, and system software teams to improve our overall platform performance.
• Provide insights that shape next-generation CPU designs, compiler toolchains, and development workflows for better developer productivity and throughput.
Related positions

T-Head
Experienced hire | 5+ years of experience | Technology: Chips

1. Help design and implement the inference engine SDK, improving inference performance, usability, and product stability.
2. Help design and implement AI compilation for the inference engine, including graph fusion, other graph optimizations, operator (kernel) optimization, and auto-tuning.
3. Help design and implement the inference engine's runtime system, including memory management and resource management, so that it runs efficiently and stably.
4. Help design and implement inference optimizations for large language models (LLMs): develop and apply LLM inference optimization techniques on top of the inference engine.

Updated 2025-09-15 | Hangzhou
T-Head
Experienced hire | 5+ years of experience | Technology: Chips

1. Help design and implement the inference engine SDK, improving inference performance, usability, and product stability.
2. Help design and implement AI compilation for the inference engine, including graph fusion, other graph optimizations, operator (kernel) optimization, and auto-tuning.
3. Help design and implement the inference engine's runtime system, including memory management and resource management, so that it runs efficiently and stably.
4. Help design and implement inference optimizations for large language models (LLMs): develop and apply LLM inference optimization techniques on top of the inference engine.

Updated 2026-06-09 | Shanghai
T-Head
Experienced hire | 5+ years of experience | Technology: Chips

1. Build a world-class deep learning hardware computing platform. Track the latest developments in deep learning algorithms and in accelerator and system hardware architecture, and architect and design high-performance, low-power architectures, chips, and hardware products.
2. Address the needs of Alibaba Group's business growth. Collaborate with Alibaba's algorithm and business teams to plan and design heterogeneous computing hardware and software product architectures that match the business.
3. Own front-end design quality checks and reviews, and collaborate with the back-end flow, so that the physical design team receives high-quality RTL.

Updated 2026-01-06 | Shanghai
AMD
Internship

An exciting internship opportunity to make an immediate contribution to AMD's next generation of technology innovations awaits you! We have a multifaceted, high-energy work environment filled with a diverse group of employees, and we provide outstanding opportunities for developing your career. During your internship, our programs provide the opportunity to collaborate with AMD leaders, receive one-on-one mentorship, attend amazing networking events, and much more. Being part of AMD means receiving hands-on experience that will give you a competitive edge. Together We Advance your career!

DESCRIPTION OF DUTIES IN ADDITION TO THOSE IN JOB DESCRIPTION:

Positions (either of the areas below): Validation Power & Performance Debug Platform Design

Key Responsibilities
• Verify and validate hardware products designed and developed in house at AMD (ATI) using the standard qualification process developed within the hardware qualification group, and participate in creating and improving all phases of test procedures and methodologies.
• Work closely with the internal data-producing team and the external automation framework team to develop an engineering experience-based power and performance data analytics process and visualization structure.
• Work closely with the SoC design team and the program lead team to understand performance targets and validation methodology for APU/GPU components and the overall reference design system.
• Develop system-level and component-level performance test strategies and plans.
• Compose validation reports and propose improvements to future test plans.
• Ensure that failures found in testing are real issues, and support the internal design team in finding their root cause.
• Execute and/or automate tests under the team lead's instruction, and be able to work independently.
• Communicate with various internal departments to resolve anomalies.
• Participate in issuing internal/external releasable test reports.
• Able to work 3-5 days per week for 4-6 months.

Updated 2025-10-15 | Shanghai