T-Head (平头哥) - Chip Interconnect Design Expert - Shanghai
Qualifications
* Bachelor's degree or higher in Computer Science or Electronics Engineering; M.S. or Ph.D. preferred.
* At least 5 years of experience in computer architecture design for silicon-proven chips. Experience in the Ethernet/Switch/RDMA/RoCE sub-domains is preferred.
* Strong experience in Ethernet/RDMA/RoCE protocol and architecture design.
* Experience in one or more of the following areas: cache, NoC, coherency, virtualization, security, RAS, power management.
* Good verbal and written communication skills.
* Hands-on experience with performance simulation and analysis is a plus.
* Familiarity with the AMBA protocol and CPUs, knowledge of SerDes and PHY, and participation in chip integration or server-grade design are a plus.
Responsibilities
In this role, you will work with software and hardware engineering groups to define the next-generation inter-chip network architecture for high-performance computing SoCs in the data center.
Requirements of the Job
* Identify challenging problems and evaluate candidate solutions for next-generation data center computing.
* Strongly influence the shape of future products through advanced architecture design, serving as the interface between software and hardware.
* Document the high-level architecture specification that defines the inter-chip network subsystem for cutting-edge cloud applications.
* Work closely with the design, system, and verification teams to bring up the subsystem.
In this role, you will work with software and hardware engineering groups to define the next-generation inter-chip network architecture for high-performance AI chips and AI networks.
Requirements of the Job
* Identify challenging problems and evaluate candidate solutions for the next generation of networks for AI chips and AI Super Pods.
* Strongly influence future AI products through advanced architecture design, serving as the interface between software and hardware.
* Document the high-level architecture specification that defines the inter-chip network subsystem for AI chips.
* Participate in the front-end implementation of key subsystems.
* Provide strong technical leadership to achieve successful delivery of the final silicon product.
* Work closely with the design, system, and verification teams.
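1. Work with the architecture, software, design, and other teams to build a verification platform for high-end chip designs.
2. Own and lead the definition of verification methodology and verification strategy, and develop high-performance verification architectures.
3. Own and lead testbench (TB) development, environment development, and test vector development and debugging for data center chip interconnect verification, along with coverage collection and development of the overall DV signoff flow.
4. Own and lead the writing of chip verification documentation, the construction and implementation of the verification testbench, test plans, and related work.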
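1. Work with the architecture, software, design, and other teams to build a verification platform for high-end chip designs.
2. Own and participate in testbench (TB) development, environment development, and test vector development and debugging for verification of data-center inter-chip connections, along with coverage collection and development of the overall DV signoff flow.
3. Own and participate in the writing of chip verification documentation, the construction and implementation of the verification testbench, test plans, and related work.
4. Own and participate in defining verification methodology, and develop high-performance verification architectures.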
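Team Introduction
We are the AI chip software interconnect team at T-Head (平头哥). Our main responsibility is to actively embrace the community ecosystem and to build our own interconnect communication library on top of T-Head AI chip products. Ever-improving large models keep driving up the demand for compute, and efficient deployment of large-model training and inference depends on more and more chips being interconnected and cooperating efficiently, so that compute efficiency scales linearly. We work closely with our architecture, hardware, and model colleagues to build chips that increasingly match industry needs. We also work with server, network, and other partners to build high-performance cluster solutions based on T-Head chips. And we go deep into application scenarios to understand and meet users' needs for multi-card training and inference in performance, robustness, fault localization, and other areas, working with all parties to build the most efficient and easy-to-use software solution for T-Head multi-card products.
Job Description
1. Design and develop a high-performance, competitive interconnect communication library for the chips.
2. Push performance optimization to the limit, based on the architectural characteristics of the chip, server, and network cluster and on the usage patterns of interconnect communication.
3. Strengthen expert analysis, diagnosis, and localization capabilities for hangs and crashes in large-scale machine jobs.
4. Support the analysis and localization of user issues in multi-card and multi-machine interconnect scenarios.
5. Work closely with other teams to influence the design and evolution of chip, server, and cluster architectures.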