logo of amd

AMDAI编译/高性能计算C++实习生 (Jan - Jun 2026)

实习兼职地点:北京状态:招聘

任职要求


You are in your Master's degree in Computer Science, Software Engineering, or a related field. You have strong programming skills in Python and C++. You have experience in compilers, parallel computing, or GPU programming. You are familiar …
登录查看完整任职要求
微信扫码,1秒登录

工作职责


An exciting internship opportunity to make an immediate contribution to AMD's next generation of technology innovations awaits you! We have a multifaceted, high-energy work environment filled with a diverse group of employees, and we provide outstanding opportunities for developing your career. During your internship, our programs provide the opportunity to collaborate with AMD leaders, receive one-on-one mentorship, attend amazing networking events, and much more. Being part of AMD means receiving hands-on experience that will give you a competitive edge. Together We Advance your career! JOB DETAILS: Location: Beijing,China Onsite/Hybrid: at least 3 days a week, either in a hybrid or onsite or remote work structure throughout the duration of the co-op/intern term. Duration: at least 6 months WHAT YOU WILL BE DOING: We are seeking highly motivated AI Compiler Software Engineering intern/co-op to join our team. In this role – We will involve you in extending Triton’s compiler infrastructure to support new AI workloads and hardware targets. We will assign you tasks to implement and optimize GPU kernels using Triton’s Python-based DSL. We will train you to analyze kernel performance using profiling tools and help you identify bottlenecks and optimization opportunities. We will understand how modern compilers translate high-level abstractions into efficient machine code.
包括英文材料
Python+
C+++
还有更多 •••
相关职位

logo of liauto
实习算法与软件

1、承接端到端自动驾驶/大语言类AI模型负载,为大算力芯片研发设计AI模型调度、编译软件栈,实现高性能推理或训练。 2、参与数据流模式时空调度建模与算法开发,支撑AI模型自动化调度到大算力芯片之上,达到较高端到端性能。 3、参与AI基本算子的开发和优化,支撑算法模型推理所需算子的功能和基本性能要求,分析性能瓶颈,构建方案极致优化。

北京
logo of liauto
实习算法与软件

1、承接端到端自动驾驶/大语言类AI模型负载,为大算力芯片研发设计AI模型调度、编译软件栈,实现高性能推理或训练。 2、参与数据流模式时空调度建模与算法开发,支撑AI模型自动化调度到大算力芯片之上,达到较高端到端性能。 3、参与AI基本算子的开发和优化,支撑算法模型推理所需算子的功能和基本性能要求,分析性能瓶颈,构建方案极致优化。

北京
logo of liauto
实习算法与软件

1.参与基于 MLIR 的空间数据流 AI 编译器开发及各类 AI 负载(含大模型)的建模与量化分析;

上海
logo of baidu
实习ACG

-结合前沿业务场景,构建昆仑芯AI大规模训练推理系统 -负责大模型分布式训练、推理框架的适配与调优,设计千卡级集群通信加速、混合精度训练等方案 -为昆仑芯AI芯片各系列高性能加速芯片提供软件栈,包括框架,图编译器以及周边产品的技术落地 -AI芯片性能深度学习高性能计算库开发,支持各种AI场景,持续提升系统效能

更新于 2025-03-17北京|上海