AMD Operator Development Intern (AI/ML Kernel Optimization Intern)
Requirements
You are currently enrolled in a Master's program in Computer Science, Computer Engineering, or a related field at a China-based university. If you have knowledge of or experience with any of the following technical skills (or related areas) and are enthusiastic about this role, we strongly encourage you to apply:
• Machine Learning & Data Science: Exposure to machine learning algorithms, data analysis, computer vision, etc. through coursework or projects.
• Programming Languages: Strong programming skills in C++ with a focus on writing clean, efficient, and scalable code.
• Machine Learning Frameworks: Practical experience with machine learning librar…
Responsibilities
An exciting internship opportunity to make an immediate contribution to AMD's next generation of technology innovations awaits you! We have a multifaceted, high-energy work environment filled with a diverse group of employees, and we provide outstanding opportunities for developing your career. During your internship, our programs provide the opportunity to collaborate with AMD leaders, receive one-on-one mentorship, attend networking events, and much more. Being part of AMD means receiving hands-on experience that will give you a competitive edge. Together We Advance your career!

JOB DETAILS:
• Location: Shanghai, China
• Onsite/Hybrid: This role requires the student to work at least 3 days per week, either in a hybrid (minimum 3 days in office) or onsite work structure, throughout the duration of the co-op/intern term.
• Duration: Jan - June 2026

WHAT YOU WILL BE DOING:
We are seeking a highly motivated Machine Learning (ML)/Artificial Intelligence (AI) intern/co-op to join our team and contribute to the development of next-generation product differentiation features alongside expert ML/AI engineers. In this role, you will:
• Gain hands-on experience with cutting-edge technologies in ML, AI, and High-Performance Computing.
• Learn to analyze and optimize GPU kernels to maximize performance for specific AI operations.
• Contribute to projects such as researching, developing, and deploying machine learning and computer vision solutions for AMD's current and future products.
• Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs.
• Design and optimize deep learning models specifically for AMD GPU performance.
• Assist AI software teams with roadmap planning, collateral development, and customer engagements.
• Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream.
• Apply sound engineering principles to ensure robust, maintainable solutions.
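[Job Description]
1. Design and implement the business systems of the machine learning platform, including AI infrastructure such as toolchains and components, and deliver the required business features;
2. Efficiently optimize and deploy business models for computer vision, speech recognition, speech synthesis, and natural language processing;
3. Work closely with the company's algorithm teams to analyze business performance bottlenecks and system architecture characteristics, and optimize across software and hardware to achieve peak performance.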
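1. Take part in developing and optimizing Kuaishou's large-scale deep learning inference framework, ensuring high availability and high concurrency for online systems and providing efficient, stable compute for the hundreds of millions of Kuaishou Search users;
2. Own inference optimization for Kuaishou Search models, improving inference performance so that model inference services run with high throughput and low latency;
3. Support model optimization for deploying large models in search scenarios, including but not limited to AI retrieval and query rewriting.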
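We are the Network Technology team at Ant Group. We provide stable, efficient network infrastructure products, platforms, and services that unify general-purpose and intelligent computing for all of Ant Group.
● Design and develop collective communication libraries;
● Improve large-model training efficiency and reduce inference cost through stability engineering and communication optimization;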
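1. Embedded AI system development:
• Develop multimodal AI terminal products on RTOS platforms, including solution evaluation, software architecture design, and the development and deployment of core functional modules (such as face/gesture recognition and behavior analysis);
• Lead on-device AI model compression, adaptation of cross-platform inference frameworks (TensorFlow Lite/MNN/NCNN), and NPU performance optimization (such as memory, power consumption, and real-time behavior);
• Design lightweight model architectures around hardware characteristics, and deliver the full pipeline from algorithm training to embedded on-device deployment.
2. Multimodal algorithm engineering:
• Improve how computer vision algorithms perform on embedded devices (IoT/AR hardware/AI robots), solving the engineering challenges of low-compute, high-latency, high-interference scenarios;
• Develop adaptation solutions for chip operator libraries, and take part in chip selection, AI toolchain optimization, and device-cloud collaborative architecture design;
• Explore innovative applications of multimodal interaction (vision + voice + sensors) in smart devices, such as AI toys and companion robots.
3. Cross-team collaboration and delivery:
• Work with chip vendors, algorithm teams, and hardware teams, leading on-device SDK integration and performance tuning to ensure products ship on schedule;
• Support mass-production rollout, ensuring system stability and a good user experience.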