logo of amd

AMD图像视频生成算法实习生 (Jan - Jun 2026)

实习兼职地点:北京状态:招聘

任职要求


Proficiency in at least one deep learning framework (such as TensorFlow, PyTorch, etc.) to design, implement, and optimize complex models. Excellent problem-solving and analytical skills, capable of working effectively and delivering results under high pressure. Strong communication abilities. Experiences about publication in top AI conference like NeurIPS/CVPR/ICLR... ACADEMIC CREDENTIALS: Bachelor’s degree in Computer/Software Engineering, Computer Science, Artificial Intelligence or related technical discipline #LI-EJ1 #LI-HYBRID

工作职责


Location: Beijing THE ROLE: AMD is looking for an AI R&D intern to join our growing team. As a key contributor you will be part of a leading team to drive and enhance AMD’s abilities to explore the highest quality, academic/industry-leading technologies. THE PERSON: The ideal candidate possesses an innovative and problem-solving mindset, has a keen eye for Software engineering development, and is diligent and passionate about Technology. A successful candidate will need to employ strong knowledge in computer technologies, and SW engineering expertise as well as a strong ability to compete effectively in a fast-paced, relevant environment while working with different teams of engineers and collaborators. KEY RESPONSIBILITIES: Research the latest advancements and technologies in Generative AI, more specifically image/video/world generation, MLLM, designing and developing innovative applications aligned with company needs. Study the SOTA generation algorithms and enhance the accuracy and performance of existing models. Explore optimized deployment approaches ensuring efficiency in production environments. Collaborate with teams, share best practices, and provide guidance and support on Generative AI technologies.
包括英文材料
开发框架+
TensorFlow+
PyTorch+
NeurIPS+
CVPR+
相关职位

logo of kuaishou
实习D12753

1、参与快手kling多模态视频生成的研发和落地工作(实习生以发论文为主),包括但不限于: t2v,i2v等基础模型研发、多模态可控视频生成编辑、世界模型等; 2、探索将多模态大语言模型mllm如deepseek/qwen相关技术与视频生成相结合,包括但不限于:提升kling视频生成的多模态理解、推理、多轮交互能力等; 3、探索将语音和视频生成相结合,包括但不限于:语音驱动的视频生成,有声视频等; 4、探索实时可拓展的多模态视频生成技术,提升多模态视频生成的质量和效率等; 5、在顶会顶刊上发表研究成果和开源代码,提升团队在多模态视频生成等领域的学术声望。

更新于 2025-09-30
logo of kuaishou
实习D12753

1、参与快手kling多模态视频生成的研发和落地工作(实习生以发论文为主),包括但不限于: t2v,i2v等基础模型研发、多模态可控视频生成编辑、世界模型等; 2、探索将多模态大语言模型mllm如deepseek/qwen相关技术与视频生成相结合,包括但不限于:提升kling视频生成的多模态理解、推理、多轮交互能力等; 3、探索将语音和视频生成相结合,包括但不限于:语音驱动的视频生成,有声视频等; 4、探索实时可拓展的多模态视频生成技术,提升多模态视频生成的质量和效率等; 5、在顶会顶刊上发表研究成果和开源代码,提升团队在多模态视频生成等领域的学术声望。

更新于 2025-09-30
logo of kuaishou
实习D0001

1、参与多模态生成算法的调研和分析,如Diffusion Models 、 GAN 、 VAE 、 Autoregressive Models等,包括但不限文本/图像/视频生成,解决生成质量、多样性、可控性、采样效率、可编辑等问题; 2、参与多模态生成算法的基础模块的研发,如 VAE、CLIP、LLM 等; 3、协助多模态生成算法的效果分析、数据优化、行业调研 等。

更新于 2025-02-12
logo of amap
实习高德地图2026

岗位职责: 我们正在寻找充满热情、富有创造力的空间重建与生成算法实习生,加入我们的前沿技术研发团队。您将专注于开发先进的空间重建与生成算法,构建下一代空间智能技术并赋能于多领域创新应用场景。 主要职责包括但不限于: 1. 协助团队完成 空间重建与生成 相关算法的预研与实现; 2. 在mentor指导下,参与视频理解与生成、视频切分、空间语义理解、空间重建等模块的开发与测试; 3. 负责多模态数据(图像/视频/点云)的标注、清洗与小规模数据集搭建; 4. 撰写实验记录与技术报告,输出可复现的实验流程与结果; 5. 跟进前沿论文与开源项目,协助完成小规模原型验证。

更新于 2025-05-29