AMD图像视频生成算法实习生 (Jan - Jun 2026)
实习兼职地点:北京状态:招聘
任职要求
Proficiency in at least one deep learning framework (such as TensorFlow, PyTorch, etc.) to design, implement, and optimize complex models. Excellent problem-solving and analytical skills, capable of working effectively and delivering results under high pressure. Strong…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
Location: Beijing THE ROLE: AMD is looking for an AI R&D intern to join our growing team. As a key contributor you will be part of a leading team to drive and enhance AMD’s abilities to explore the highest quality, academic/industry-leading technologies. THE PERSON: The ideal candidate possesses an innovative and problem-solving mindset, has a keen eye for Software engineering development, and is diligent and passionate about Technology. A successful candidate will need to employ strong knowledge in computer technologies, and SW engineering expertise as well as a strong ability to compete effectively in a fast-paced, relevant environment while working with different teams of engineers and collaborators. KEY RESPONSIBILITIES: Research the latest advancements and technologies in Generative AI, more specifically image/video/world generation, MLLM, designing and developing innovative applications aligned with company needs. Study the SOTA generation algorithms and enhance the accuracy and performance of existing models. Explore optimized deployment approaches ensuring efficiency in production environments. Collaborate with teams, share best practices, and provide guidance and support on Generative AI technologies.
包括英文材料
开发框架+
[英文] Understanding Modern Development Frameworks: A Guide for Developers and Technical Decision-makers
https://www.freecodecamp.org/news/understanding-modern-development-frameworks-guide-for-devs/
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
还有更多 •••
相关职位

社招算法研究
1, 算法创新,探索扩散模型在图像视频生成领域,画质,动态性提升的方法 2,算法创新,探索扩散模型推理提速的蒸馏方法和无需训练的方法 3,业务支持,改进现有扩散模型以实现目前业务所需的一些特性,如提高人像一致性,长视频生成的稳定性,指令遵循能力等 4,业务支持,改进现有扩散模型以实现流式地生成
更新于 2025-11-27北京|上海
实习D12753
1、参与快手kling多模态视频生成的研发和落地工作(实习生以发论文为主),包括但不限于: t2v,i2v等基础模型研发、多模态可控视频生成编辑、世界模型等; 2、探索将多模态大语言模型mllm如deepseek/qwen相关技术与视频生成相结合,包括但不限于:提升kling视频生成的多模态理解、推理、多轮交互能力等; 3、探索将语音和视频生成相结合,包括但不限于:语音驱动的视频生成,有声视频等; 4、探索实时可拓展的多模态视频生成技术,提升多模态视频生成的质量和效率等; 5、在顶会顶刊上发表研究成果和开源代码,提升团队在多模态视频生成等领域的学术声望。
更新于 2025-12-02北京
实习D12753
1、参与快手kling多模态视频生成的研发和落地工作(实习生以发论文为主),包括但不限于: t2v,i2v等基础模型研发、多模态可控视频生成编辑、世界模型等; 2、探索将多模态大语言模型mllm如deepseek/qwen相关技术与视频生成相结合,包括但不限于:提升kling视频生成的多模态理解、推理、多轮交互能力等; 3、探索将语音和视频生成相结合,包括但不限于:语音驱动的视频生成,有声视频等; 4、探索实时可拓展的多模态视频生成技术,提升多模态视频生成的质量和效率等; 5、在顶会顶刊上发表研究成果和开源代码,提升团队在多模态视频生成等领域的学术声望。
更新于 2025-12-02深圳