logo of amd

AMD图像视频生成算法实习生 (Jan - Jun 2026)

实习兼职地点:北京状态:招聘

任职要求


Proficiency in at least one deep learning framework (such as TensorFlow, PyTorch, etc.) to design, implement, and optimize complex models. Excellent problem-solving and analytical skills, capable of working effectively and delivering results under high pressure. Strong…
登录查看完整任职要求
微信扫码,1秒登录

工作职责


Location: Beijing THE ROLE: AMD is looking for an AI R&D intern to join our growing team. As a key contributor you will be part of a leading team to drive and enhance AMD’s abilities to explore the highest quality, academic/industry-leading technologies. THE PERSON: The ideal candidate possesses an innovative and problem-solving mindset, has a keen eye for Software engineering development, and is diligent and passionate about Technology. A successful candidate will need to employ strong knowledge in computer technologies, and SW engineering expertise as well as a strong ability to compete effectively in a fast-paced, relevant environment while working with different teams of engineers and collaborators. KEY RESPONSIBILITIES: Research the latest advancements and technologies in Generative AI, more specifically image/video/world generation, MLLM, designing and developing innovative applications aligned with company needs. Study the SOTA generation algorithms and enhance the accuracy and performance of existing models. Explore optimized deployment approaches ensuring efficiency in production environments. Collaborate with teams, share best practices, and provide guidance and support on Generative AI technologies.
包括英文材料
开发框架+
TensorFlow+
PyTorch+
还有更多 •••
相关职位

logo of sensetime
社招算法研究

1, 算法创新,探索扩散模型在图像视频生成领域,画质,动态性提升的方法 2,算法创新,探索扩散模型推理提速的蒸馏方法和无需训练的方法 3,业务支持,改进现有扩散模型以实现目前业务所需的一些特性,如提高人像一致性,长视频生成的稳定性,指令遵循能力等 4,业务支持,改进现有扩散模型以实现流式地生成

更新于 2025-11-27北京|上海
logo of kuaishou
实习D12753

1、参与快手kling多模态视频生成的研发和落地工作(实习生以发论文为主),包括但不限于: t2v,i2v等基础模型研发、多模态可控视频生成编辑、世界模型等; 2、探索将多模态大语言模型mllm如deepseek/qwen相关技术与视频生成相结合,包括但不限于:提升kling视频生成的多模态理解、推理、多轮交互能力等; 3、探索将语音和视频生成相结合,包括但不限于:语音驱动的视频生成,有声视频等; 4、探索实时可拓展的多模态视频生成技术,提升多模态视频生成的质量和效率等; 5、在顶会顶刊上发表研究成果和开源代码,提升团队在多模态视频生成等领域的学术声望。

更新于 2025-12-02北京
logo of kuaishou
实习D12753

1、参与快手kling多模态视频生成的研发和落地工作(实习生以发论文为主),包括但不限于: t2v,i2v等基础模型研发、多模态可控视频生成编辑、世界模型等; 2、探索将多模态大语言模型mllm如deepseek/qwen相关技术与视频生成相结合,包括但不限于:提升kling视频生成的多模态理解、推理、多轮交互能力等; 3、探索将语音和视频生成相结合,包括但不限于:语音驱动的视频生成,有声视频等; 4、探索实时可拓展的多模态视频生成技术,提升多模态视频生成的质量和效率等; 5、在顶会顶刊上发表研究成果和开源代码,提升团队在多模态视频生成等领域的学术声望。

更新于 2025-12-02深圳
logo of kuaishou
实习D0001

1、参与多模态生成算法的调研和分析,如Diffusion Models 、 GAN 、 VAE 、 Autoregressive Models等,包括但不限文本/图像/视频生成,解决生成质量、多样性、可控性、采样效率、可编辑等问题; 2、参与多模态生成算法的基础模块的研发,如 VAE、CLIP、LLM 等; 3、协助多模态生成算法的效果分析、数据优化、行业调研 等。

更新于 2025-02-12北京