苹果Computer Vision/Machine Learning Intern (Video Understanding)
任职要求
Minimum Qualifications • M.S. or PhD in Electrical Engineering/Computer Science or a related field (mathematics, physics or computer engineering), with a focus on computer vision and/or machine learning • Rich experiences in video machine learning covering one of the topics: Object detection and segmentation; Multiple sensor fusion; Activity Recognition; Video Caption • Proven prototyping skills and proficient in coding (C, C++, Python) • Excellent written and verbal communications skills, be comfortable presenting research to large audiences, and have the abil…
工作职责
The computer vision algorithm intern will work in a dynamic team as part of the Video Engineering org which develops on-device computer vision and machine perception technologies across Apple’s products. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers hands. Keywords: Object detection and segmentation; Multiple sensor fusion; Activity Recognition; Video Caption
The computer vision algorithm intern will work in a dynamic team as part of the Video Engineering org which develops on-device computer vision and machine perception technologies across Apple’s products. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers hands. Keywords: Agentic AI; Multi-Modal LLM; Video Foundation Model; Video Generative Editing
The computer vision algorithm intern will work in a dynamic team as part of the Video Engineering org which develops multi-modality based video quality assessment technologies in Apple Platform. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers hands. Keywords: Multi-Modal LLM; Video Quality Assessment; Post-training

We are seeking students motivated to advance the state-of-the-art in computer vision and generative AI. Our projects will broadly focus on image/video generation and understanding, 3D generation and reconstruction. Through collaboration, we aim to make significant product impacts and publish seminal works in top-tier conferences. Key Responsibilities: 1. Conduct research and development in computer vision and generative modeling, with a focus on image and video generation and editing. 2. Implement and experiment with state-of-the-art methods and models. 3. Collaborate closely with researchers and engineers to explore new research directions and contribute to impactful product solutions. 4. Contribute to research publications in top-tier venues. Basic

美图影像研究院(MT Lab)专注于计算机视觉、深度学习与计算机图形学等前沿算法的研究与应用。我们为美图产品提供核心技术支持。团队汇聚顶尖人才,致力于推动影像技术的突破,让科技与艺术美好交汇。 MT Lab focuses on R&D of cutting-edge algorithms in CV, deepearning, and computer graphics. We provide core technicalsupport for Meitu products.Our team of top talent is dedicated to advancing imagingtechnology, beautifully merging science and art. 岗位名称:计算机视觉实习生 工作地点:深圳 主要岗位方向: ● 计算机视觉和机器学习 ● 多模态的图片及视频生成 ● 数据分析,质量监控和数据处理 岗位职责: ● 搭建并运行生成式模型的推理流程,用于多样化视频数据的生成 ● 参与设计和渲染合成视频场景,提升数据的多样性与覆盖度 ● 开发自动化脚本和工具,支持数据生成、处理和结构化管理 ● 协助构建高质量视频数据集,为后续的模型训练与评估提供支持 ● 团队成员协作,解决模型与数据相关的问题,并进行实验性分析与对比