logo of apple

苹果Computer Vision/Machine Learning Intern (Agentic AI)

实习兼职Machine Learning and AI地点:北京状态:招聘

任职要求


Minimum Qualifications
• M.S. or PhD in Electrical Engineering/Computer Science or a related field (mathematics, physics or computer engineering), with a focus on computer vision and/or machine learning
• Rich experiences in video machine learning covering one of the topics: Agentic AI / Multi-Modal LLM / Video Foundation Model / Video Generative Editing
• Proven prototyping skills and proficient in coding (C, C++, Python)
• Excellent written and verbal communications skills, be comfortable presenting research to large audiences, and have the ability to …
登录查看完整任职要求
微信扫码,1秒登录

工作职责


The computer vision algorithm intern will work in a dynamic team as part of the Video Engineering org which develops on-device computer vision and machine perception technologies across Apple’s products. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers hands. 

Keywords: Agentic AI; Multi-Modal LLM; Video Foundation Model; Video Generative Editing
包括英文材料
大模型+
C+
C+++
Python+
NeurIPS+
还有更多 •••
相关职位

logo of apple
实习Machine

The computer vision algorithm intern will work in a dynamic team as part of the Video Engineering org which develops on-device computer vision and machine perception technologies across Apple’s products. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers hands. Keywords: Object detection and segmentation; Multiple sensor fusion; Activity Recognition; Video Caption

更新于 2025-10-21北京
logo of apple
实习Machine

The computer vision algorithm intern will work in a dynamic team as part of the Video Engineering org which develops multi-modality based video quality assessment technologies in Apple Platform. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers hands. Keywords: Multi-Modal LLM; Video Quality Assessment; Post-training

更新于 2025-11-04北京
logo of apple
实习Hardware

• Collaborate with team members to understand failure analysis challenges and translate them into actionable technical solutions. • Develop and deploy AIML models and algorithms to diagnose and predict product failures. • Perform data analysis and visualization to extract actionable insights. • Test, validate, and optimize models to ensure high performance and reliability. • Document model development processes and findings, presenting results to stakeholders. • Stay informed about the latest advancements in AIML technologies and apply them to ongoing projects.

更新于 2025-10-26上海
logo of apple
实习Machine

As a Machine Learning (ML) Engineer, you will be entrusted with the critical role of innovating and applying state-of-the-art research in ML to tackle complex data problems. The solutions you develop will significantly impact future Apple products and the broader ML development ecosystem. You will work with a multidisciplinary team to actively participate in the data-model co-design and co-development practice. Your responsibilities will extend to the design and development of a comprehensive data curation framework. You will also create robust model evaluation pipelines, integral to the continuous improvement and assessment of ML models. Additionally, your role will entail an in-depth analysis of collected data to underscore its influence on model performance. Furthermore, you will have the opportunity to showcase your groundbreaking research work by publishing and presenting at premier academic venues. Your work may span a variety of topics, including but not limited to: * Designing and implementing semi-supervised, self-supervised representation learning techniques for maximizing the power of both limited labeled data and large-scale unlabeled data. * Developing evaluation protocols centered on the end-to-end user experience, with a focus on anticipating potential failure modes, edge cases, and anomalies. * Employing data selection techniques such as novelty detection, active learning, and core-set selection for diverse data types like images, 3D models, natural language, and audio. * Uncovering patterns in data, setting performance targets, and leveraging modern statistical and ML-based methods to model data distributions. This will aid in reducing redundancy and addressing out-of-distribution samples.

更新于 2025-07-29北京