DiDi AI-Foundation Model CN Algorithm Intern

Internship · Part-time · Technical · Location: Beijing · Status: Hiring

Job Requirements


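Algorithm Engineer Intern (VLM/LLM track) - Foundation Models for Behavior Prediction and Planning
Responsibilities
* Design and implement foundation models for behavior prediction and motion planning in autonomous driving, building on Vision-Language Models (VLMs) and Large Language Models (LLMs).
* Use multimodal pretrained large models for trajectory generation and fusion, improving the foundation model's ability to understand and predict the intentions of other traffic participants.
* For on-vehicle and cloud deployment, optimize model performance at the algorithm level (for example compression, pruning, distillation, and training and inference acceleration) to ensure model usability, real-time system performance, and efficient resource utilization.
* Work closely with the algorithm, software, and systems teams to drive model integration and deployment in simulation and on real in-vehicle platforms.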

Requirements
* Computer science, artificial intelligence, …

Tags: Includes English materials, Algorithms, Large models, Autonomous driving, Machine learning, Education, GPT
Related Positions

ByteDance
Experienced hire · 5+ years · A174225A

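1. Build the foundational capabilities of foundation models and generative AI and bring them into production, including but not limited to text generation and translation, image-to-text, Deepfake, and efficient training and inference of large models; track the industry's latest advances and conduct forward-looking technical research.
2. Lead the team in applying AIGC technologies to content understanding for commercial products such as advertising, e-commerce, short video, and live streaming, building a new generation of commercial ecosystem based on large models.
3. Own project planning, team building, and cross-team collaboration for the large-model algorithm team, building an industry-leading content understanding algorithm team.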

Updated 2025-02-26 · Shanghai
NVIDIA
Experienced hire

* Design, implement, and optimize scalable ML training pipelines for training multimodal foundation models for robotics.
* Collaborate with researchers to integrate cutting-edge model architectures into scalable training pipelines.
* Implement scalable data loaders and preprocessors for multimodal datasets, such as videos, text, and sensor data.
* Optimize GPU and cluster utilization for efficient model training and fine-tuning on massive datasets.
* Develop robust monitoring and debugging tools to ensure the reliability and performance of training workflows on large GPU clusters.

Updated 2025-08-21 · Shanghai
Apple
Experienced hire · Machine

The computer vision algorithm engineer will work in a dynamic team as part of the Video Engineering org, which develops on-device computer vision and machine perception technologies across Apple’s products. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers' hands. Keywords: machine-learning-based ISP; low-level object detection and segmentation; multiple sensor fusion

Updated 2025-10-13 · Beijing
Apple
Experienced hire · Machine

The computer vision algorithm engineer will work in a dynamic team as part of the Video Engineering org, which develops on-device computer vision and machine perception technologies across Apple’s products. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers' hands.

Updated 2025-10-13 · Beijing