小米语音多模态算法工程师实习生
实习兼职地点:北京状态:招聘
包括英文材料
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
相关职位

实习算法序列
1.参与多模态大模型(VLM/VLA)的研发与优化,探索图像、文本、语音等跨模态信息的融合方法,以及在自动驾驶领域的应用; 2.研究并实现前沿的视觉技术(如Diffusion Model、GAN、VAE等),推动技术落地; 3.配合团队完成算法设计、训练、调优及部署,提升模型性能与工程化能力; 4.跟踪领域前沿研究,撰写技术文档和实验报告,参与论文发表或专利申请。
更新于 2025-05-29
实习淘天集团2026
如果你,期望参与淘天集团语音多模态大模型技术研发,推动数字人AI智能对话、语音自然交互等技术在淘宝Vision和手机淘宝等亿级用户场景的产品化落地; 如果你,期望突破语音模态与语言模型的融合边界,构建新一代Speech-to-Speech多模态基座模型,持续跟踪大模型领域国际前沿技术,通过产学研合作打造行业领先的对话交互系统; 如果你,期待与顶尖算法团队并肩作战,在开放创新的技术氛围中与自驱力强、专业过硬、追求极致的技术伙伴共同开拓多模态交互新范式; 那还在等待什么,赶紧加入我们吧!
更新于 2025-05-07