字节跳动视觉算法工程师(人机交互/XR多模态方向)-PICO
社招全职A13728地点:北京状态:招聘
任职要求
1、计算机、自动化、电子、数学等相关专业,本科及以上学历,硕士学位、博士学位优先; 2、数学与算法基础扎实:线性代数、概率统计、优化方法、数值计算等;对机器学习/深度学习核心理论有系统理解; 3、编程基础扎实:熟练使用Python/C++(至少其一非常熟练);具备良好的工程习惯(调试、性能分析、单元测试、代码规范等); 4、熟悉至少一种深度学习框架(PyTorch/TensorFlow等),有模型训练、调参、评估与问题诊断的经验;理解常见视觉网络/Transformer等结构与训练范式; 5、具备良好的沟通协作能力与自驱力,能在不确定问题中快速学习、拆解并推进落地。 加分项: 1、在AI/计算机视觉/多模态…
登录查看完整任职要求
微信扫码,1秒登录
工作职责
1、负责XR场景下人机交互相关视觉感知算法的研发与工程落地工作,包括但不限于眼动追踪技术、情绪感知技术等方向; 2、基于深度学习方法进行模型设计、训练与评测,持续优化算法在真实设备上的鲁棒性、时延、功耗与体验指标,推动移动端/嵌入式侧部署(如量化、剪枝、加速与性能调优); 3、参与数据与评测体系建设:数据采集与清洗、自动/半自动标注、合成数据(仿真/渲染)、建立可复用的数据闭环与指标体系; 4、与产品、交互、硬件、系统软件等团队协作,推进算法在端侧应用落地;参与关键技术方案评审、问题定位与迭代优化; 5、探索多模态交互在XR场景的融合应用,研究多模态信息的对齐与融合(时空对齐、语义融合、置信度建模等),并参与原型系统验证。
包括英文材料
学历+
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
OpenCV+
https://learnopencv.com/getting-started-with-opencv/
At LearnOpenCV we are on a mission to educate the global workforce in computer vision and AI.
https://opencv.org/university/free-opencv-course/
This free OpenCV course will teach you how to manipulate images and videos, and detect objects and faces, among other exciting topics in just about 3 hours.
CVPR+
https://cvpr.thecvf.com/
ICCV+
https://iccv.thecvf.com/
ICCV is the premier international computer vision event comprising the main conference and several co-located workshops and tutorials.
还有更多 •••
相关职位
社招后端开发
【职位描述】 1、设计和实现机器学习平台业务系统, 包括工具链/组件等AI基础设施, 落地业务功能需求; 2、高效优化和部署 计算机视觉、语音识别、语音合成、自然语言处理 等业务模型; 3、与公司各算法部门深度合作, 分析业务性能瓶颈和系统架构特征, 软硬件结合优化, 实现极致性能。
北京|上海
社招5年以上技术类-前端
1、负责小天基/神农控制台/ASO/staragent/统一运维平台的前端开发工作,完成产品的前端框架升级,保证流畅的交互体验。 2、结合阿里云整体的视觉设计风格,建设统一的前端基础组件库(组件库、图形库、工程体系、低代码、服务化平台等),保障前端性能及交互一致性的同时,提升研发效率 3、基于阿里云统一的AEM基础设施对用户行为进行记录并建立数据化度量体系,为产品交互及后端性能优化方案或技术选型提供数据支撑 4、负责线上系统的维护和管理,保障系统稳定运行;
更新于 2025-04-02杭州
社招5年以上技术类-前端
1、负责小天基/神农控制台/ASO/staragent/统一运维平台的前端开发工作,完成产品的前端框架升级,保证流畅的交互体验。 2、结合阿里云整体的视觉设计风格,建设统一的前端基础组件库(组件库、图形库、工程体系、低代码、服务化平台等),保障前端性能及交互一致性的同时,提升研发效率 3、基于阿里云统一的AEM基础设施对用户行为进行记录并建立数据化度量体系,为产品交互及后端性能优化方案或技术选型提供数据支撑 4、负责线上系统的维护和管理,保障系统稳定运行;
更新于 2025-04-02北京