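JD.com (京东) End-to-End Multimodal Interaction Algorithm Engineer
Experienced hire · Full-time · Algorithm development · Location: Beijing · Status: Hiring
Requirements
1. Master's degree or above, with solid programming skills, good design ability, and some familiarity with design patterns;
2. Working knowledge of C++, data structures, multithreaded programming, network programming (TCP/WebSocket), and operating systems;
3. Familiarity with cross-platform native development workflows and tools, such as CMake, GitLab CI, JNI, and Objective-C/Swift;
4. Experience in mobile audio development is preferred, e.g. familiarity with OpenSL/Audio…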
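Responsibilities
1. Design, develop, and optimize a cross-platform (iOS/Android/Linux), cross-end (server + client) audio/video interaction SDK;
2. Work with product lines to integrate mature audio/video interaction processing algorithms and improve how audio/video interaction performs in products;
3. Take part in development that supports business rollout and technical R&D for audio/video interaction;
4. Keep learning new programming techniques and following progress in speech systems across industry and academia, and refine business logic.
Includes English-language materials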
Education
Design Patterns
https://liaoxuefeng.com/books/java/design-patterns/index.html
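Design patterns are code design practices that are used repeatedly in software design. The purpose of using them is to make code reusable and to improve its extensibility and maintainability.
[English] Design Patterns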
https://refactoring.guru/design-patterns
Design patterns are typical solutions to common problems in software design. Each pattern is like a blueprint that you can customize to solve a particular design problem in your code.
https://www.youtube.com/watch?v=NU_1StN5Tkk
Design Patterns tutorial explained in simple words using real-world examples.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Data Structures
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
Multithreading
https://liaoxuefeng.com/books/java/threading/basic/index.html
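Compared with single-threaded code, the defining feature of multithreaded programming is that threads often read and write shared data, and that access has to be synchronized.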
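As a minimal sketch of that point (not taken from the linked tutorial, and written in Python only for brevity; the role itself is C++-focused), several threads can increment one shared counter while holding a lock. The function name `run_counter` and its parameters are illustrative assumptions.

```python
import threading


def run_counter(num_threads: int = 4, increments: int = 10_000) -> int:
    """Increment one shared counter from several threads, guarded by a lock."""
    total = 0
    lock = threading.Lock()

    def worker() -> None:
        nonlocal total
        for _ in range(increments):
            # "total += 1" is a read-modify-write, so it is done while
            # holding the lock to keep concurrent updates from being lost.
            with lock:
                total += 1

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()  # wait for every worker before reading the result
    return total


if __name__ == "__main__":
    print(run_counter())
```

With the lock in place the result is always `num_threads * increments`; without it, updates from different threads can overwrite each other.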
https://www.youtube.com/watch?v=_uQgGS_VIXM&list=PLsc-VaxfZl4do3Etp_xQ0aQBoC-x5BIgJ
https://www.youtube.com/watch?v=IEEhzQoKtQU
https://www.youtube.com/watch?v=mTGdtC9f4EU&list=PLL8woMHwr36EDxjUoCzboZjedsnhLP1j4
https://www.youtube.com/watch?v=TPVH_coGAQs&list=PLk6CEY9XxSIAeK-EAh3hB4fgNvYkYmghp
https://www.youtube.com/watch?v=xPqnoB2hjjA
This video is an introduction to multithreading in modern C++.
https://www.youtube.com/watch?v=YKBwKy5PrpQ
Rust threading is easy to implement and improves the efficiency of your applications on multi-core systems!
Network Programming
https://www.youtube.com/watch?v=2HrYIl6GpYg
I will make a simple HTTP web server with the C Programming Language.
https://www.youtube.com/watch?v=8z6okCgdREo
This tutorial is for Gophers who have written a command line or an API application, but have little to no experience in lower-level concepts like reading and writing to sockets, working with channels, and managing multiple goroutines.
https://www.youtube.com/watch?v=bdIiTxtMaKA&list=PL9IEJIKnBJjH_zM5LnovnoaKlXML5qh17
https://www.youtube.com/watch?v=bzja9fQWzdA
Implement the ubiquitous TCP protocol that underlies much of the traffic on the internet!
[English] Network Programming with Python Course (build a port scanner, mailing client, chat room, DDOS)
https://www.youtube.com/watch?v=FGdiSJakIS4
Learn network programming in Python by building four projects. You will learn to build a mailing client, a DDOS script, a port scanner, and a TCP Chat Room.
https://www.youtube.com/watch?v=gntyAFoZp-E
https://www.youtube.com/watch?v=JiuouCJQzSQ
Explore the fundamentals of networking in Rust by building a simple TCP server.
https://www.youtube.com/watch?v=JRTLSxGf_6w
https://www.youtube.com/watch?v=sFizpxHkIlI
In this video we'll cover SOCKET PROGRAMMING in JAVA.
https://www.youtube.com/watch?v=sXW_sNGvqcU
WebSocket
[English] WebSockets Tutorial
https://www.tutorialspoint.com/websockets/index.htm
WebSockets provide two-way communication between servers and clients, which means both parties can communicate and exchange data at the same time.
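Related Positions
Experienced hire · Algorithm development
1. Design, develop, and optimize a cross-platform (iOS/Android/Linux), cross-end (server + client) audio/video interaction SDK; 2. Work with product lines to integrate mature audio/video interaction processing algorithms and improve how audio/video interaction performs in products; 3. Take part in development that supports business rollout and technical R&D for audio/video interaction; 4. Keep learning new programming techniques and following progress in speech systems across industry and academia, and refine business logic.
Updated 2025-09-02 · Beijing
Experienced hire · A259606
1. Support the rollout of end-to-end speech multimodal large-model technology across ByteDance's many internal and external business scenarios, solve frontier problems that arise during rollout, and keep improving results; 2. Explore frontier multimodal technology, focusing on the techniques and algorithm quality of speech multimodal large models and pursuing state-of-the-art algorithms, including but not limited to the generation and understanding of language, music, speech, and audio; 3. Research and follow frontier technology in audio, NLP, and multimodal directions.
Updated 2025-03-28 · Shanghai
Experienced hire · 3+ years · Technical - Algorithms
1. Improve the conversational voice interaction experience: for intelligent scenarios such as the Alipay life assistant, keep refining the streaming full-duplex voice interaction experience, improve speech understanding and generation quality in vertical scenarios, and build voice interaction that feels more "human"; 2. Build multimodal interaction algorithm capabilities: combining multimodal perception and fusion algorithms, design key capabilities such as coordinated real-time understanding across audio and video, interaction decision-making, and long-term memory, so that the system can "watch, think, and speak at the same time" with rich expressiveness; 3. Improve the quality of multimodal interaction feedback: for text, speech, and video, build on accurate semantic content to make expression feel more "real" and "human", including but not limited to paralinguistic information and improvements in visual quality and aesthetics; 4. Optimize end-to-end latency: work on training, fine-tuning, and inference acceleration for multimodal large models, including but not limited to training efficiency, model acceleration, and device-cloud collaboration, pushing models to their limits and driving them into production.
Updated 2025-11-21 · Hangzhou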
