字节跳动机器学习算法引擎研发工程师-Data语音
社招全职A11226地点:北京状态:招聘
任职要求
1、扎实的C++或C编程语言,熟悉常用数据结构以及基础算法,对有挑战的工作富有热情; 2、熟悉计算机体系结构和操作系统,具备软件/库架构的设计与研发经验的更佳; 3、有很强的分析问题和解决问题的能力,有强烈的责任心。 加分项: 1、在ACM/NOI/IOI/TopCoder获奖者优先; 2、有定点量化、指令集优化、深度模型优化等相关项目经验者优先; 3、有CPU,GPU,NPU,ARM,OpenCL,DSP等高性能计算优化经验者优先; 4、有相关语音信号处理,语音识别、语音合成或者自然语言处理经验的优先; 5、有相关语音算法引擎开发经验的优先。
工作职责
1、设计和研发业界领先的高性能端云算法引擎,提供满足语音识别,对话交互,语音合成,音频检索等场景的核心原子能力; 2、负责深度优化核心引擎,包括端云一体的高性能计算引擎,音频特征处理引擎,大规模解码引擎,音频合成引擎,音频特效引擎,对话交互引擎,音频检索引擎等常用引擎极致优化; 3、负责算法落地性能评估和分析,制定技术规划和性能标准,持续加强提升关键技术竞争力; 4、负责为字节跳动全系产品(今日头条、抖音、抖音火山版、西瓜视频、飞书、番茄小说等)提供AI语音理解、对话以及语音合成等方面的能力,用AI技术影响数亿用户。
包括英文材料
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
C+
https://www.freecodecamp.org/chinese/news/the-c-beginners-handbook/
本手册遵循二八定律。你将在 20% 的时间内学习 80% 的 C 编程语言。
https://www.youtube.com/watch?v=87SH2Cn0s9A
https://www.youtube.com/watch?v=KJgsSFOSQv0
This course will give you a full introduction into all of the core concepts in the C programming language.
https://www.youtube.com/watch?v=PaPN51Mm5qQ
In this complete C programming course, Dr. Charles Severance (aka Dr. Chuck) will help you understand computer architecture and low-level programming with the help of the classic C Programming language book written by Brian Kernighan and Dennis Ritchie.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
OpenCL+
https://developer.nvidia.com/opencl
OpenCL™ (Open Computing Language) is a low-level API for heterogeneous computing that runs on CUDA-powered GPUs.
https://engineering.purdue.edu/~smidkiff/ece563/NVidiaGPUTeachingToolkit/Mod20OpenCL/3rd-Edition-AppendixA-intro-to-OpenCL.pdf
we will give a brief overview of OpenCL for CUDA programers.
[英文] Hands On OpenCL
https://handsonopencl.github.io/
An open source two-day lecture course for teaching and learning OpenCL
https://leonardoaraujosantos.gitbook.io/opencl/chapter1
Open Computing Language is a framework for writing programs that execute across heterogeneous platforms.
https://ulhpc-tutorials.readthedocs.io/en/latest/gpu/opencl/
OpenCL came as a standard for heterogeneous programming that enables a code to run in different platforms.
https://www.youtube.com/watch?v=4q9fPOI-x80
This presentation will show how to make use of the GPU from Java using OpenCL.
语音识别+
https://www.youtube.com/watch?v=mYUyaKmvu6Y
Learn how to implement speech recognition in Python by building five projects.
https://www.youtube.com/watch?v=sR6_bZ6VkAg
How Rev.com harnesses human-in-the-loop and deep learning to build the world's best English speech recognition engine
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
相关职位
社招A221731
1、设计和研发业界领先的高性能端云算法引擎,提供满足语音识别,对话交互,语音合成,音频检索等场景的核心原子能力; 2、负责深度优化核心引擎,包括端云一体的高性能计算引擎,音频特征处理引擎,大规模解码引擎,音频合成引擎,音频特效引擎,对话交互引擎,音频检索引擎等常用引擎极致优化; 3、负责算法落地性能评估和分析,制定技术规划和性能标准,持续加强提升关键技术竞争力; 4、负责为字节跳动产品(今日头条、抖音、抖音火山版、西瓜视频、飞书、番茄小说等)提供AI语音理解、对话以及语音合成等方面的能力,用AI技术影响数亿用户。
更新于 2025-05-27
社招3年以上A147138
1、持续推进豆包语音大模型最新技术的工程优化和应用落地; 2、负责设计并实现高可用、可扩展、分布式机器学习平台,支撑多地域语音相关业务模型服务高效迭代更新; 3、与算法、引擎、后端工程师密切配合,理解语音大模型MaaS能力建设和部署运维流程,负责/参与机器学习平台的设计、开发和维护; 4、持续提升平台效率、易用性、降低使用成本,探索业界前沿的机器学习相关技术,设计并实现到机器学习平台。
更新于 2025-03-28
社招3年以上A30891
1、持续推进豆包语音大模型最新技术的工程优化和应用落地; 2、负责设计并实现高可用、可扩展、分布式机器学习平台,支撑多地域语音相关业务模型服务高效迭代更新; 3、与算法、引擎、后端工程师密切配合,理解语音大模型MaaS能力建设和部署运维流程,负责/参与机器学习平台的设计、开发和维护; 4、持续提升平台效率、易用性、降低使用成本,探索业界前沿的机器学习相关技术,设计并实现到机器学习平台。
更新于 2025-03-28