字节跳动语音识别算法工程师-Data语音
社招全职A39979A地点:杭州状态:招聘
任职要求
1、熟悉语音识别算法,对语音识别系统落地和业务效果优化有实际经验; 2、对工业级大规模数据有实际处理经验,有使用海量数据优化实际业务模型的动手经验; 3、对深度学习技术有深度了解和丰富的实战经验,熟悉PyTorch、Tensorflow、Kaldi等平台,有端到端语音识别框架(Transformer、RNN-T、LAS、CTC等)的调优经验; 4、有不错的编码能力,熟悉Linux开发环境,熟悉C++和Python语言; 5、有独立工作能力并同时能与团队融洽相处。 加分项: 1、在会议、智能硬件等场景有大规模的语音识别系统落地和优化经验; 2、对前沿的端到端语音识别系统有优化经验,熟悉RNN-T、Encoder-Decoder等端到端语音识别算法; 3、有优化语音识别解码器并实际落地的经验; 4、在相关国际会议或主流期刊上发表论文(ICASSP、Interspeech、ASRU、IEEE/ACM Transactions等); 5、语音相关比赛或机器学习相关比赛拿到国际领先名次,ACM/NOI/IOI/TopCoder等编程比赛获奖; 6、参与过有影响力开源项目。
工作职责
1、支持语音识别技术在字节跳动公司内外丰富的业务场景落地,解决落地过程中的前沿问题,持续优化语音识别核心技术效果; 2、搭建音频理解核心技术体系,专注语音识别的前沿技术和算法效果,追求和探索业界最前沿算法。
包括英文材料
语音识别+
https://www.youtube.com/watch?v=mYUyaKmvu6Y
Learn how to implement speech recognition in Python by building five projects.
https://www.youtube.com/watch?v=sR6_bZ6VkAg
How Rev.com harnesses human-in-the-loop and deep learning to build the world's best English speech recognition engine
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
Linux+
https://ryanstutorials.net/linuxtutorial/
Ok, so you want to learn how to use the Bash command line interface (terminal) on Unix/Linux.
https://ubuntu.com/tutorials/command-line-for-beginners
The Linux command line is a text interface to your computer.
https://www.youtube.com/watch?v=6WatcfENsOU
In this Linux crash course, you will learn the fundamental skills and tools you need to become a proficient Linux system administrator.
https://www.youtube.com/watch?v=v392lEyM29A
Never fear the command line again, make it fear you.
https://www.youtube.com/watch?v=ZtqBQ68cfJc
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
相关职位
社招JR6DP
1、支持语音识别技术在字节跳动公司内外丰富的业务场景落地,解决落地过程中的前沿问题,持续优化语音识别核心技术效果; 2、搭建音频理解核心技术体系,专注语音识别的前沿技术和算法效果,追求和探索业界最前沿算法。
更新于 2021-03-29
社招A66068
1、支持语音识别技术在字节跳动公司内外丰富的业务场景落地,解决落地过程中的前沿问题,持续优化语音识别核心技术效果; 2、搭建音频理解核心技术体系,专注语音识别的前沿技术和算法效果,追求和探索业界最前沿算法。
更新于 2025-03-28
社招A29448
1、支持语音交互技术在字节跳动公司内外丰富的业务场景落地,解决落地过程中的前沿问题,持续优化在智能硬件中的音频理解及处理,以及语音助手核心技术效果; 2、专注端侧智能交互的前沿技术和算法效果,追求和探索业界最前沿算法; 3、负责字节跳动旗下音频内容创作和消费业务场景的智能移频理解和处理算法研发和业务支持; 4、跟踪智能音频领域的最新技术进展并升级团队自研的各算法系统,包括回声消除、AI降噪、多通道音频处理、音频事件理解与检测; 5、跟踪研发业界先进的音频进展,统计模型/机器学习/深度学习技术在语音/音频领域研发并落地产品。
更新于 2025-03-24