字节跳动高级/资深算法工程师-业务中台
社招全职A148700A地点:北京状态:招聘
任职要求
1、优秀的编程和算法能力,熟悉Python/C++编程语言,熟悉MapReduce,了解Hadoop、Spark系统,掌握深度学习基础知识,熟悉Pytorch、Tensorflow等至少一种深度学习框架; 2、熟悉NLP、CV等相关技术,熟悉Transformer等深度学习算法,在NLP和CV方面有一定积累沉淀,有一定的多模态相关背景,较强的算法实现能力,熟悉多模态常用算法; 3、具备深度预训练模型经验者优先,有多模态、NLP、CV、视频/音频算法相关领域经验者优先,对LLM、多模态学习有深入理解和实践,有预训练、可控内容生成方向经验者优先; 4、有生成模型GAN、VAE、Diffusion等工程项目为加分项;有AIGC相关经验者为加分项,有NLP/CV/ML顶会发表经验者(ACL/EMNLP/CVPR/ICCV/NeurIPS等)为加分项; 5、具备良好的逻辑思维能力、沟通协作能力、自我学习能力,保持对事物的好奇心,态度积极向上,有责任心。
工作职责
1、与业务方紧密合作,理清业务需求并从多模态角度提供解决方案; 2、跟进前沿多模态算法,了解常见多模态任务、数据、评测手段,能够使用内外部多模态工具; 3、处理和分析多模态数据,需要能够有效地清洗、整理和可视化数据等; 4、在多模态LLM上要做到用能改,并在业务数据上Finetune; 5、着重探索基于多模态LLM的视频内容理解,支撑各类业务; 6、与各团队紧密协作,确保算法实施满足业务需求,有优秀的团队管理经验。
包括英文材料
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
MapReduce+
https://www.youtube.com/watch?v=bcjSe0xCHbE
https://www.youtube.com/watch?v=cHGaQz0E7AU
In this video I explain the basics of Map Reduce model, an important concept for any software engineer to be aware of.
Hadoop+
https://www.runoob.com/w3cnote/hadoop-tutorial.html
Hadoop 为庞大的计算机集群提供可靠的、可伸缩的应用层计算和存储支持,它允许使用简单的编程模型跨计算机群集分布式处理大型数据集,并且支持在单台计算机到几千台计算机之间进行扩展。
[英文] Hadoop Tutorial
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
CVPR+
https://cvpr.thecvf.com/
ICCV+
https://iccv.thecvf.com/
ICCV is the premier international computer vision event comprising the main conference and several co-located workshops and tutorials.
NeurIPS+
https://neurips.cc/
相关职位
社招信息技术类
-结合电商的业务特性,进行模型和算法创新,打造业行领先的机器学习/深度学习算法平台能力。 -超大规模的机器学习模型优化,包括但不限于深度学习、强化学习、表征学习等,最大效率地提升电商流量效率。
更新于 2025-05-20
社招3年以上技术
安全引擎部门介绍:安全引擎承担网约车、花小猪、顺风车、货运等业务线安全需求的工程研发,致力于构建“事前隔离-事中研判干预-事后妥处置”的全链路工程架构,覆盖全场景安全产品矩阵、降发生策略架构、风险研判工作台、音视频合规与隐私保护等重点工作。 工作职责: 1、投身构建世界一流的平台型安全术体系,提升滴滴用户在安全场景下的体验和满意度; 2、积极参与业务需求讨论,支撑滴滴负向业务的需求研发,确保端到端的交付效率与质量; 3、深入理解安全业务逻辑,抽象业务,沉淀中台,用技术手段提升滴滴的负向业务能力,赋能业务发展。
更新于 2025-06-13