
零一万物 (01.AI) NLP Algorithm Engineer
Experienced hire, full-time, 3+ years, R&D. Location: Beijing. Status: Hiring
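Requirements
1. 3+ years of work experience; bachelor's degree or above in computer science, software engineering, mathematics, or a related field; solid mathematical foundation and analytical modeling skills.
2. Proficiency in one or more deep learning frameworks such as TensorFlow, Keras, or PyTorch.
3. Familiarity with the theory and recent progress of deep learning in NLP, including RNN, CNN, Transformer, and BERT.
4. Proficiency in one or more programming languages such as Python, Go, or Java, with strong coding skills.
5. Experience fine-tuning LLMs; experience developing LLM applications such as RAG and Agents is preferred.
6. R&D experience in intelligent question-answering and dialogue bots, text classification, knowledge graphs, machine learning, or large-scale data mining is preferred.
7. Strong execution and sense of responsibility; excellent learning, communication and coordination, logical thinking, and problem-solving abilities; team spirit.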
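Responsibilities
1. Develop algorithm models and strategies for text data classification and grading and for content risk control.
2. Build efficient, accurate classification-and-grading models for business needs, improving data classification and grading accuracy and content risk-control performance.
3. Research cutting-edge AI security techniques and optimize existing algorithms and strategies; starting from concrete scenarios and problems, carry out data mining, analysis, and modeling through to deployment.
4. Collaborate with the product, operations, and content review teams to deploy algorithms and iterate continuously, improving the accuracy of models and strategies.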
Includes English-language materials
Education+
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
Keras+
https://keras.io/getting_started/intro_to_keras_for_engineers/
Keras 3 is a deep learning framework that works with TensorFlow, JAX, and PyTorch interchangeably.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
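PyTorch is an important tool for data science research with deep learning. It offers considerable advantages in flexibility, readability, and performance, and in recent years it has become the most commonly used framework in academia for implementing deep learning algorithms.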
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Deep Learning+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
RNN+
https://d2l.ai/chapter_recurrent-neural-networks/rnn.html
A neural network that uses recurrent computation for hidden states is called a recurrent neural network (RNN).
https://www.deeplearningbook.org/contents/rnn.html
Recurrent neural networks, or RNNs (Rumelhart et al., 1986a), are a family of neural networks for processing sequential data.
https://www.ibm.com/think/topics/recurrent-neural-networks
A recurrent neural network or RNN is a deep neural network trained on sequential or time series data to create a machine learning (ML) model that can make sequential predictions or conclusions based on sequential inputs.
CNN+
https://learnopencv.com/understanding-convolutional-neural-networks-cnn/
Convolutional Neural Network (CNN) forms the basis of computer vision and image processing.
[English] CNN Explainer
https://poloclub.github.io/cnn-explainer/
Learn Convolutional Neural Network (CNN) in your browser!
https://www.deeplearningbook.org/contents/convnets.html
Convolutional networks (LeCun, 1989), also known as convolutional neural networks, or CNNs, are a specialized kind of neural network for processing data.
https://www.youtube.com/watch?v=2xqkSUhmmXU
MIT Introduction to Deep Learning 6.S191: Lecture 3 Convolutional Neural Networks for Computer Vision
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
BERT+
https://www.youtube.com/watch?v=xI0HHN5XKDo
Understand the BERT Transformer in and out.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese, free, starts from zero, with complete examples, based on the latest Python 3 version.
https://www.learnpython.org/
A free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Go+
https://www.youtube.com/watch?v=8uiZC0l4Ajw
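A complete tutorial for learning Golang! Under an hour from start to finish, including a full demonstration of how to build an API in Go. No filler, just what you need to know.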
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Large Models+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
Machine Learning+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Data Mining+
https://www.youtube.com/watch?v=-bSkREem8dM
Database vs Data Warehouse vs Data Lake
https://www.youtube.com/watch?v=7rs0i-9nOjo
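Related Positions
Internship, HARDWARE
1. Build the corpus, including data collection, cleaning and deduplication, and annotation.
2. Develop related tools and scripts.
3. Train, fine-tune, and optimize language models.
4. Write technical documentation and take part in proposal reviews.
Updated 2025-05-20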
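Experienced hire, 3-5 years, R&D
I.
1. Develop algorithms for understanding internet content and tackle the related technical challenges; drive the deployment of multimodal content understanding algorithms and deep learning in internet application scenarios (short video, image-and-text feeds, advertising, games, etc.).
2. Carry out R&D on the automated generation of copy, images, and video; based on creative-asset and ad-delivery data, optimize the algorithmic implementation of intelligent generation strategies for creative copy, video, images, and other materials.
II.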

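Experienced hire, computer and network technology
[Speech recognition track]
1. Take part in NLP work related to Ping An's business, including text classification, named entity recognition, text similarity, language modeling, sentiment analysis, and user behavior analysis.
2. Follow cutting-edge NLP research, continuously improve existing products and technical solutions, and explore and develop new products.
[Knowledge graph track]
1. Handle large-scale text mining and classification, semantic understanding, intelligent question answering, and information extraction, and apply them in real scenarios.
2. Build knowledge graphs and event logic graphs for domains such as finance and law.
3. Explore leading industry methods and improve existing NLP capabilities.
[Dialogue bot track]
1. Using machine learning together with existing natural language processing techniques, develop solutions for text similarity, information extraction, relation inference, reading comprehension, intelligent chatbots, and similar tasks.
2. Implement product solutions and tune their performance.
3. Release the related products and keep iterating on product quality.
Updated 2023-07-03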