小米小米汽车-高级算法工程师-NLP&大模型
社招全职5年以上A79878地点:北京状态:招聘
任职要求
1. 计算机、电子、数学、机器学习或者统计学相关专业,本科及以上学历;5年以上机器学习、深度学习、大模型建模经验。 2. 精通 Python 编程语言,熟悉常用的数据结构与算法,能够高效地实现复杂的 NLP 算法逻辑,具备良好的代码编写习惯和代码优化能力,确保算法代码的可读性、可维护性和高效性。 3. 熟练掌握至少一种深度学习框架(如 TensorFlow、PyTorch 等),深入理解神经网络的基本原理与架构,包括但不限于RNN、CNN、Transformer 等在 NLP 领域的应用,能够灵活运用这些框架搭建、训练和部署 NLP 模型,以应对舆情数据的复杂特征。 4. 了解常用大模型如 Qwen、GLM、Baichuan 等方法论,能够通过Prompt调优提升推理精度,并对大模型微调技术如 LoRA、P-Tuning 等有实践经验。 5. 对 NLP 有深入理解,掌握文本分类、情感分析、命名实体识别等常见任务的原理与方法,具备丰富的实践经验。 6. 对瞬时大流量场景,如发布会,拥有分类、情感分析、总结摘要等算法处理经验。 7. 熟悉机器学习算法原理,包括监督学习、无监督学习、强化学习等,能够运用机器学习算法解决舆情数据中的分类、聚类、预测、总结等问题,为舆情趋势分析、热点话题挖掘等提供有力支持。 8. 了解舆情数据的特点和业务需求,对舆情监测、舆情分析、舆情预警等工作有一定的认识,能够将 NLP 技术与舆情业务场景紧密结合,为客户提供贴合实际需求的舆情解决方案。
工作职责
1. 负责舆情监测系统中 NLP 相关任务的算法建模与优化,包括文本分类、情感分析、实体识别、语义理解、视频内容理解等模块,确保能够快速准确地从海量文本数据中提取有价值的信息,为舆情预警、趋势分析等应用提供坚实技术支撑。 2. 深入研究舆情数据特点,探索适合的 NLP 模型架构与算法策略,针对舆情文本的复杂性(如网络用语、多领域话题交织等),不断改进现有模型,提高模型泛化能力,使其能够应对多样化的舆情场景和数据变化。 3. 进行标注标准制定,协同标注人员构建高质量数据集,为算法训练提供基础数据,同时基于反馈数据持续优化算法效果,以数据驱动算法迭代。 4. 跟踪行业前沿技术动态与研究成果,如将大语言模型,多模态模型等应用于舆情分析场景。 5. 协助开发团队将算法成果工程化落地,确保模型在实际舆情监测系统中的高效稳定运行,参与算法性能的测试与评估工作,及时解决上线过程中出现的技术难题,保障系统稳定性。
包括英文材料
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
学历+
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
数据结构+
https://www.youtube.com/watch?v=8hly31xKli0
In this course you will learn about algorithms and data structures, two of the fundamental topics in computer science.
https://www.youtube.com/watch?v=B31LgI4Y4DQ
Learn about data structures in this comprehensive course. We will be implementing these data structures in C or C++.
https://www.youtube.com/watch?v=CBYHwZcbD-s
Data Structures and Algorithms full course tutorial java
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
NLP+
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
TensorFlow+
https://www.youtube.com/watch?v=tpCFfeUEGs8
Ready to learn the fundamentals of TensorFlow and deep learning with Python? Well, you’ve come to the right place.
https://www.youtube.com/watch?v=ZUKz4125WNI
This part continues right where part one left off so get that Google Colab window open and get ready to write plenty more TensorFlow code.
PyTorch+
https://datawhalechina.github.io/thorough-pytorch/
PyTorch是利用深度学习进行数据科学研究的重要工具,在灵活性、可读性和性能上都具备相当的优势,近年来已成为学术界实现深度学习算法最常用的框架。
https://www.youtube.com/watch?v=V_xro1bcAuA
Learn PyTorch for deep learning in this comprehensive course for beginners. PyTorch is a machine learning framework written in Python.
Transformer+
https://huggingface.co/learn/llm-course/en/chapter1/4
Breaking down how Large Language Models work, visualizing how data flows through.
https://poloclub.github.io/transformer-explainer/
An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.
https://www.youtube.com/watch?v=wjZofJX0v4M
Breaking down how Large Language Models work, visualizing how data flows through.
Prompt+
https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/introduction-prompt-design
A prompt is a natural language request submitted to a language model to receive a response back.
https://learn.microsoft.com/en-us/azure/ai-foundry/openai/concepts/prompt-engineering
These techniques aren't recommended for reasoning models like gpt-5 and o-series models.
https://www.youtube.com/watch?v=LWiMwhDZ9as
Learn and master the fundamentals of Prompt Engineering and LLMs with this 5-HOUR Prompt Engineering Crash Course!
强化学习+
https://cloud.google.com/discover/what-is-reinforcement-learning?hl=en
Reinforcement learning (RL) is a type of machine learning where an "agent" learns optimal behavior through interaction with its environment.
https://huggingface.co/learn/deep-rl-course/unit0/introduction
This course will teach you about Deep Reinforcement Learning from beginner to expert. It’s completely free and open-source!
https://www.kaggle.com/learn/intro-to-game-ai-and-reinforcement-learning
Build your own video game bots, using classic and cutting-edge algorithms.
相关职位
社招1年以上技术
负责滴滴国际化搜索引擎研发,包括: 1、参与滴滴极具创新的搜索系统技术研究,挑战智能搜索领域的世界级问题。挖掘大规模地理信息数据的价值,推进NLP技术在智慧地图中的应用,领衔地理信息技术,创造极致出行体验。 2、负责用深度学习重新定义地图Query语义分析-召回架构,优化用户Query分析改写引擎,改进召回效果和效率,解决复杂Query语义理解和召回问题。 3、参与创新性技术研究,利用大模型、大规模地理数据改造传统搜索技术,推进AI技术发展。
更新于 2025-06-16
社招技术类-算法
我们是AliExpress广告算法团队,该岗位负责AE搜索广告的NLP&相关性、用户体验优化,包括并不限于: 1. 设计和优化搜索广告相关性下的Query理解、类目预测、深度语义相关性、商品理解、实体匹配等方向 2. 对比学习、表征学习、蒸馏学习在语义理解、类目预测、相关性判别等领域的应用和创新 3. 设计合理的全链路管控与供给策略,保证消费者体验、广告主投放效果、平台营收的良好平衡 4. LLM、MLLM在上述方向的全面应用与优化 5. 建立合理的相关性评测方法,进行数据挖掘,迭代数据标注任务,积累电商领域知识数据资产
更新于 2025-03-31
社招3年以上腾讯云技术
1.负责自然语言处理的算法研发,包括但不限于语义分析、意图识别、语义挖掘、知识图谱、命名实体识别等; 2.负责对话系统,尤其是知识类问答对话系统的技术研究,包括自然语言理解、对话策略学习、自然语言生成、encoder-decoder模型等; 3.负责知识图谱相关关键技术研究,解决知识图谱和自然语言深层次表示、理解与计算问题; 4.负责NLP前沿问题的研究,结合未来实际应用场景,提供技术解决方案。
更新于 2025-09-03