新浪微博高级算法工程师(语义搜索&大模型应用)
社招全职新浪&微博地点:北京状态:招聘
任职要求
1. 熟练掌握机器学习、深度学习等方向理论和应用,动手能力强,有主动探索和思考 2. 掌握RAG、文本生成、模型蒸馏/窃取等技术,并有项目实践经验 3. 熟悉主流大模型算法,对Prompt工程、SFT、Agent等技术有实践经验 4. 熟练使用C++/Java/Python至少一门语言,较强的技术攻关能力,能够跟进领域内最新技术研究成果,并结合应用场景快速实验和调优 5. 优秀的分析问题和解决问题的能力,对解决具有挑战性的问题充满激情 6. 良好的沟通能力,良好的团队合作精神
工作职责
1. 负责微博主站搜索业务的语义搜索技术研究和落地,包括:语义相关性、查询理解、问题生成、召回索引等核心技术 2. 基于海量用户行为数据以及人工标注数据,结合自然语言处理、大模型等前沿技术,支持Query改写、内容生成等一系列业务 3. 推进大模型技术在搜索引擎的落地,参与基础大语言模型应用研发,包括但不限于智能问答、物料扩充生成、搜索任务规划、内容优选和排序、工具调用、归纳总结、逻辑推理等能力
包括英文材料
机器学习+
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
深度学习+
https://d2l.ai/
Interactive deep learning book with code, math, and discussions.
RAG+
https://www.youtube.com/watch?v=sVcwVQRHIc8
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer.
大模型+
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
Prompt+
https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/introduction-prompt-design
A prompt is a natural language request submitted to a language model to receive a response back.
https://learn.microsoft.com/en-us/azure/ai-foundry/openai/concepts/prompt-engineering
These techniques aren't recommended for reasoning models like gpt-5 and o-series models.
https://www.youtube.com/watch?v=LWiMwhDZ9as
Learn and master the fundamentals of Prompt Engineering and LLMs with this 5-HOUR Prompt Engineering Crash Course!
SFT+
https://cameronrwolfe.substack.com/p/understanding-and-using-supervised
Understanding how SFT works from the idea to a working implementation...
AI agent+
https://www.ibm.com/think/ai-agents
Your one-stop resource for gaining in-depth knowledge and hands-on applications of AI agents.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Python+
https://liaoxuefeng.com/books/python/introduction/index.html
中文,免费,零起点,完整示例,基于最新的Python 3版本。
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
相关职位
社招3年以上新浪&微博
1. 负责微博主站搜索业务,含搜索算法技术的研究、理解业务需求、优化搜索召回、排序效果 2. 负责智能搜索引擎相关算法开发和落地应用,涵盖智搜问答、语义搜索、内容理解、物料挖掘等 3. 负责搜索推荐前沿技术的调研与实现,研究RAG、语义检索、内容生成等技术和算法,并应用到实际问题中 4. 大规模数据挖据和分析,从海量数据中挖掘检索高质量微博与账号
更新于 2025-03-06
社招技术类-开发
1. 参与蚂蚁国际商服平台智能客服机器人AI算法的设计与开发,能够进行商服基座大模型的持续预训练(Continuous pretrain,CP)、监督微调(SFT)、基于人类反馈的强化学习(RHLF)等技术工作,并推动其在实际业务场景中的高效应用与落地。 2. 参与蚂蚁国际商服平台智能坐席助手AI算法的设计与开发,在坐席服务的前中后阶段,通过文本总结,观点挖掘,模拟对话、语义搜索、话术推荐、智能质检等AI技术,辅助提升坐席服务人员的服务半径与时效。
更新于 2025-06-03
社招1年以上技术
负责滴滴国际化搜索引擎研发,包括: 1、参与滴滴极具创新的搜索系统技术研究,挑战智能搜索领域的世界级问题。挖掘大规模地理信息数据的价值,推进NLP技术在智慧地图中的应用,领衔地理信息技术,创造极致出行体验。 2、负责用深度学习重新定义地图Query语义分析-召回架构,优化用户Query分析改写引擎,改进召回效果和效率,解决复杂Query语义理解和召回问题。 3、参与创新性技术研究,利用大模型、大规模地理数据改造传统搜索技术,推进AI技术发展。
更新于 2025-06-16