
Soul App Search Algorithm Engineer
Experienced hire | Full-time | 3+ years | Location: Shanghai | Status: Hiring
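Requirements
1. At least 3 years of experience in search strategy optimization, or at least 5 years of optimization experience across search, advertising, and recommendation.
2. In-depth understanding of the fundamental theory of natural language processing, information retrieval, and machine learning.
3. Solid ability to take engineering work into production; proficient in a programming language such as Python, Java, or C++, with good software engineering practices.
4. Familiar with large-scale data processing toolkits such as Hadoop, Flink, and Spark, and with the related technologies and tools.
5. Able to solve complex problems and optimize performance, with strong analytical thinking. Good communication skills; able to collaborate with multiple teams and share technical knowledge.
Bonus points:
1. Experience developing LLM-based search, search engines, or recommender systems is preferred.
2. Patient and careful in your work, a long-term thinker, and optimistic in the face of uncertainty.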
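Responsibilities
Job description
1. Bring a deep understanding of the quantitative metrics for search experience, and break them down into specific strategies such as retrieval, intent understanding, ranking, and blending, in support of an excellent search experience.
2. Develop a deep understanding of user needs and search behavior, and improve search result quality using data-driven methods.
3. Building on a thorough understanding of the search engine's core pipeline, analyze and optimize the existing search algorithms, using LLM capabilities to identify and resolve quality bottlenecks and problems.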
Includes English-language materials
NLP
https://www.youtube.com/watch?v=fNxaJsNG3-s&list=PLQY2H8rRoyvzDbLUZkbudP-MFQZwNmU4S
Welcome to Zero to Hero for Natural Language Processing using TensorFlow!
https://www.youtube.com/watch?v=R-AG4-qZs1A&list=PLeo1K3hjS3uuvuAXhYjV2lMEShq2UYSwX
Natural Language Processing tutorial for beginners series in Python.
https://www.youtube.com/watch?v=rmVRLeJRkl4&list=PLoROMvodv4rMFqRtEuo6SGjY4XbRIVRd4
The foundations of the effective modern methods for deep learning applied to NLP.
Information Retrieval
https://nlp.stanford.edu/IR-book/information-retrieval-book.html
Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.
Machine Learning
https://www.youtube.com/watch?v=0oyDqO8PjIg
Learn about machine learning and AI with this comprehensive 11-hour course from @LunarTech_ai.
https://www.youtube.com/watch?v=i_LwzRVP7bg
Learn Machine Learning in a way that is accessible to absolute beginners.
https://www.youtube.com/watch?v=NWONeJKn6kc
Learn the theory and practical application of machine learning concepts in this comprehensive course for beginners.
https://www.youtube.com/watch?v=PcbuKRNtCUc
Learn about all the most important concepts and terms related to machine learning and AI.
Python
https://liaoxuefeng.com/books/python/introduction/index.html
In Chinese; free, for absolute beginners, with complete examples, based on the latest Python 3 release.
https://www.learnpython.org/
a free interactive Python tutorial for people who want to learn Python, fast.
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
C++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
Hadoop
https://www.runoob.com/w3cnote/hadoop-tutorial.html
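Hadoop provides reliable, scalable application-layer computing and storage for very large computer clusters. It allows large data sets to be processed in a distributed fashion across clusters of computers using simple programming models, and it scales from a single machine to several thousand machines.
[English] Hadoop Tutorial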
https://www.tutorialspoint.com/hadoop/index.htm
Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.
Flink
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
Spark
[English] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Large Language Models
https://www.youtube.com/watch?v=xZDB1naRUlk
You will build projects with LLMs that will enable you to create dynamic interfaces, interact with vast amounts of text data, and even empower LLMs with the capability to browse the internet for research papers.
https://www.youtube.com/watch?v=zjkBMFhNj_g
Recommender Systems
[English] Recommender Systems
https://www.d2l.ai/chapter_recommender-systems/index.html
Recommender systems are widely employed in industry and are ubiquitous in our daily lives.
Related positions
Experienced hire | Core Local Commerce-业
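1. Optimize and upgrade Meituan's search ranking algorithms and the models behind its traffic regulation system, including sequence-generative re-ranking, heterogeneous ranking, on-device intelligent ranking, sales forecasting, and volume-price models;
2. Research and apply cutting-edge algorithmic techniques to continuously optimize search algorithms and improve the precision of search results;
3. Work closely with product and operations teams to adjust and optimize search algorithms according to business needs, improving the product's competitiveness.
Updated 2025-06-22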
Experienced hire | 2+ years | Core Local Commerce-业
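1. Iterate on search algorithms for Meituan Service Retail (formerly 到综): optimize coarse- and fine-ranking models, deep retrieval, sequence modeling, and multi-intent recognition to raise click-through and conversion rates and to grow transactions and the user base;
2. Apply large-scale machine learning and deep learning algorithms to continuously optimize the query understanding, retrieval, ranking, and re-ranking modules;
3. Set the algorithm iteration plan, using technical means to resolve user-side experience problems and improve traffic distribution efficiency;
4. Work with engineering, data science, front-end, back-end, and product teams to ship algorithms, taking ownership of in-depth iteration on one direction or several modules;
5. Explore growth opportunities in new business formats such as independent craftspeople, reservations, and group buying and group venue booking, helping the business grow quickly.
Updated 2025-05-13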
Campus hire | J1004
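1. Work on search across general short video, live streaming, e-commerce, local services, social, and multilingual businesses, using large-scale machine learning, reinforcement learning, multimodal pretraining, and related techniques to improve search quality, user retention, click-through rate, and other core business metrics;
2. Own core search technologies including query intent classification, query representation, query suggestion, video content understanding and multimodal representation, multimodal semantic retrieval, and relevance, improving search user penetration and relevance;
3. Own search ranking technologies including search user behavior analysis, hybrid semantic and behavioral retrieval, multi-sequence and multi-task coarse ranking, fine ranking, and re-ranking, improving search quality and content consumption metrics;
4. Own the search ecosystem and its mechanisms, contributing to search blending, multi-objective optimization, heterogeneous content blending, balancing short-term and long-term goals, cold start, and other search mechanisms and algorithms.
Updated 2025-09-11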