美团搜索引擎研发工程师
社招全职3年以上核心本地商业-业务研发平台地点:北京状态:招聘
任职要求
1. 本科及以上学历,3年以上工作经验,2年以上搜索、推荐、广告相关领域工作经验。 2. 熟练掌握 Java 或 C++ 语言,具备优秀的编码能力。具有强烈的好奇心,能够发现业务中的痛点,并通过架构改进解决问题。 3. 具备大规模在线系统开发经验,具有优秀的问题分析和解决能力。 4. 责任心强、积极主动、有良好的沟通能力和团队合作能力。 具备以下条件优先 1. 有大规模分布式计算或存储系统研发经验,能够深度理解相关领域关键问题和主流架构者优先。 2. 熟悉检索系统、熟悉常见的向量检索算法,了解FAISS、ScaNN、HNSWLIB等相关开源组件者优先。 3. 熟悉 HA3、Lucene、ElasticSearch 等开源搜索引擎实现者优先,了解 GPU、SSD 等硬件并有相关经验者优先。 4. 熟悉 Spark、Flink等计算引擎,熟悉 HBase、TiDB 等存储系统,并有相关调优经验者优先。 5. 面对复杂系统、复杂问题有化繁为简的能力者优先。
工作职责
参与美团核心本地商业的统一搜索引擎建设,具体包含以下方面: 1. 支撑搜推核心业务的统一检索引擎建设,包含核心存储引擎建设(正倒排索引、向量化检索、KV存储等)、检索引 擎建设(SQL化的查询引擎)、索引构建系统、在线召回系统。 2. 负责流批一体的数据处理系统建设,支撑美团全业务线供给、数百数据源、百亿数据的接入处理、实时索引构建和更新。 3. 从离线数据处理、索引构建、在线召回的一站式平台研发,支持业务的一站式迭代、支持大规模存储服务的自动化运维、auto-resharding、弹性伸缩、离在线混布。
包括英文材料
学历+
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
信息检索+
https://nlp.stanford.edu/IR-book/information-retrieval-book.html
Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
ElasticSearch+
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
TiDB+
相关职位
社招D2867
1、参与快手搜索引擎研发工作,服务数亿海量用户; 2、参与搜索引擎核心架构设计与开发,涵盖的方向包括: 1)参与新一代检索引擎的设计与开发工作,研发业界领先的在线检索系统(包含但不限于正倒排、ANN等); 2)参与千亿级数据的离线构建、增量构建过程,设计并研发领先的build service系统; 3)面向视频、电商等搜索场景业务,建设统一的检索平台,有机融合ANN,倒排等多类检索引擎,提供极致效率的建库,服务,运维体验; 3、探索搜索引擎与AI检索新技术、新思想的应用和落地。
更新于 2025-08-12
社招D2867
1、参与快手搜索引擎研发工作,服务数亿海量用户; 2、参与搜索引擎核心架构设计与开发,涵盖的方向包括: (1)负责前沿ANN算法与引擎的研发,实现大容量、高吞吐、高精度、低成本的向量检索引擎,持续构建业界领先的技术能力; (2)负责自研百亿规模的视频、网页,商品等多类资源的检索系统研发,包括且不限于高效的建库系统,高性能的检索引擎; (3)面向搜广推业务,建设统一的检索平台,有机融合ANN,倒排等多类检索引擎,提供极致效率的建库,服务,运维体验。 3、探索AI检索新技术、新思想的应用和落地。
更新于 2025-04-03
社招3年以上核心本地商业-点
1.负责点评搜索核心服务的架构抽象和优化,高效支持数据接入、召回、排序、展示等功能,支持搜索相关工具建设与优化,提升工程、算法、产品的迭代效率; 2.通过合理的技术选型和实现,保障搜索系统的高可用、高吞吐、低延迟; 3.深入理解搜索业务和产品需求,从而抽象出系统模型,高效支持点评App搜索需求研发工作。
更新于 2025-04-23