美团后台开发专家-存储检索引擎方向
社招全职3年以上核心本地商业-业务研发平台地点:北京状态:招聘
任职要求
1、本科及以上学历,3年以上工作经验,2年以上搜索、推荐、广告相关领域工作经验。 2、熟练掌握 Java 或 C++ 语言,具备优秀的编码能力。具有强烈的好奇心,能够发现业务中的痛点,并通过架构改进解决问题。 3、具备大规模在线系统开发经验,具有优秀的问题分析和解决能力。 4、责任心强、积极主动、有良好的沟通能力和团队合作能力。 具备以下条件优先 1、有大规模分布式计算或存储系统研发经验,能够深度理解相关领域关键问题和主流架构者优先。 2、熟悉检索系统、熟悉常见的向量检索算法,了解FAISS、ScaNN、HNSWLIB等相关开源组件者优先。 3、熟悉 HA3、Lucene、ElasticSearch 等开源搜索引擎实现者优先,了解 GPU、SSD 等硬件并有相关经验者优先。 4、熟悉 Spark、Flink等计算引擎,熟悉 HBase、TiDB 等存储系统,并有相关调优经验者优先。 5、面对复杂系统、复杂问题有化繁为简的能力者优先。
工作职责
1. 支撑搜推核心业务的统一检索引擎建设,包含核心存储引擎建设(正倒排索引、向量化检索、KV存储等)、检索引擎建设(SQL化的查询引擎)、索引构建系统、在线召回系统。 2. 负责流批一体的数据处理系统建设,支撑美团全业务线供给、数百数据源、百亿数据的接入处理、实时索引构建和更新。 2. 从离线数据处理、索引构建、在线召回的一站式平台研发,支持业务的一站式迭代、支持大规模存储服务的自动化运维、auto-resharding、弹性伸缩、离在线混布。
包括英文材料
学历+
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
C+++
https://www.learncpp.com/
LearnCpp.com is a free website devoted to teaching you how to program in modern C++.
https://www.youtube.com/watch?v=ZzaPdXTrSb8
信息检索+
https://nlp.stanford.edu/IR-book/information-retrieval-book.html
Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.
算法+
https://roadmap.sh/datastructures-and-algorithms
Step by step guide to learn Data Structures and Algorithms in 2025
https://www.hellointerview.com/learn/code
A visual guide to the most important patterns and approaches for the coding interview.
https://www.w3schools.com/dsa/
ElasticSearch+
https://www.youtube.com/watch?v=a4HBKEda_F8
Learn about Elasticsearch with this comprehensive course designed for beginners, featuring both theoretical concepts and hands-on applications using Python (though applicable to any programming language). The course is structured in two parts: first covering essential Elasticsearch fundamentals including index management, document storage, text analysis, pipeline creation, search functionality, and advanced features like semantic search and embeddings; followed by a practical section where you'll build a real-world website using Elasticsearch as a search engine, working with the Astronomy Picture of the Day (APOD) dataset to implement features such as data cleaning pipelines, tokenization, pagination, and aggregations.
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.
HBase+
[英文] HBase Tutorial
https://www.tutorialspoint.com/hbase/index.htm
HBase is a data model that is similar to Google's big table designed to provide quick random access to huge amounts of structured data. This tutorial provides an introduction to HBase, the procedures to set up HBase on Hadoop File Systems, and ways to interact with HBase shell.
TiDB+
相关职位
社招5年以上技术类-开发
1、基于多模态大模型设计无影AI的技术架构,不断优化可扩展性,帮助无影业务不断快速增长。 2、支持服务上亿用户规模的高性能、分布式、可扩展的桌面和应用虚拟化产品线架构。 3、设计并实施全流式通信的 LLM 应用架构,实现端到端最佳的性能和资源效率,不断优化高并发高吞吐在线系统的健壮性。 4、结合多模态数据存储服务和RAG检索技术、多Agent协同、Function Call等,快速开发针对行业的热门应用。
更新于 2025-06-19
社招8年以上CDG技术
1.主导海外联盟广告系统的后台技术架构设计和开发规划,推动后台开发团队技术演进与性能优化; 2.负责团队管理与人才梯队建设,制定技术规范与研发流程,提升团队工程化能力与交付效率; 3.负责与开发者对接海外广告联盟流量的接入, 流量变现机制的优化与分成结算; 4.动态分配广告后台系统算力,与平台部门协同优化广告系统召回、粗精排等全链路模块,提升海外广告联盟流量的变现效率; 5.与平台部门合作,推动广告投放产品的开发迭代和广告的效果优化; 6.与sdk协同提升广告的点击率和转化率; 7.与各团队进行数据对接,优化数据链路,保障数据的正确性; 8.与新加坡团队对接,保障系统的海外隔离部署和稳定运行。
更新于 2025-06-11
社招5年以上信息技术类
【职责描述】 1.配合业务部门梳理业务流程和数据模型,对设计模型设计,存储选型进行建议和把关. 2.ClickHouse/starrocks线上问题定位、线上SLA保障. 3.负责内核技术的研究和新技术的引入,解决业务问题.
更新于 2024-10-30