Xiaomi Senior Big Data Engineer
Requirements
1. A degree in Computer Science, Software Engineering, Information Technology, or a related field.
2. Strong Computer Science fundamentals in algorithms and data structures.
3. Good understanding of system performance and scaling.
4. Familiarity with at least one language such as C++, Go, Java, or Python, and some experience in Linux shell development.
5. Good team communication and collaboration skills.
Responsibilities
1. Research and develop the data platform for Xiaomi's internet businesses.
2. Build the infrastructure and tools required for optimal extraction, transformation, and loading of data from a wide variety of data sources.
3. Design and implement Data as a Service (DaaS) that assists analytics and data science team members in developing intelligent, agile operations.
4. Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
You will design and build data warehouses in the cloud to provide efficient analytical and reporting capabilities across Apple’s global and regional sales and finance teams. You will develop highly scalable data pipelines to load data from various source systems and use Apache Airflow to orchestrate, schedule, and monitor the workflows. You will build generic, reusable solutions that meet data warehousing design standards for complex business requirements. You will be required to understand existing solutions, fine-tune them, and support them as needed. Data quality is our goal, and we expect you to meet our high standards for data and software quality. We are a rapidly growing team with plenty of interesting technical and business challenges to solve. We seek a self-starter who is willing to learn fast, adapt well to changing requirements, and work with cross-functional teams.
• Design and implement end-to-end data pipelines (ETL) to ensure efficient data collection, cleansing, transformation, and storage, supporting both real-time and offline analytics needs.
• Develop automated data monitoring tools and interactive dashboards to enhance business teams’ insights into core metrics (e.g., user behavior, AI model performance).
• Collaborate with cross-functional teams (e.g., Product, Operations, Tech) to align data logic, integrate multi-source data (e.g., user behavior, transaction logs, AI outputs), and build a unified data layer.
• Establish data standardization and governance policies to ensure consistency, accuracy, and compliance.
• Provide structured data inputs for AI model training and inference (e.g., LLM applications, recommendation systems), optimizing feature engineering workflows.
• Explore innovative AI-data integration use cases (e.g., embedding AI-generated insights into BI tools).
• Provide technical guidance and best practices on data architecture that serves both traditional reporting purposes and modern AI Agent requirements.