字节跳动音频算法工程师(抖音智能对话机器人)-Data
任职要求
1、深入理解端到端语音大模型的原理和架构,熟悉常见的语音大模型,如Whisper等,对语音识别、合成、对话理解等技术有扎实的理论基础; 2、具备优秀的编程能力,熟练使用Python等主流编程语言,熟悉深度学习框架,如PyTorch或TensorFlow,能…
工作职责
1、负责为抖音客服业务VOIP和热线等语音交互场景提供专业的音频技术支持与研发,运用端到端的语音大模型实现更高效、智能的客服音频交互体验; 2、负责端到端语音大模型在客服应用中的落地与优化,搭建智能音频理解和处理在客服领域的系统级解决方案; 3、跟进客服产品业务的语音/音频需求,持续改进产品的音频质量体验;结合实际业务场景,对模型进行针对性训练和调优,确保语音识别、合成等功能能精准适配客服对话需求,提升对话理解和回复的准确性; 4、跟踪研发业界先进的音频进展,探索语音/音频领域最新技术的研发并落地产品。
Collaborate with BI teams to manage ETL processes to support data integration and transformation. Assist BI teams in performing raw data organization, desensitization, deduplication and governance to ensure data management timeliness, completeness, consistency and accuracy Implement cloud services(AWS) to achieve data management and data access control, data auditing, and other functions. Provide data query support for CS,UA teams etc. Continuously assist the company in evaluating and adopting the most suitable technologies for the organization’s data engineering needs to align with the latest compliance requirements in data management. Document processes and maintain comprehensive data architecture records.
Data Pipeline Development -Design, build, and maintain scalable data pipelines using SQL, Spark, and other relevant technologies. -Optimize data pipelines for performance, reliability, and efficiency. Data Governance & Quality -Implement and enforce data governance policies and procedures. -Establish data quality monitoring and validation processes, including data profiling, cleansing, and standardization. -Utilize data cataloging and tagging tools to improve data discoverability and usability. AI Model Support -Collaborate on data labeling and merchant tagging initiatives for AI model training. -Ensure data accuracy and consistency for model development. Business Intelligence & Visualization -Develop interactive data dashboards and reports using BI visualization tools (e.g., Power BI) to support business reviews and technical monitoring. -Translate complex data into clear and actionable insights. Data Support & Troubleshooting -Manage and prioritize data support requests from business and product teams. -Troubleshoot and resolve data-related issues across multiple systems, ensuring data consistency. -Proactively identify and resolve data discrepancies. Data Integration -Design and build scalable data integrations to support business needs. -Collaborate with engineering teams to develop and maintain high-volume inbound and outbound integrations. -Collaboration Work closely with cross-functional teams, including product managers, engineers, and business stakeholders.
-负责大模型数据工程,解决大模型训练,原生应用数据准备平台整体规划、设计和运营,制定产品策略和产品规划 -负责市场调研和分析,收集用户反馈,不断优化产品,提高用户体验 -负责协调技术、市场、运营等团队,推动产品研发团队、技术团队、市场团队等共同完成产品研发、测试、上线等工作 -负责产品的上线后的数据分析和优化,制定产品改进方案 -负责与销售、市场等团队配合,完成产品的销售、推广和运营工作
1.Data Monitoring and Analysis: Monitor and analyze capacity-related and customer experience data across cities, quickly identify issues, and propose optimization solutions.Develop and improve capacity monitoring reports, ensuring timely, accurate, and visualized data to support business decisions. 2.Cross-Department Collaboration: Collaborate closely with city heads, PMMs, and other business teams to drive improvements in capacity and customer experience.Communicate effectively with city teams to convey data insights and jointly address operational challenges. 3.Strategy Support and Execution: Participate in the development and optimization of city capacity management strategies, following up on execution results.Coordinate and advance experience-related operational projects, ensuring effective implementation. 4.Project Management and Process Supervision: Track the entire process of capacity and experience improvement projects to ensure their timely execution and target achievement.Continuously iterate operational plans based on data feedback to drive business improvement. 5.Data Management and Report Optimization: Maintain and update capacity and experience-related databases to ensure data integrity and traceability.Optimize existing data analysis frameworks and report structures to provide tailored reports for different management levels.