Abstract
Multimodal technology makes intelligent processing of unstructured data such as text, speech, video, and images possible. Building on a study of key techniques in multimodal deep learning, such as modal representation, this article addresses the difficulties encountered in audio-visual content supervision and the current working needs of online audio-visual supervision. It presents a preliminary exploration of how multimodal technology can be applied to the supervision of online audio-visual content, with the aim of improving the accuracy and efficiency of online audio-visual big data processing.
Author
Yang Qian (杨茜)
Chongqing Radio and Television Monitoring Station, Chongqing 401147, China
Source
Radio & TV Broadcast Engineering (《广播与电视技术》)
2023, No. 12, pp. 111-115 (5 pages)
Keywords
Multimodality
Network audiovisual
Content supervision