4 articles found
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art
1
Authors: Mengting Liu, Ying Zhou, Yuwei Wu, Feng Gao. 《Machine Intelligence Research》 (EI, CSCD), 2024, No. 1, pp. 4-28 (25 pages)
In recent years, computing art has developed rapidly with the in-depth cross study of artificial intelligence generated content (AIGC) and the main features of artworks. Audio-visual content generation has gradually been applied to various practical tasks, including video or game score, assisting artists in creation, art education and other aspects, which demonstrates a broad application prospect. In this paper, we introduce innovative achievements in audio-visual content generation from the perspective of visual art generation and auditory art generation based on artificial intelligence (AI). We outline the development tendency of image and music datasets, visual and auditory content modelling, and related automatic generation systems. The objective and subjective evaluation of generated samples plays an important role in the measurement of algorithm performance. We provide a cogeneration mechanism of audio-visual content in multimodal tasks from image to music and display the construction of specific stylized datasets. There are still many new opportunities and challenges in the field of audio-visual synesthesia generation, and we provide a comprehensive discussion on them.
Keywords: artificial intelligence (AI) art; audio-visual; artificial intelligence generated content (AIGC); multimodal; artistic evaluation
An Endogenous Watermarking Method for Generated Images Based on Exact Diffusion Inversion
2
Authors: Li Li (李莉), Zhang Xinpeng (张新鹏), Wang Zichi (王子驰), Wu Deyang (吴德阳), Wu Hanzhou (吴汉舟). 《网络空间安全科学学报》, 2024, No. 1, pp. 92-100 (9 pages)
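Diffusion models have achieved remarkable results in image generation, but the generated images are hard to tell apart from real ones, so misuse of diffusion models can cause social problems in privacy, security, law, and ethics. Watermarking the output of a generative model makes it possible to trace the copyright of generated content and to prevent potential harm from AI-generated content. For denoising diffusion models, endogenous watermarking methods that add the watermark to the initial noise vector can directly generate watermarked images; during copyright verification, the initial vector is reconstructed through reverse diffusion so that the watermark can be extracted. However, the sampling process of a diffusion model is not strictly invertible, so the reconstructed noise vector deviates substantially from the original noise and accurate watermark extraction is hard to guarantee. Introducing exact reverse diffusion based on coupling transformations allows the initial noise vector to be reconstructed more accurately, which improves the accuracy of watermark extraction. Experiments verify the performance gain that exact reverse diffusion based on coupling transformations brings to endogenous watermarking of generated images. The results show that endogenous watermarking can embed an invisible watermark in generated images, and that the embedded watermark can be accurately extracted through exact reverse diffusion and has a degree of robustness.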
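Keywords: generative artificial intelligence (artificial intelligence generated content, AIGC) provenance tracing; model watermarking; digital watermarking; denoising diffusion model; reverse diffusion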
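The abstract above describes embedding a watermark in the initial noise vector and recovering it by exactly inverting the generation process through coupling transformations. The following is a minimal sketch of that idea under loud assumptions: it is not the authors' method and uses no real diffusion model. The function names (`embed_bits`, `generate`, `invert`, `extract_bits`), the sign-based bit encoding, and the `tanh` stand-in for a denoising network are all hypothetical, chosen only to show why an additive coupling update can be undone step by step.

```python
import math
import random


def embed_bits(bits, seed=0):
    """Hypothetical embedding: bit 1 -> positive noise, bit 0 -> negative noise."""
    rng = random.Random(seed)
    # The 0.1 offset keeps every magnitude away from zero so signs stay readable.
    return [(1.0 if b else -1.0) * (abs(rng.gauss(0.0, 1.0)) + 0.1) for b in bits]


def _mix(values, step):
    """Stand-in for a denoising network; any deterministic function would do."""
    return [math.tanh(v + 0.1 * step) for v in values]


def _split(vector):
    """Split a vector into two equal halves (assumption: even length)."""
    if len(vector) % 2 != 0:
        raise ValueError("this sketch assumes an even-length vector")
    half = len(vector) // 2
    return vector[:half], vector[half:]


def generate(noise, steps=10):
    """Forward pass: alternate additive coupling updates on the two halves."""
    x, y = _split(noise)
    for t in range(steps):
        x = [a + m for a, m in zip(x, _mix(y, t))]  # x updated from current y
        y = [b + m for b, m in zip(y, _mix(x, t))]  # y updated from new x
    return x + y


def invert(image, steps=10):
    """Inverse pass: undo each coupling update in reverse order."""
    x, y = _split(image)
    for t in reversed(range(steps)):
        y = [b - m for b, m in zip(y, _mix(x, t))]  # recover y from the same x
        x = [a - m for a, m in zip(x, _mix(y, t))]  # recover x from recovered y
    return x + y


def extract_bits(noise):
    """Read the watermark back from the signs of the reconstructed noise."""
    return [1 if v > 0 else 0 for v in noise]


if __name__ == "__main__":
    watermark = [1, 0, 1, 1, 0, 0, 1, 0]
    noise = embed_bits(watermark)
    recovered = invert(generate(noise))
    print(extract_bits(recovered) == watermark)
```

Because each update adds a term computed only from the other half, subtracting the same term in reverse order reconstructs the starting vector up to floating-point rounding. A sampler without this structure has no such step-by-step inverse, which corresponds to the reconstruction error the abstract mentions.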
The Concept of the "Virtual Digital Human": Connotation, Prospects, and Technical Bottlenecks (cited by: 24)
3
Author: Jian Shengyu (简圣宇). 《上海师范大学学报(哲学社会科学版)》 (PKU Core), 2023, No. 4, pp. 45-57 (13 pages)
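As a by-product of society's digital transformation, the "virtual digital human" industry holds enormous market demand. Industries and fields ranging from the economy to culture and education all need virtual employees that can work collaboratively with humans. Today's "virtual digital humans" are still only digital characters with an appearance but no autonomous thought; as their driving programs are upgraded, however, they will have a deeper impact on human society. Once intelligent virtual platforms such as the "metaverse" are built, they will find even broader application and add a new kind of human resource to society. The emergence of ChatGPT brings a new opportunity: it can play a key role as the internal driver of future digital humans, endowing them with a "human-like mind". Even so, the virtual digital human industry still has many technical bottlenecks to resolve in areas such as automatic character generation and intelligent driving.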
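Keywords: artificial intelligence; metaverse; virtual digital human; ChatGPT; human-like mind; intelligent driving; intelligently generated content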
Prompt learning in computer vision: a survey (cited by: 1)
4
Authors: Yiming LEI, Jingqi LI, Zilong LI, Yuan CAO, Hongming SHAN. 《Frontiers of Information Technology & Electronic Engineering》 (SCIE, EI, CSCD), 2024, No. 1, pp. 42-63 (22 pages)
Prompt learning has attracted broad attention in computer vision since the large pre-trained vision-language models (VLMs) exploded. Based on the close relationship between vision and language information built by VLM, prompt learning becomes a crucial technique in many important applications such as artificial intelligence generated content (AIGC). In this survey, we provide a progressive and comprehensive review of visual prompt learning as related to AIGC. We begin by introducing VLM, the foundation of visual prompt learning. Then, we review the vision prompt learning methods and prompt-guided generative models, and discuss how to improve the efficiency of adapting AIGC models to specific downstream tasks. Finally, we provide some promising research directions concerning prompt learning.
Keywords: prompt learning; visual prompt tuning (VPT); image generation; image classification; artificial intelligence generated content (AIGC)