Abstract: Multimedia content is an integral part of Alibaba’s business ecosystem and is in great demand. Producing it usually requires advanced technology and substantial funding. With the rapid development of artificial intelligence (AI) in recent years, many AI-assisted tools for producing multimedia content have emerged to meet design requirements, and they are increasingly widely used across Alibaba’s business ecosystem. The main applications are auxiliary design, graphic design, video generation, and page production. This report introduces a general pipeline for these AI-assisted tools and presents four representative tools used in the Alibaba Group that cover the applications above. These tools have verified in practice the business value of combining multimedia content design with AI, which reflects the major role AI plays in promoting multimedia content production. The application prospects of this combination are also discussed.
Abstract: The China-US Million Book Digital Library Project (Million Book Project) is an international cooperation program between China and the US. One million digitized books are not the project's ultimate goal, however, but a first step toward universal access to human knowledge. In particular, the library poses four challenges concerning new ways to analyze, process, operate, visualize, and interact with its digital media resources. To tackle these challenges, the North China Centre of the Million Book Project (at the Chinese Academy of Sciences) has initiated several innovative research projects in areas such as multimedia content analysis and retrieval, bilingual services, multimodal information presentation, and knowledge-based organization and services. In this keynote speech, we briefly review our work in these areas and argue that, through technological cooperation on these research topics, the project will develop a top-level digital library platform for the million-book library.
Abstract: Peer-to-peer (P2P) technologies have emerged as a powerful and scalable communication model for large-scale content sharing. However, because they lack rich semantic specifications, they do not yet offer optimized management of heterogeneous aggregated content. To overcome these shortcomings, we elaborated a reference model of a P2P architecture for dynamic aggregation, sharing, and retrieval of heterogeneous multimedia content (simple or aggregated). This architecture was mainly developed under the CAM4Home European research project and is fully based on the CAM4Home semantic metadata model. This semantic model relies on RDF (Resource Description Framework); it is rich yet simple enough, extensible, and dedicated to describing any kind of multimedia content. In this paper, we detail and evaluate an original semantic-based community network architecture for heterogeneous multimedia content sharing and retrieval. Within the presented architecture, multimedia content is managed according to its associated CAM4Home semantic metadata through a structured P2P topology. This topology relies on a semantically enhanced DHT (Distributed Hash Table) and is complemented by an additional indexing system that offers semantic storage and search facilities and overcomes the exact-match keyword limitation of DHTs.
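The exact-match limitation mentioned at the end of this abstract can be illustrated with a minimal sketch. It is not the CAM4Home design, which the abstract does not detail. It assumes a single-process toy DHT and uses a simple term-level index as a stand-in for the semantic indexing system; the names ToyDHT, publish, and search, the "term:" key prefix, and the metadata fields are all hypothetical.

```python
import hashlib


class ToyDHT:
    """Single-process stand-in for a DHT: each key is hashed onto one of a few nodes."""

    def __init__(self, n_nodes=4):
        self.nodes = [dict() for _ in range(n_nodes)]

    def _node_for(self, key):
        # Hash the exact key string; similar keys land on unrelated nodes.
        digest = hashlib.sha1(key.encode("utf-8")).hexdigest()
        return self.nodes[int(digest, 16) % len(self.nodes)]

    def put(self, key, value):
        self._node_for(key).setdefault(key, set()).add(value)

    def get(self, key):
        return set(self._node_for(key).get(key, set()))


def publish(dht, content_id, metadata):
    """Store content under its full title and, additionally, under each metadata term."""
    title = metadata["title"].lower()
    dht.put(title, content_id)  # plain DHT entry: exact match only
    terms = title.split() + [k.lower() for k in metadata.get("keywords", [])]
    for term in terms:
        dht.put("term:" + term, content_id)  # hypothetical secondary index entry


def search(dht, query):
    """Return the content ids whose indexed terms include every query term."""
    results = [dht.get("term:" + t) for t in query.lower().split()]
    return set.intersection(*results) if results else set()


dht = ToyDHT()
publish(dht, "video-1", {"title": "Alpine Hiking Trip", "keywords": ["mountain"]})
publish(dht, "photo-7", {"title": "Alpine Lake", "keywords": ["water"]})
```

In this sketch, dht.get("alpine") finds nothing because no content was stored under that exact key, whereas search(dht, "alpine") returns both items through the additional index.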
Funding: Supported by the National Development and Reform Commission Information Security Special Fund and the National Key Basic Research Program of China (973 Program) under Grant No. 2007CB311203.
Abstract: With the development of cloud-based data centers and multimedia technologies, cloud-based multimedia service systems have received increasing attention. Audio highlight detection plays an important role in such systems. In this paper, we propose a novel unsupervised method for extracting audio highlight effects in a cloud-based multimedia service system. The method first extracts audio features from each audio document. A spectral clustering scheme then decomposes the audio document into several audio effects, and the TF-IDF method is used to label the highlight effect. Experiments designed to evaluate the proposed method show that it achieves satisfactory results.
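The TF-IDF labeling step can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: it assumes that feature extraction and spectral clustering have already assigned one effect label to each audio segment, that each audio document is treated as a bag of those labels, and that the effect with the highest TF-IDF score is taken as the highlight. The function names and the labels ("speech", "cheer", and so on) are hypothetical.

```python
import math
from collections import Counter


def tfidf_scores(documents):
    """Compute TF-IDF per effect label; documents is a list of label lists (one label per segment)."""
    n_docs = len(documents)
    doc_freq = Counter()
    for labels in documents:
        doc_freq.update(set(labels))  # count each effect once per document
    scores = []
    for labels in documents:
        counts = Counter(labels)
        total = len(labels)
        scores.append({
            effect: (count / total) * math.log(n_docs / doc_freq[effect])
            for effect, count in counts.items()
        })
    return scores


def highlight_effect(doc_scores):
    """Pick the effect with the highest TF-IDF score in one document."""
    return max(doc_scores, key=doc_scores.get)


# Hypothetical cluster labels for three audio documents.
docs = [
    ["speech", "speech", "cheer", "speech"],
    ["speech", "music", "speech"],
    ["speech", "speech", "speech", "whistle"],
]
scores = tfidf_scores(docs)
highlights = [highlight_effect(s) for s in scores]
```

An effect that occurs in every document, such as "speech" here, gets an inverse document frequency of zero, so the rarer effects stand out as highlight candidates.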
Funding: Supported by MOST under Grant Nos. 105-2628-E-224-001-MY3 and 103-2221-E-224-034-MY2.
Abstract: Straightforward image resizing operators that ignore image content (e.g., uniform scaling) usually cannot produce satisfactory results, whereas content-aware image retargeting aims to change the image size arbitrarily while preserving visually prominent features. In this paper, a cluster-based, saliency-guided seam carving algorithm for content-aware image retargeting is proposed. The main drawback of the original seam carving algorithm is that it relies only on a gradient-based image importance map. To address this, we integrate a gradient-based map and a cluster-based saliency map to generate a more reliable importance map, which yields better single-image retargeting results. Experimental results demonstrate the efficacy of the proposed algorithm.
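The idea of guiding seam carving with a combined importance map can be sketched as follows. This is a minimal illustration, not the authors' algorithm: the abstract does not say how the two maps are integrated, so a weighted sum with a hypothetical weight alpha is assumed, and the saliency map is taken as given rather than computed by clustering. The seam search itself is the standard dynamic-programming formulation for a vertical seam.

```python
def combine_maps(gradient, saliency, alpha=0.5):
    """Blend two equally sized 2D maps; alpha is a hypothetical mixing weight."""
    return [[alpha * g + (1 - alpha) * s for g, s in zip(g_row, s_row)]
            for g_row, s_row in zip(gradient, saliency)]


def min_vertical_seam(importance):
    """Return one column index per row for the connected seam of least total importance."""
    rows, cols = len(importance), len(importance[0])
    cost = [list(importance[0])]
    for r in range(1, rows):
        prev = cost[-1]
        row = []
        for c in range(cols):
            lo, hi = max(c - 1, 0), min(c + 1, cols - 1)
            row.append(importance[r][c] + min(prev[lo:hi + 1]))
        cost.append(row)
    # Backtrack from the cheapest cell in the last row.
    c = min(range(cols), key=lambda j: cost[-1][j])
    seam = [c]
    for r in range(rows - 1, 0, -1):
        lo, hi = max(c - 1, 0), min(c + 1, cols - 1)
        c = min(range(lo, hi + 1), key=lambda j: cost[r - 1][j])
        seam.append(c)
    seam.reverse()
    return seam


def remove_seam(image, seam):
    """Delete one pixel per row, narrowing the image by one column."""
    return [row[:c] + row[c + 1:] for row, c in zip(image, seam)]


# Toy maps: column 1 has low gradient but high saliency.
gradient = [[5, 1, 5, 1]] * 3
saliency = [[0, 9, 0, 0]] * 3
seam_gradient_only = min_vertical_seam(gradient)
seam_combined = min_vertical_seam(combine_maps(gradient, saliency))
```

On these toy maps, the gradient-only seam runs through the salient column 1, while the combined map steers the seam to column 3 and leaves the salient column intact.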
Funding: Supported by the National Natural Science Foundation of China under Grant Nos. 60573106, 60402027, and 60573131; the Natural Science Foundation of Jiangsu Province of China under Grant No. BK2005411; and the National Basic Research Program of China (973 Program) under Grant No. 2002CB312002.