期刊文献+
共找到185篇文章
< 1 2 10 >
每页显示 20 50 100
Sparse Reconstructive Evidential Clustering for Multi-View Data
1
作者 Chaoyu Gong Yang You 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第2期459-473,共15页
Although many multi-view clustering(MVC) algorithms with acceptable performances have been presented, to the best of our knowledge, nearly all of them need to be fed with the correct number of clusters. In addition, t... Although many multi-view clustering(MVC) algorithms with acceptable performances have been presented, to the best of our knowledge, nearly all of them need to be fed with the correct number of clusters. In addition, these existing algorithms create only the hard and fuzzy partitions for multi-view objects,which are often located in highly-overlapping areas of multi-view feature space. The adoption of hard and fuzzy partition ignores the ambiguity and uncertainty in the assignment of objects, likely leading to performance degradation. To address these issues, we propose a novel sparse reconstructive multi-view evidential clustering algorithm(SRMVEC). Based on a sparse reconstructive procedure, SRMVEC learns a shared affinity matrix across views, and maps multi-view objects to a 2-dimensional humanreadable chart by calculating 2 newly defined mathematical metrics for each object. From this chart, users can detect the number of clusters and select several objects existing in the dataset as cluster centers. Then, SRMVEC derives a credal partition under the framework of evidence theory, improving the fault tolerance of clustering. Ablation studies show the benefits of adopting the sparse reconstructive procedure and evidence theory. Besides,SRMVEC delivers effectiveness on benchmark datasets by outperforming some state-of-the-art methods. 展开更多
关键词 Evidence theory multi-view clustering(MVC) optimization sparse reconstruction
下载PDF
Contrastive Consistency and Attentive Complementarity for Deep Multi-View Subspace Clustering
2
作者 Jiao Wang Bin Wu Hongying Zhang 《Computers, Materials & Continua》 SCIE EI 2024年第4期143-160,共18页
Deep multi-view subspace clustering (DMVSC) based on self-expression has attracted increasing attention dueto its outstanding performance and nonlinear application. However, most existing methods neglect that viewpriv... Deep multi-view subspace clustering (DMVSC) based on self-expression has attracted increasing attention dueto its outstanding performance and nonlinear application. However, most existing methods neglect that viewprivatemeaningless information or noise may interfere with the learning of self-expression, which may lead to thedegeneration of clustering performance. In this paper, we propose a novel framework of Contrastive Consistencyand Attentive Complementarity (CCAC) for DMVsSC. CCAC aligns all the self-expressions of multiple viewsand fuses them based on their discrimination, so that it can effectively explore consistent and complementaryinformation for achieving precise clustering. Specifically, the view-specific self-expression is learned by a selfexpressionlayer embedded into the auto-encoder network for each view. To guarantee consistency across views andreduce the effect of view-private information or noise, we align all the view-specific self-expressions by contrastivelearning. The aligned self-expressions are assigned adaptive weights by channel attention mechanism according totheir discrimination. Then they are fused by convolution kernel to obtain consensus self-expression withmaximumcomplementarity ofmultiple views. Extensive experimental results on four benchmark datasets and one large-scaledataset of the CCAC method outperformother state-of-the-artmethods, demonstrating its clustering effectiveness. 展开更多
关键词 Deep multi-view subspace clustering contrastive learning adaptive fusion self-expression learning
下载PDF
Low-Rank Multi-View Subspace Clustering Based on Sparse Regularization
3
作者 Yan Sun Fanlong Zhang 《Journal of Computer and Communications》 2024年第4期14-30,共17页
Multi-view Subspace Clustering (MVSC) emerges as an advanced clustering method, designed to integrate diverse views to uncover a common subspace, enhancing the accuracy and robustness of clustering results. The signif... Multi-view Subspace Clustering (MVSC) emerges as an advanced clustering method, designed to integrate diverse views to uncover a common subspace, enhancing the accuracy and robustness of clustering results. The significance of low-rank prior in MVSC is emphasized, highlighting its role in capturing the global data structure across views for improved performance. However, it faces challenges with outlier sensitivity due to its reliance on the Frobenius norm for error measurement. Addressing this, our paper proposes a Low-Rank Multi-view Subspace Clustering Based on Sparse Regularization (LMVSC- Sparse) approach. Sparse regularization helps in selecting the most relevant features or views for clustering while ignoring irrelevant or noisy ones. This leads to a more efficient and effective representation of the data, improving the clustering accuracy and robustness, especially in the presence of outliers or noisy data. By incorporating sparse regularization, LMVSC-Sparse can effectively handle outlier sensitivity, which is a common challenge in traditional MVSC methods relying solely on low-rank priors. Then Alternating Direction Method of Multipliers (ADMM) algorithm is employed to solve the proposed optimization problems. Our comprehensive experiments demonstrate the efficiency and effectiveness of LMVSC-Sparse, offering a robust alternative to traditional MVSC methods. 展开更多
关键词 CLUSTERING multi-view Subspace Clustering Low-Rank Prior Sparse Regularization
下载PDF
Multi-View & Transfer Learning for Epilepsy Recognition Based on EEG Signals
4
作者 Jiali Wang Bing Li +7 位作者 Chengyu Qiu Xinyun Zhang Yuting Cheng Peihua Wang Ta Zhou Hong Ge Yuanpeng Zhang Jing Cai 《Computers, Materials & Continua》 SCIE EI 2023年第6期4843-4866,共24页
Epilepsy is a central nervous system disorder in which brain activity becomes abnormal.Electroencephalogram(EEG)signals,as recordings of brain activity,have been widely used for epilepsy recognition.To study epilep-ti... Epilepsy is a central nervous system disorder in which brain activity becomes abnormal.Electroencephalogram(EEG)signals,as recordings of brain activity,have been widely used for epilepsy recognition.To study epilep-tic EEG signals and develop artificial intelligence(AI)-assist recognition,a multi-view transfer learning(MVTL-LSR)algorithm based on least squares regression is proposed in this study.Compared with most existing multi-view transfer learning algorithms,MVTL-LSR has two merits:(1)Since traditional transfer learning algorithms leverage knowledge from different sources,which poses a significant risk to data privacy.Therefore,we develop a knowledge transfer mechanism that can protect the security of source domain data while guaranteeing performance.(2)When utilizing multi-view data,we embed view weighting and manifold regularization into the transfer framework to measure the views’strengths and weaknesses and improve generalization ability.In the experimental studies,12 different simulated multi-view&transfer scenarios are constructed from epileptic EEG signals licensed and provided by the Uni-versity of Bonn,Germany.Extensive experimental results show that MVTL-LSR outperforms baselines.The source code will be available on https://github.com/didid5/MVTL-LSR. 展开更多
关键词 multi-view learning transfer learning least squares regression EPILEPSY EEG signals
下载PDF
Diverse Deep Matrix Factorization With Hypergraph Regularization for Multi-View Data Representation
5
作者 Haonan Huang Guoxu Zhou +2 位作者 Naiyao Liang Qibin Zhao Shengli Xie 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第11期2154-2167,共14页
Deep matrix factorization(DMF)has been demonstrated to be a powerful tool to take in the complex hierarchical information of multi-view data(MDR).However,existing multiview DMF methods mainly explore the consistency o... Deep matrix factorization(DMF)has been demonstrated to be a powerful tool to take in the complex hierarchical information of multi-view data(MDR).However,existing multiview DMF methods mainly explore the consistency of multi-view data,while neglecting the diversity among different views as well as the high-order relationships of data,resulting in the loss of valuable complementary information.In this paper,we design a hypergraph regularized diverse deep matrix factorization(HDDMF)model for multi-view data representation,to jointly utilize multi-view diversity and a high-order manifold in a multilayer factorization framework.A novel diversity enhancement term is designed to exploit the structural complementarity between different views of data.Hypergraph regularization is utilized to preserve the high-order geometry structure of data in each view.An efficient iterative optimization algorithm is developed to solve the proposed model with theoretical convergence analysis.Experimental results on five real-world data sets demonstrate that the proposed method significantly outperforms stateof-the-art multi-view learning approaches. 展开更多
关键词 Deep matrix factorization(DMF) diversity hypergraph regularization multi-view data representation(MDR)
下载PDF
Relational graph location network for multi-view image localization
6
作者 YANG Yukun LIU Xiangdong 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2023年第2期460-468,共9页
In multi-view image localization task,the features of the images captured from different views should be fused properly.This paper considers the classification-based image localization problem.We propose the relationa... In multi-view image localization task,the features of the images captured from different views should be fused properly.This paper considers the classification-based image localization problem.We propose the relational graph location network(RGLN)to perform this task.In this network,we propose a heterogeneous graph construction approach for graph classification tasks,which aims to describe the location in a more appropriate way,thereby improving the expression ability of the location representation module.Experiments show that the expression ability of the proposed graph construction approach outperforms the compared methods by a large margin.In addition,the proposed localization method outperforms the compared localization methods by around 1.7%in terms of meter-level accuracy. 展开更多
关键词 multi-view image localization graph construction heterogeneous graph graph neural network
下载PDF
ER-Net:Efficient Recalibration Network for Multi-ViewMulti-Person 3D Pose Estimation
7
作者 Mi Zhou Rui Liu +1 位作者 Pengfei Yi Dongsheng Zhou 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第8期2093-2109,共17页
Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application scenarios.With the introduction of end-to-end direct regression methods,the fi... Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application scenarios.With the introduction of end-to-end direct regression methods,the field has entered a new stage of development.However,the regression results of joints that are more heavily influenced by external factors are not accurate enough even for the optimal method.In this paper,we propose an effective feature recalibration module based on the channel attention mechanism and a relative optimal calibration strategy,which is applied to themulti-viewmulti-person 3D human pose estimation task to achieve improved detection accuracy for joints that are more severely affected by external factors.Specifically,it achieves relative optimal weight adjustment of joint feature information through the recalibration module and strategy,which enables the model to learn the dependencies between joints and the dependencies between people and their corresponding joints.We call this method as the Efficient Recalibration Network(ER-Net).Finally,experiments were conducted on two benchmark datasets for this task,Campus and Shelf,in which the PCP reached 97.3% and 98.3%,respectively. 展开更多
关键词 multi-view multi-person pose estimation attention mechanism computer vision
下载PDF
现代测绘技术在古建烫样数字化项目中的应用
8
作者 王莫 《文物保护与考古科学》 北大核心 2024年第2期171-176,共6页
烫样是研究清代皇家建筑的珍贵文物,为配合文物信息数字化工作的开展,针对烫样数据的采集和处理方法进行了深入研究。在综合考虑多种数据采集方法的特点及烫样数字化记录的难点后,选择使用手持式三维扫描仪与多视角三维重建技术相结合... 烫样是研究清代皇家建筑的珍贵文物,为配合文物信息数字化工作的开展,针对烫样数据的采集和处理方法进行了深入研究。在综合考虑多种数据采集方法的特点及烫样数字化记录的难点后,选择使用手持式三维扫描仪与多视角三维重建技术相结合的方式来进行烫样的基础数据采集,并通过实践总结出了一套完整的数据采集和处理工作流程。与此同时,针对成果可能的应用方式做了些许尝试,且为数据的后期维护与进一步利用拟定了数据管理系统的建设方案。 展开更多
关键词 烫样 手持三维扫描 多视角三维重建 彩色纹理模型 正射影像图 数据管理系统
下载PDF
融合图神经网络与多视图学习的社区问答专家推荐方法研究
9
作者 熊玮楠 《现代电子技术》 北大核心 2024年第9期115-118,共4页
互联网高速发展背景下,社区问答网站提问者有着更强烈的求知需求,海量数据为提问者识别有效信息带来困难,为提问者推荐更专业的专家用户对问题进行回答显得尤为重要。针对传统社区问答专家推荐方法难以准确计算出提问者提出的目标问题... 互联网高速发展背景下,社区问答网站提问者有着更强烈的求知需求,海量数据为提问者识别有效信息带来困难,为提问者推荐更专业的专家用户对问题进行回答显得尤为重要。针对传统社区问答专家推荐方法难以准确计算出提问者提出的目标问题和候选专家之间的相关性等问题,为了提高社区问答网站中专家推荐的效率,构建问题节点关系无向图,利用图神经网络GraphSAGE提取节点的二阶邻居信息,并使用多视图学习方法学习不同视图间的互补信息,最终获取目标问题文本和候选专家历史问题集丰富的向量表示,用来计算出目标问题与候选专家之间的匹配度,进而推荐出最适合回答目标问题的专家用户。实验结果表明,在不同的社区问答专家推荐方法上,文中方法在评价指标MRR、NDCG@10上均取得了更优的推荐效果。 展开更多
关键词 社区问答 专家推荐 图神经网络 多视图学习 推荐系统 深度学习模型
下载PDF
Multi-view feature fusion for rolling bearing fault diagnosis using random forest and autoencoder 被引量:6
10
作者 Sun Wenqing Deng Aidong +4 位作者 Deng Minqiang Zhu Jing Zhai Yimeng Cheng Qiang Liu Yang 《Journal of Southeast University(English Edition)》 EI CAS 2019年第3期302-309,共8页
To improve the accuracy and robustness of rolling bearing fault diagnosis under complex conditions, a novel method based on multi-view feature fusion is proposed. Firstly, multi-view features from perspectives of the ... To improve the accuracy and robustness of rolling bearing fault diagnosis under complex conditions, a novel method based on multi-view feature fusion is proposed. Firstly, multi-view features from perspectives of the time domain, frequency domain and time-frequency domain are extracted through the Fourier transform, Hilbert transform and empirical mode decomposition (EMD).Then, the random forest model (RF) is applied to select features which are highly correlated with the bearing operating state. Subsequently, the selected features are fused via the autoencoder (AE) to further reduce the redundancy. Finally, the effectiveness of the fused features is evaluated by the support vector machine (SVM). The experimental results indicate that the proposed method based on the multi-view feature fusion can effectively reflect the difference in the state of the rolling bearing, and improve the accuracy of fault diagnosis. 展开更多
关键词 multi-view features feature fusion fault diagnosis rolling bearing machine learning
下载PDF
Interactive transport of multi-view videos for 3DTV applications 被引量:4
11
作者 KURUTEPE Engin CIVANLAR M.Reha TEKALP A.Murat 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2006年第5期830-836,共7页
The authors propose a novel method for transporting multi-view videos that aims to keep the bandwidth requirements on both end-users and servers as low as possible. The method is based on application layer multicast, ... The authors propose a novel method for transporting multi-view videos that aims to keep the bandwidth requirements on both end-users and servers as low as possible. The method is based on application layer multicast, where each end point re- ceives only a selected number of views required for rendering video from its current viewpoint at any given time. The set of selected videos changes in real time as the user’s viewpoint changes because of head or eye movements. Techniques for reducing the black-outs during fast viewpoint changes were investigated. The performance of the approach was studied through network experiments. 展开更多
关键词 3DTV multi-view video Application-layer multicast Join-latency
下载PDF
Color Correction for Multi-view Video Using Energy Minimization of View Networks 被引量:4
12
作者 Kenji Yamamoto Ryutaro Oi 《International Journal of Automation and computing》 EI 2008年第3期234-245,共12页
Systems using numerous cameras are emerging in many fields due to their ease of production and reduced cost, and one of the fields where they are expected to be used more actively in the near future is in image-based ... Systems using numerous cameras are emerging in many fields due to their ease of production and reduced cost, and one of the fields where they are expected to be used more actively in the near future is in image-based rendering (IBR). Color correction between views is necessary to use multi-view systems in IBR to make audiences feel comfortable when views are switched or when a free viewpoint video is displayed. Color correction usually involves two steps: the first is to adjust camera parameters such as gain, brightness, and aperture before capture, and the second is to modify captured videos through image processing. This paper deals with the latter, which does not need a color pattern board. The proposed method uses scale invariant feature transform (SIFT) to detect correspondences, treats RGB channels independently, calculates lookup tables with an energy-minimization approach, and corrects captured video with these tables. The experimental results reveal that this approach works well. 展开更多
关键词 multi-view color correction image-based rendering (IBR) view networks (VNs) scale invariant feature transform (SIFT) energy minimization.
下载PDF
A Multi-View Gait Recognition Method Using Deep Convolutional Neural Network and Channel Attention Mechanism 被引量:2
13
作者 Jiabin Wang Kai Peng 《Computer Modeling in Engineering & Sciences》 SCIE EI 2020年第10期345-363,共19页
In many existing multi-view gait recognition methods based on images or video sequences,gait sequences are usually used to superimpose and synthesize images and construct energy-like template.However,information may b... In many existing multi-view gait recognition methods based on images or video sequences,gait sequences are usually used to superimpose and synthesize images and construct energy-like template.However,information may be lost during the process of compositing image and capture EMG signals.Errors and the recognition accuracy may be introduced and affected respectively by some factors such as period detection.To better solve the problems,a multi-view gait recognition method using deep convolutional neural network and channel attention mechanism is proposed.Firstly,the sliding time window method is used to capture EMG signals.Then,the back-propagation learning algorithm is used to train each layer of convolution,which improves the learning ability of the convolutional neural network.Finally,the channel attention mechanism is integrated into the neural network,which will improve the ability of expressing gait features.And a classifier is used to classify gait.As can be shown from experimental results on two public datasets,OULP and CASIA-B,the recognition rate of the proposed method can be achieved at 88.44%and 97.25%respectively.As can be shown from the comparative experimental results,the proposed method has better recognition effect than several other newer convolutional neural network methods.Therefore,the combination of convolutional neural network and channel attention mechanism is of great value for gait recognition. 展开更多
关键词 EMG signal capture channel attention mechanism convolutional neural network multi-view gait recognition gait characteristics BACK-PROPAGATION
下载PDF
Feature Fusion Multi-View Hashing Based on Random Kernel Canonical Correlation Analysis 被引量:2
14
作者 Junshan Tan Rong Duan +2 位作者 Jiaohua Qin Xuyu Xiang Yun Tan 《Computers, Materials & Continua》 SCIE EI 2020年第5期675-689,共15页
Hashing technology has the advantages of reducing data storage and improving the efficiency of the learning system,making it more and more widely used in image retrieval.Multi-view data describes image information mor... Hashing technology has the advantages of reducing data storage and improving the efficiency of the learning system,making it more and more widely used in image retrieval.Multi-view data describes image information more comprehensively than traditional methods using a single-view.How to use hashing to combine multi-view data for image retrieval is still a challenge.In this paper,a multi-view fusion hashing method based on RKCCA(Random Kernel Canonical Correlation Analysis)is proposed.In order to describe image content more accurately,we use deep learning dense convolutional network feature DenseNet to construct multi-view by combining GIST feature or BoW_SIFT(Bag-of-Words model+SIFT feature)feature.This algorithm uses RKCCA method to fuse multi-view features to construct association features and apply them to image retrieval.The algorithm generates binary hash code with minimal distortion error by designing quantization regularization terms.A large number of experiments on benchmark datasets show that this method is superior to other multi-view hashing methods. 展开更多
关键词 HASHING multi-view data random kernel canonical correlation analysis feature fusion deep learning
下载PDF
Multi-view video color correction using dynamic programming 被引量:1
15
作者 Shao Feng Jiang Gangyi Yu Mei 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2008年第6期1115-1120,共6页
Color inconsistency between views is an important problem to be solved in multi-view video systems. A multi-view video color correction method using dynamic programming is proposed. Three-dimensional histograms are co... Color inconsistency between views is an important problem to be solved in multi-view video systems. A multi-view video color correction method using dynamic programming is proposed. Three-dimensional histograms are constructed with sequential conditional probability in HSI color space. Then, dynamic programming is used to seek the best color mapping relation with the minimum cost path between target image histogram and source image histogram. Finally, video tracking technique is performed to correct multi-view video. Experimental results show that the proposed method can obtain better subjective and objective performance in color correction. 展开更多
关键词 multi-view video color correction dynamic programming video tracking
下载PDF
Multi-View Auxiliary Diagnosis Algorithm for Lung Nodules 被引量:1
16
作者 Shi Qiu Bin Li +2 位作者 Tao Zhou Feng Li Ting Liang 《Computers, Materials & Continua》 SCIE EI 2022年第9期4897-4910,共14页
Lung is an important organ of human body.More and more people are suffering from lung diseases due to air pollution.These diseases are usually highly infectious.Such as lung tuberculosis,novel coronavirus COVID-19,etc... Lung is an important organ of human body.More and more people are suffering from lung diseases due to air pollution.These diseases are usually highly infectious.Such as lung tuberculosis,novel coronavirus COVID-19,etc.Lung nodule is a kind of high-density globular lesion in the lung.Physicians need to spend a lot of time and energy to observe the computed tomography image sequences to make a diagnosis,which is inefficient.For this reason,the use of computer-assisted diagnosis of lung nodules has become the current main trend.In the process of computer-aided diagnosis,how to reduce the false positive rate while ensuring a low missed detection rate is a difficulty and focus of current research.To solve this problem,we propose a three-dimensional optimization model to achieve the extraction of suspected regions,improve the traditional deep belief network,and to modify the dispersion matrix between classes.We construct a multi-view model,fuse local three-dimensional information into two-dimensional images,and thereby to reduce the complexity of the algorithm.And alleviate the problem of unbalanced training caused by only a small number of positive samples.Experiments show that the false positive rate of the algorithm proposed in this paper is as low as 12%,which is in line with clinical application standards. 展开更多
关键词 Lung nodules deep belief network computer-aided diagnosis multi-view
下载PDF
Color compensation for multi-view video coding based on diversity of cameras 被引量:1
17
作者 Jun-yan HUO Yi-lin CHANG +1 位作者 Hai-tao YANG Shuai WAN 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2008年第12期1631-1637,共7页
A novel color compensation method for multi-view video coding (MVC) is proposed, which efficiently exploits the inter-view dependencies between views with the existence of color mismatch caused by the diversity of cam... A novel color compensation method for multi-view video coding (MVC) is proposed, which efficiently exploits the inter-view dependencies between views with the existence of color mismatch caused by the diversity of cameras. A color compensation model is developed in RGB channels and then extended to YCbCr channels for practical use. A modified inter-view reference picture is constructed based on the color compensation model, which is more similar to the coding picture than the original inter-view reference picture. Moreover, the color compensation factors can be derived in both encoder and decoder, therefore no additional data need to be transmitted to the decoder. The experimental results show that the proposed method improves the coding efficiency of MVC and maintains good subjective quality. 展开更多
关键词 multi-view video coding (MVC) H.264/AVC Color compensation Diversity of cameras
下载PDF
Three-dimensional,isotropic imaging of mouse brain using multi-view deconvolution light sheet microscopy 被引量:1
18
作者 Sa Liu Jun Nie +3 位作者 Yusha Li Tingting Yu Dan Zhu Peng Fei 《Journal of Innovative Optical Health Sciences》 SCIE EI CAS 2017年第5期94-100,共7页
We present a threedimensional(3D)isotropic imaging of mouse brain using light-sheet fuo-rescent microscopy(LSFM)in conjumction with a multi-view imaging computation.Unlike common single view LSFM is used for mouse bra... We present a threedimensional(3D)isotropic imaging of mouse brain using light-sheet fuo-rescent microscopy(LSFM)in conjumction with a multi-view imaging computation.Unlike common single view LSFM is used for mouse brain imaging,the brain tissue is 3D imaged under eight views in our study,by a home-built selective plane ilumination microscopy(SPIM).An output image containing complete structural infornation as well as significantly improved res olution(~4 times)are then computed based on these eight views of data,using a bead-guided multi-view registration and deconvolution.With superior imaging quality,the astrocyte and pyrarmidal neurons together with their subcellular nerve fbers can be clearly visualized and segmented.With further incuding other computational methods,this study can be potentially scaled up to map the conectome of whole mouse brain with a simple light.sheet microscope. 展开更多
关键词 Light sheet fuorescent microscopy multi-view dconvolution mouse brain imaging ISOTROPIC
下载PDF
Frame-layer bit allocation for multi-view video coding based on frame complexity estimation 被引量:1
19
作者 严涛 安平 +3 位作者 沈礼权 李振纲 王贺 张兆杨 《Journal of Shanghai University(English Edition)》 2010年第1期50-54,共5页
Current multi-view video coding (MVC) reference model in joint video team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC by improving the quadratic rate... Current multi-view video coding (MVC) reference model in joint video team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC by improving the quadratic rate-distortion (R-D) model. We reasonably allocate bit-rate among views based on the correlation analysisl The proposed algorithm consists of three levels to control the rate bits more accurately, of which the frame layer allocates bits according to the frame complexity and the temporal activity. Extensive experiments show that the proposed algorithm can control the bit rate efficiently. 展开更多
关键词 multi-view video coding (MVC) rate control bit-allocation rate-distortion model correlation analysis frame complexity
下载PDF
Efficient fast mode decision using mode complexity for multi-view video coding 被引量:1
20
作者 王凤随 沈庆宏 都思丹 《Journal of Central South University》 SCIE EI CAS 2014年第11期4244-4253,共10页
The variable block-size motion estimation(ME) and disparity estimation(DE) are adopted in multi-view video coding(MVC) to achieve high coding efficiency. However, much higher computational complexity is also introduce... The variable block-size motion estimation(ME) and disparity estimation(DE) are adopted in multi-view video coding(MVC) to achieve high coding efficiency. However, much higher computational complexity is also introduced in coding system, which hinders practical application of MVC. An efficient fast mode decision method using mode complexity is proposed to reduce the computational complexity. In the proposed method, mode complexity is firstly computed by using the spatial, temporal and inter-view correlation between the current macroblock(MB) and its neighboring MBs. Based on the observation that direct mode is highly possible to be the optimal mode, mode complexity is always checked in advance whether it is below a predefined threshold for providing an efficient early termination opportunity. If this early termination condition is not met, three mode types for the MBs are classified according to the value of mode complexity, i.e., simple mode, medium mode and complex mode, to speed up the encoding process by reducing the number of the variable block modes required to be checked. Furthermore, for simple and medium mode region, the rate distortion(RD) cost of mode 16×16 in the temporal prediction direction is compared with that of the disparity prediction direction, to determine in advance whether the optimal prediction direction is in the temporal prediction direction or not, for skipping unnecessary disparity estimation. Experimental results show that the proposed method is able to significantly reduce the computational load by 78.79% and the total bit rate by 0.07% on average, while only incurring a negligible loss of PSNR(about 0.04 d B on average), compared with the full mode decision(FMD) in the reference software of MVC. 展开更多
关键词 multi-view video coding mode decision mode complexity computational complexity
下载PDF
上一页 1 2 10 下一页 到第
使用帮助 返回顶部