Recently,the combination of video services and 5G networks have been gaining attention in the wireless communication realm.With the brisk advancement in 5G network usage and the massive popularity of threedimensional ...Recently,the combination of video services and 5G networks have been gaining attention in the wireless communication realm.With the brisk advancement in 5G network usage and the massive popularity of threedimensional video streaming,the quality of experience(QoE)of video in 5G systems has been receiving overwhelming significance from both customers and service provider ends.Therefore,effectively categorizing QoE-aware video streaming is imperative for achieving greater client satisfaction.This work makes the following contribution:First,a simulation platform based on NS-3 is introduced to analyze and improve the performance of video services.The simulation is formulated to offer real-time measurements,saving the expensive expenses associated with real-world equipment.Second,A valuable framework for QoE-aware video streaming categorization is introduced in 5G networks based on machine learning(ML)by incorporating the hyperparameter tuning(HPT)principle.It implements an enhanced hyperparameter tuning(EHPT)ensemble and decision tree(DT)classifier for video streaming categorization.The performance of the ML approach is assessed by considering precision,accuracy,recall,and computation time metrics for manifesting the superiority of these classifiers regarding video streaming categorization.This paper demonstrates that our ML classifiers achieve QoE prediction accuracy of 92.59%for(EHPT)ensemble and 87.037%for decision tree(DT)classifiers.展开更多
In recent years,real-time video streaming has grown in popularity.The growing popularity of the Internet of Things(IoT)and other wireless heterogeneous networks mandates that network resources be carefully apportioned...In recent years,real-time video streaming has grown in popularity.The growing popularity of the Internet of Things(IoT)and other wireless heterogeneous networks mandates that network resources be carefully apportioned among versatile users in order to achieve the best Quality of Experience(QoE)and performance objectives.Most researchers focused on Forward Error Correction(FEC)techniques when attempting to strike a balance between QoE and performance.However,as network capacity increases,the performance degrades,impacting the live visual experience.Recently,Deep Learning(DL)algorithms have been successfully integrated with FEC to stream videos across multiple heterogeneous networks.But these algorithms need to be changed to make the experience better without sacrificing packet loss and delay time.To address the previous challenge,this paper proposes a novel intelligent algorithm that streams video in multi-home heterogeneous networks based on network-centric characteristics.The proposed framework contains modules such as Intelligent Content Extraction Module(ICEM),Channel Status Monitor(CSM),and Adaptive FEC(AFEC).This framework adopts the Cognitive Learning-based Scheduling(CLS)Module,which works on the deep Reinforced Gated Recurrent Networks(RGRN)principle and embeds them along with the FEC to achieve better performances.The complete framework was developed using the Objective Modular Network Testbed in C++(OMNET++),Internet networking(INET),and Python 3.10,with Keras as the front end and Tensorflow 2.10 as the back end.With extensive experimentation,the proposed model outperforms the other existing intelligentmodels in terms of improving the QoE,minimizing the End-to-End Delay(EED),and maintaining the highest accuracy(98%)and a lower Root Mean Square Error(RMSE)value of 0.001.展开更多
Adaptive bitrate video streaming(ABR)has become a critical technique for mobile video streaming to cope with time-varying network conditions and different user preferences.However,there are still many problems in achi...Adaptive bitrate video streaming(ABR)has become a critical technique for mobile video streaming to cope with time-varying network conditions and different user preferences.However,there are still many problems in achieving high-quality ABR video streaming over cellular networks.Mobile Edge Computing(MEC)is a promising paradigm to overcome the above problems by providing video transcoding capability and caching the ABR video streaming within the radio access network(RAN).In this paper,we propose a flexible transcoding strategy to provide viewers with low-latency video streaming services in the MEC networks under the limited storage,computing,and spectrum resources.According to the information collected from users,the MEC server acts as a controlling component to adjust the transcoding strategy flexibly based on optimizing the video caching placement strategy.Specifically,we cache the proper bitrate version of the video segments at the edge servers and select the appropriate bitrate version of the video segments to perform transcoding under jointly considering access control,resource allocation,and user preferences.We formulate this problem as a nonconvex optimization and mixed combinatorial problem.Moreover,the simulation results indicate that our proposed algorithm can ensure a low-latency viewing experience for users.展开更多
With correlating with human perception, quality of experience(Qo E) is also an important measurement in evaluation of video quality in addition to quality of service(Qo S). A cross-layer scheme based on Lyapunov optim...With correlating with human perception, quality of experience(Qo E) is also an important measurement in evaluation of video quality in addition to quality of service(Qo S). A cross-layer scheme based on Lyapunov optimization framework for H.264/AVC video streaming over wireless Ad hoc networks is proposed, with increasing both Qo E and Qo S performances. Different from existing works, this scheme routes and schedules video packets according to the statuses of the frame buffers at the destination nodes to reduce buffer underflows and to increase video playout continuity. The waiting time of head-ofline packets of data queues are considered in routing and scheduling to reduce the average end-to-end delay of video sessions. Different types of packets are allocated with different priorities according to their generated rates under H.264/AVC. To reduce the computational complexity, a distributed media access control policy and a power control algorithm cooperating with the media access policy are proposed. Simulation results show that, compared with existing schemes, this scheme can improve both the Qo S and Qo E performances. The average peak signal-to-noise ratio(PSNR) of the received video streams is also increased.展开更多
In this paper,we investigate video quality enhancement using computation offloading to the mobile cloud computing(MCC)environment.Our objective is to reduce the computational complexity required to covert a low-resolu...In this paper,we investigate video quality enhancement using computation offloading to the mobile cloud computing(MCC)environment.Our objective is to reduce the computational complexity required to covert a low-resolution video to high-resolution video while minimizing computation at the mobile client and additional communication costs.To do so,we propose an energy-efficient computation offloading framework for video streaming services in a MCC over the fifth generation(5G)cellular networks.In the proposed framework,the mobile client offloads the computational burden for the video enhancement to the cloud,which renders the side information needed to enhance video without requiring much computation by the client.The cloud detects edges from the upsampled ultra-high-resolution video(UHD)and then compresses and transmits them as side information with the original low-resolution video(e.g.,full HD).Finally,the mobile client decodes the received content and integrates the SI and original content,which produces a high-quality video.In our extensive simulation experiments,we observed that the amount of computation needed to construct a UHD video in the client is 50%-60% lower than that required to decode UHD video compressed by legacy video encoding algorithms.Moreover,the bandwidth required to transmit a full HD video and its side information is around 70% lower than that required for a normal UHD video.The subjective quality of the enhanced UHD is similar to that of the original UHD video even though the client pays lower communication costs with reduced computing power.展开更多
Equation based TCP-friendly rate control (TFRC) protocol has been proposed to support video streaming applications. In order to improve TFRC performance in wireless channels, the link level automatic repeat request (A...Equation based TCP-friendly rate control (TFRC) protocol has been proposed to support video streaming applications. In order to improve TFRC performance in wireless channels, the link level automatic repeat request (ARQ) scheme is usually deployed. However, ARQ cannot ensure strict delay guarantees, especially over multihop links. This paper introduces a theoretical model to deduce an equation for packet size adjustment in transport layer to minimize retransmission delay by taking into con- sideration the causative reasons inducing retransmission in link layer. An enhanced TFRC (ETFRC) scheme is proposed inte- grating TFRC with variable packet size policy. Simulation results demonstrate that higher goodput, lower packet loss rate (PLR), lower frame transmission delay and jitter with good fairness can be achieved by our proposed mechanism.展开更多
For video streaming over lossy channels, intra refresh can mitigate the error-propagation effect caused by packet losses Besides some intra-mode macroblocks (MBs) generated by the "Lagrangian rate-distortion" or ...For video streaming over lossy channels, intra refresh can mitigate the error-propagation effect caused by packet losses Besides some intra-mode macroblocks (MBs) generated by the "Lagrangian rate-distortion" or "Sum of absolute difference" mode decision, the encoder or transcoder possibly needs to increase some "forced" intra-mode MBs for robust video streaming. Based on the error-propagation analysis in a group of pictures (GOP), we propose an unequal Forced-Intra-Refresh (FIR) scheme to improve packet loss resilience of video streaming. According to a GOP-level error-propagation model, the proposed unequal FIR scheme can optimally increase the unequal number of forced intra-mode MBs for different frames in a GOP. Simulation results showed that the proposed scheme can effectively enhance the robustness of video streaming under different channel conditions, and achieve about 0. 1-0.9 dB gains over the average FIR scheme in H.264/AVC tools.展开更多
A new rate allocation method for fine-granular scalability (FGS) coded bitstreams is presented in order to achieve smooth quality reconstruction of frames under channel conditions with a wide range of bandwidth variat...A new rate allocation method for fine-granular scalability (FGS) coded bitstreams is presented in order to achieve smooth quality reconstruction of frames under channel conditions with a wide range of bandwidth variation and improve the average PSNR of the whole sequence. Based on a quality weighted bit allocation method, a sliding window rate allocation method is proposed for the first time so that the window can slide along the video sequence with a certain sliding step. Experimental results show that, under dynamic bandwidth conditions, the proposed method can simultaneously satisfy the requirements for improving average PSNR of the whole video sequence greatly and reducing the fluctuations between adjacent frames greatly.展开更多
In the case of video streaming over wireless channels, burst errors may lead to serious video quality degradation. By jointly exploiting the scheduling mechanism on different communication layers, this paper proposes ...In the case of video streaming over wireless channels, burst errors may lead to serious video quality degradation. By jointly exploiting the scheduling mechanism on different communication layers, this paper proposes a quality-aware cross-layer scheduling scheme to achieve unequal error control for each Latency-constraint Frame Set (LFS) of a video stream. After a network-layer agent at base station firstly utilizes the network-layer packet scheduling to provide packet-granularity importance classifi-cation for the current LFS, a link-layer agent at base station further utilizes the Radio-Link-Unit (RLU) scheduling to implement finer selective retransmission of the current LFS. Under scheduling delay and bandwidth constraints, the proposed scheme can be aware of the application-layer quality and time-varying channel conditions, and hence burst errors can simply be shifted to lower-priority transmission units in the current LFS. Simulation results demonstrate that the proposed scheme has strong robustness against burst errors, and thus improves the overall received quality of the video stream over wireless channels.展开更多
Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers thelikelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions i...Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers thelikelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions in videostreams holds significant importance in computer vision research, as it aims to enhance exercise adherence, enableinstant recognition, advance fitness tracking technologies, and optimize fitness routines. However, existing actiondatasets often lack diversity and specificity for workout actions, hindering the development of accurate recognitionmodels. To address this gap, the Workout Action Video dataset (WAVd) has been introduced as a significantcontribution. WAVd comprises a diverse collection of labeled workout action videos, meticulously curated toencompass various exercises performed by numerous individuals in different settings. This research proposes aninnovative framework based on the Attention driven Residual Deep Convolutional-Gated Recurrent Unit (ResDCGRU)network for workout action recognition in video streams. Unlike image-based action recognition, videoscontain spatio-temporal information, making the task more complex and challenging. While substantial progresshas been made in this area, challenges persist in detecting subtle and complex actions, handling occlusions,and managing the computational demands of deep learning approaches. The proposed ResDC-GRU Attentionmodel demonstrated exceptional classification performance with 95.81% accuracy in classifying workout actionvideos and also outperformed various state-of-the-art models. The method also yielded 81.6%, 97.2%, 95.6%, and93.2% accuracy on established benchmark datasets, namely HMDB51, Youtube Actions, UCF50, and UCF101,respectively, showcasing its superiority and robustness in action recognition. The findings suggest practicalimplications in real-world scenarios where precise video action recognition is paramount, addressing the persistingchallenges in the field. TheWAVd dataset serves as a catalyst for the development ofmore robust and effective fitnesstracking systems and ultimately promotes healthier lifestyles through improved exercise monitoring and analysis.展开更多
Video streaming services are trending to be deployed on cloud. Cloud computing offers better stability and lower price than traditional IT facilities. Huge storage capacity is essential for video streaming service. Mo...Video streaming services are trending to be deployed on cloud. Cloud computing offers better stability and lower price than traditional IT facilities. Huge storage capacity is essential for video streaming service. More and more cloud providers appear so there are increasing cloud platforms to choose. A better choice is to use more than one data center, which is called multi-cloud. In this paper a closed-loop approach is proposed for optimizing Quality of Service (QoS) and cost. Modules of monitoring and controlling data centers are required as well as the application feedback such as video streaming services. An algorithm is proposed to help choose cloud providers and data centers in a multi-cloud environment as a video service manager. Performance with different video service workloads are evaluated. Compared with using only one cloud provider, dynamically deploying services in multi-cloud is better in aspects of both cost and QoS. If cloud service costs are different among data centers, the algorithm will help make choices to lower the cost and keep a high QoS.展开更多
A new wave of networks labeled Peer-to-Peer(P2P) networks attracts more researchers and rapidly becomes one of the most popular applications.In order to matching P2 P logical overlay network with physical topology,the...A new wave of networks labeled Peer-to-Peer(P2P) networks attracts more researchers and rapidly becomes one of the most popular applications.In order to matching P2 P logical overlay network with physical topology,the position-based topology has been proposed.The proposed topology not only focuses on non-functional characteristics such as scalability,reliability,fault-tolerance,selforganization,decentralization and fairness,but also functional characteristics are addressed as well.The experimental results show that the hybrid complex topology achieves better characteristics than other complex networks' models like small-world and scale-free models;since most of the real-life networks are both scale-free and small-world networks,it may perform well in mimicking the reality.Meanwhile,it reveals that the authors improve average distance,diameter and clustering coefficient versus Chord and CAN topologies.Finally,the authors show that the proposed topology is the most robust model,against failures and attacks for nodes and edges,versus small-world and scale-free networks.展开更多
In this work, we present an evaluation of the performance and error robustness of RTP-based broadcast streaming of high-quality high-definition (HD) H.264/AVC video. Using a fully controlled IP test bed (Hillestad et ...In this work, we present an evaluation of the performance and error robustness of RTP-based broadcast streaming of high-quality high-definition (HD) H.264/AVC video. Using a fully controlled IP test bed (Hillestad et al., 2005), we broadcast high-definition video over RTP/UDP, and use an IP network emulator to introduce a varying amount of randomly distributed packet loss. A high-performance network interface monitoring card is used to capture the video packets into a trace file. Purpose-built software parses the trace file, analyzes the RTP stream and assembles the correctly received NAL units into an H.264/AVC Annex B byte stream file, which is subsequently decoded by JVT JM 10.1 reference software. The proposed measurement setup is a novel, practical and intuitive approach to perform error resilience testing of real-world H.264/AVC broadcast applications. Through a series of experiments, we evaluate some of the error resilience features of the H.264/AVC standard, and see how they perform at packet loss rates from 0.01% to 5%. The results confirmed that an appropriate slice partitioning scheme is essential to have a graceful degradation in received quality in the case of packet loss. While flexible macroblock ordering reduces the compression efficiency about 1 dB for our test material, reconstructed video quality is improved for loss rates above 0.25%.展开更多
A joint channel selection and power control scheme is developed for video streaming in device-to-device (D2D) communications based cognitive radio networks. In particular, physical queue and virtual queue models by ...A joint channel selection and power control scheme is developed for video streaming in device-to-device (D2D) communications based cognitive radio networks. In particular, physical queue and virtual queue models by applying 'M/G/1 queue' and 'M/G/1 queue with vacations' theories are built up, respectively, to evaluate the delays experienced by various video traffics. Such delays play a vital role in calculating the packet loss rate for video streaming, which reflects the video distortion. Based on the distortion model, a video distortion minimization problem is formulated, subject to the rate constraint, maximum power constraint, primary users' tolerant interference constraint, and secondary users' minimum data rate requirement constraint. The optimization problem turns out to be a mixed integer nonlinear programming (MINLP) , which is generally nondeterministic in polynomial time. A Lagrangian dual method is thus employed to reformulate the video distortion minimization problem, based on which the sub-gradient algorithm is used to determine a relaxed solution. Thereafter, applying the iterative user removal yields the optimal joint channel selection and power control solution to the original MINLP problem. Extensive simulations validate our proposed scheme and demonstrate that it significantly increases the peak signal- to-noise ratio (PSNR) compared with the existing schemes.展开更多
Real-time video streaming using ultra-wideband(UWB) technology is experimentally demonstrated along long-reach passive optical networks(LR-PONs) with different wired and wireless reaches. Experimental tests using exte...Real-time video streaming using ultra-wideband(UWB) technology is experimentally demonstrated along long-reach passive optical networks(LR-PONs) with different wired and wireless reaches. Experimental tests using external and direct modulation with UWB wireless radiation in the 10- and 60-GHz bands are performed. An ultra-bendable fiber is also considered for a last-mile distribution. The video quality at the output of the optical fiber infrastructure of the LR-PON is assessed using the error vector magnitude(EVM), and the link quality indicator(LQI) is used as a figure of merit after wireless radiation. An EVM below –17 dB is achieved for both externally and directly modulated LR-PONs comprising up to 125 km of optical fiber. EVM improvement is observed for longer LR-PONs when directly modulated lasers(DMLs) are used because of the amplitude gain provided by the combined effect of dispersion and DML's chirp. Compared with optical back-to-back operation, the LQI level degrades to the maximum around 20% for LR-PONs ranging between 75 and 125 km of fiber reach and with a wireless coverage of 2 m in the 10-GHz UWB band. The same level of LQI degradation is observed using the 60-GHz UWB band with a LR-PON integrating 101 km of access network, a last-mile distribution using ultra-bendable fiber, and a 5.2-m wireless link.展开更多
User interactive behaviors play a dual role during the hypertext transfer protocol (HTTP) video service: reflection and influence. However, they are seldom taken into account in practices. To this end, this paper p...User interactive behaviors play a dual role during the hypertext transfer protocol (HTTP) video service: reflection and influence. However, they are seldom taken into account in practices. To this end, this paper puts forward the user interactive behaviors, as subjective factors of quality of experience (QoE) from viewer level, to structure a comprehensive multilayer evaluation model based on classic network quality of service (QoS) and application QoS. First, dual roles of user behaviors are studied and the characteristics are extracted where the user experience is correlated with user interactive behaviors. Furthermore, we categorize QoE factors into three dimensions and build the metric system. Then we perform the subjective tests and investigate the relationships among network path quality, user behaviors, and QoE. Ultimately, we employ the back propagation neural network (BPNN) to validate our analysis and model. Through the simulation experiment of mathematical and BPNN, the dual effects of user interaction behaviors on the reflection and influence of QoE in the video stream are analyzed, and the QoE metric system and evaluation model are established.展开更多
Video streaming is one of the most important applications used in the best-effort Internet. This paper presents a new scheme for multiple source video streaming in which the traditional fine granular scal-able coding ...Video streaming is one of the most important applications used in the best-effort Internet. This paper presents a new scheme for multiple source video streaming in which the traditional fine granular scal-able coding was rebuilt into a multiple sub-streams based transmission model. A peak signal to noise ratio based stream rate allocation algorithm was then developed based on the transmission model. In tests, the algorithm performance is about 1 dB higher than that of a uniform rate allocation algorithm. Therefore, this scheme can overcome bottlenecks along a single link and smooth jitter to achieve high quality and stable video.展开更多
Reduced-reference (RR) video-quality estimators send a small signature to the receiver. This signature comprises the original video content as well as the video stream. RR quality estimation provides reliability and...Reduced-reference (RR) video-quality estimators send a small signature to the receiver. This signature comprises the original video content as well as the video stream. RR quality estimation provides reliability and involves a small data payload. While significant in theory, RR estimators have only recently been used in practice for quality monitoring and adaptive system con- trol in streaming-video frameworks. In this paper, we classify RR algorithms according to whether they are based on a) model- ing the signal distortion, b) modeling the human visual system, or c) analyzing the video signal source. We review proposed RR techniques for monitoring and controlling quality in streaming video systems.展开更多
To achieve an optimal trade-off between video quality and energy efficiency in the uplink streaming of multi-user Scalable Video Coding (SVC) videos in relay-based Orthogonal Frequency Division Multiple Access (OFDMA)...To achieve an optimal trade-off between video quality and energy efficiency in the uplink streaming of multi-user Scalable Video Coding (SVC) videos in relay-based Orthogonal Frequency Division Multiple Access (OFDMA) cellular networks, a cross-layer design framework that jointly selects the Transmission Policy (TP) for SVC video frames, assigns OFDMA subcarriers, and allocates power for each subcarrier is proposed. We apply the dual decomposition method to the problem, and obtain a TP selection subproblem for each SVC video adaptation and a resource allocation subproblem of Joint Subcarrier, Relay and Power Allocation (JSRPA). A second level of dual decomposition is used to divide the JSRPA problem into independent subcarrier subproblems. The proposed Crosslayer Trade-off Optimization (CTO) algorithm is sub-distributed with significantly low complexity. A performance evaluation with typical SVC video traces demonstrates that the proposed algorithm is able to converge and efficiently achieve the optimal trade-off between the video quality and energy consumption at the MSs for uplink SVC streaming.展开更多
A novel bandwidth prediction and control scheme is proposed for video transmission over an ad boc network. The scheme is based on cross-layer, feedback, and Bayesian network techniques. The impacts of video quality ar...A novel bandwidth prediction and control scheme is proposed for video transmission over an ad boc network. The scheme is based on cross-layer, feedback, and Bayesian network techniques. The impacts of video quality are formulized and deduced. The relevant factors are obtained by a cross-layer mechanism or Feedback method. According to these relevant factors, the variable set and the Bayesian network topology are determined. Then a Bayesian network prediction model is constructed. The results of the prediction can be used as the bandwidth of the mobile ad hoc network (MANET). According to the bandwidth, the video encoder is controlled to dynamically adjust and encode the right bit rates of a real-time video stream. Integrated simulation of a video streaming communication system is implemented to validate the proposed solution. In contrast to the conventional transfer scheme, the results of the experiment indicate that the proposed scheme can make the best use of the network bandwidth; there are considerable improvements in the packet loss and the visual quality of real-time video.K展开更多
文摘Recently,the combination of video services and 5G networks have been gaining attention in the wireless communication realm.With the brisk advancement in 5G network usage and the massive popularity of threedimensional video streaming,the quality of experience(QoE)of video in 5G systems has been receiving overwhelming significance from both customers and service provider ends.Therefore,effectively categorizing QoE-aware video streaming is imperative for achieving greater client satisfaction.This work makes the following contribution:First,a simulation platform based on NS-3 is introduced to analyze and improve the performance of video services.The simulation is formulated to offer real-time measurements,saving the expensive expenses associated with real-world equipment.Second,A valuable framework for QoE-aware video streaming categorization is introduced in 5G networks based on machine learning(ML)by incorporating the hyperparameter tuning(HPT)principle.It implements an enhanced hyperparameter tuning(EHPT)ensemble and decision tree(DT)classifier for video streaming categorization.The performance of the ML approach is assessed by considering precision,accuracy,recall,and computation time metrics for manifesting the superiority of these classifiers regarding video streaming categorization.This paper demonstrates that our ML classifiers achieve QoE prediction accuracy of 92.59%for(EHPT)ensemble and 87.037%for decision tree(DT)classifiers.
文摘In recent years,real-time video streaming has grown in popularity.The growing popularity of the Internet of Things(IoT)and other wireless heterogeneous networks mandates that network resources be carefully apportioned among versatile users in order to achieve the best Quality of Experience(QoE)and performance objectives.Most researchers focused on Forward Error Correction(FEC)techniques when attempting to strike a balance between QoE and performance.However,as network capacity increases,the performance degrades,impacting the live visual experience.Recently,Deep Learning(DL)algorithms have been successfully integrated with FEC to stream videos across multiple heterogeneous networks.But these algorithms need to be changed to make the experience better without sacrificing packet loss and delay time.To address the previous challenge,this paper proposes a novel intelligent algorithm that streams video in multi-home heterogeneous networks based on network-centric characteristics.The proposed framework contains modules such as Intelligent Content Extraction Module(ICEM),Channel Status Monitor(CSM),and Adaptive FEC(AFEC).This framework adopts the Cognitive Learning-based Scheduling(CLS)Module,which works on the deep Reinforced Gated Recurrent Networks(RGRN)principle and embeds them along with the FEC to achieve better performances.The complete framework was developed using the Objective Modular Network Testbed in C++(OMNET++),Internet networking(INET),and Python 3.10,with Keras as the front end and Tensorflow 2.10 as the back end.With extensive experimentation,the proposed model outperforms the other existing intelligentmodels in terms of improving the QoE,minimizing the End-to-End Delay(EED),and maintaining the highest accuracy(98%)and a lower Root Mean Square Error(RMSE)value of 0.001.
基金This work was supported by National Natural Science Foundation of China(No.61771070)National Natural Science Foundation of China(No.61671088).
文摘Adaptive bitrate video streaming(ABR)has become a critical technique for mobile video streaming to cope with time-varying network conditions and different user preferences.However,there are still many problems in achieving high-quality ABR video streaming over cellular networks.Mobile Edge Computing(MEC)is a promising paradigm to overcome the above problems by providing video transcoding capability and caching the ABR video streaming within the radio access network(RAN).In this paper,we propose a flexible transcoding strategy to provide viewers with low-latency video streaming services in the MEC networks under the limited storage,computing,and spectrum resources.According to the information collected from users,the MEC server acts as a controlling component to adjust the transcoding strategy flexibly based on optimizing the video caching placement strategy.Specifically,we cache the proper bitrate version of the video segments at the edge servers and select the appropriate bitrate version of the video segments to perform transcoding under jointly considering access control,resource allocation,and user preferences.We formulate this problem as a nonconvex optimization and mixed combinatorial problem.Moreover,the simulation results indicate that our proposed algorithm can ensure a low-latency viewing experience for users.
文摘With correlating with human perception, quality of experience(Qo E) is also an important measurement in evaluation of video quality in addition to quality of service(Qo S). A cross-layer scheme based on Lyapunov optimization framework for H.264/AVC video streaming over wireless Ad hoc networks is proposed, with increasing both Qo E and Qo S performances. Different from existing works, this scheme routes and schedules video packets according to the statuses of the frame buffers at the destination nodes to reduce buffer underflows and to increase video playout continuity. The waiting time of head-ofline packets of data queues are considered in routing and scheduling to reduce the average end-to-end delay of video sessions. Different types of packets are allocated with different priorities according to their generated rates under H.264/AVC. To reduce the computational complexity, a distributed media access control policy and a power control algorithm cooperating with the media access policy are proposed. Simulation results show that, compared with existing schemes, this scheme can improve both the Qo S and Qo E performances. The average peak signal-to-noise ratio(PSNR) of the received video streams is also increased.
文摘In this paper,we investigate video quality enhancement using computation offloading to the mobile cloud computing(MCC)environment.Our objective is to reduce the computational complexity required to covert a low-resolution video to high-resolution video while minimizing computation at the mobile client and additional communication costs.To do so,we propose an energy-efficient computation offloading framework for video streaming services in a MCC over the fifth generation(5G)cellular networks.In the proposed framework,the mobile client offloads the computational burden for the video enhancement to the cloud,which renders the side information needed to enhance video without requiring much computation by the client.The cloud detects edges from the upsampled ultra-high-resolution video(UHD)and then compresses and transmits them as side information with the original low-resolution video(e.g.,full HD).Finally,the mobile client decodes the received content and integrates the SI and original content,which produces a high-quality video.In our extensive simulation experiments,we observed that the amount of computation needed to construct a UHD video in the client is 50%-60% lower than that required to decode UHD video compressed by legacy video encoding algorithms.Moreover,the bandwidth required to transmit a full HD video and its side information is around 70% lower than that required for a normal UHD video.The subjective quality of the enhanced UHD is similar to that of the original UHD video even though the client pays lower communication costs with reduced computing power.
基金Project supported by the National Natural Science Foundation ofChina (No. 60302004) and the Natural Science Foundation of HubeiProvince (No. 2005ABA264), China
文摘Equation based TCP-friendly rate control (TFRC) protocol has been proposed to support video streaming applications. In order to improve TFRC performance in wireless channels, the link level automatic repeat request (ARQ) scheme is usually deployed. However, ARQ cannot ensure strict delay guarantees, especially over multihop links. This paper introduces a theoretical model to deduce an equation for packet size adjustment in transport layer to minimize retransmission delay by taking into con- sideration the causative reasons inducing retransmission in link layer. An enhanced TFRC (ETFRC) scheme is proposed inte- grating TFRC with variable packet size policy. Simulation results demonstrate that higher goodput, lower packet loss rate (PLR), lower frame transmission delay and jitter with good fairness can be achieved by our proposed mechanism.
基金Project (No. 60332030) supported by the National Natural ScienceFoundation of China
文摘For video streaming over lossy channels, intra refresh can mitigate the error-propagation effect caused by packet losses Besides some intra-mode macroblocks (MBs) generated by the "Lagrangian rate-distortion" or "Sum of absolute difference" mode decision, the encoder or transcoder possibly needs to increase some "forced" intra-mode MBs for robust video streaming. Based on the error-propagation analysis in a group of pictures (GOP), we propose an unequal Forced-Intra-Refresh (FIR) scheme to improve packet loss resilience of video streaming. According to a GOP-level error-propagation model, the proposed unequal FIR scheme can optimally increase the unequal number of forced intra-mode MBs for different frames in a GOP. Simulation results showed that the proposed scheme can effectively enhance the robustness of video streaming under different channel conditions, and achieve about 0. 1-0.9 dB gains over the average FIR scheme in H.264/AVC tools.
文摘A new rate allocation method for fine-granular scalability (FGS) coded bitstreams is presented in order to achieve smooth quality reconstruction of frames under channel conditions with a wide range of bandwidth variation and improve the average PSNR of the whole sequence. Based on a quality weighted bit allocation method, a sliding window rate allocation method is proposed for the first time so that the window can slide along the video sequence with a certain sliding step. Experimental results show that, under dynamic bandwidth conditions, the proposed method can simultaneously satisfy the requirements for improving average PSNR of the whole video sequence greatly and reducing the fluctuations between adjacent frames greatly.
文摘In the case of video streaming over wireless channels, burst errors may lead to serious video quality degradation. By jointly exploiting the scheduling mechanism on different communication layers, this paper proposes a quality-aware cross-layer scheduling scheme to achieve unequal error control for each Latency-constraint Frame Set (LFS) of a video stream. After a network-layer agent at base station firstly utilizes the network-layer packet scheduling to provide packet-granularity importance classifi-cation for the current LFS, a link-layer agent at base station further utilizes the Radio-Link-Unit (RLU) scheduling to implement finer selective retransmission of the current LFS. Under scheduling delay and bandwidth constraints, the proposed scheme can be aware of the application-layer quality and time-varying channel conditions, and hence burst errors can simply be shifted to lower-priority transmission units in the current LFS. Simulation results demonstrate that the proposed scheme has strong robustness against burst errors, and thus improves the overall received quality of the video stream over wireless channels.
文摘Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers thelikelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions in videostreams holds significant importance in computer vision research, as it aims to enhance exercise adherence, enableinstant recognition, advance fitness tracking technologies, and optimize fitness routines. However, existing actiondatasets often lack diversity and specificity for workout actions, hindering the development of accurate recognitionmodels. To address this gap, the Workout Action Video dataset (WAVd) has been introduced as a significantcontribution. WAVd comprises a diverse collection of labeled workout action videos, meticulously curated toencompass various exercises performed by numerous individuals in different settings. This research proposes aninnovative framework based on the Attention driven Residual Deep Convolutional-Gated Recurrent Unit (ResDCGRU)network for workout action recognition in video streams. Unlike image-based action recognition, videoscontain spatio-temporal information, making the task more complex and challenging. While substantial progresshas been made in this area, challenges persist in detecting subtle and complex actions, handling occlusions,and managing the computational demands of deep learning approaches. The proposed ResDC-GRU Attentionmodel demonstrated exceptional classification performance with 95.81% accuracy in classifying workout actionvideos and also outperformed various state-of-the-art models. The method also yielded 81.6%, 97.2%, 95.6%, and93.2% accuracy on established benchmark datasets, namely HMDB51, Youtube Actions, UCF50, and UCF101,respectively, showcasing its superiority and robustness in action recognition. The findings suggest practicalimplications in real-world scenarios where precise video action recognition is paramount, addressing the persistingchallenges in the field. TheWAVd dataset serves as a catalyst for the development ofmore robust and effective fitnesstracking systems and ultimately promotes healthier lifestyles through improved exercise monitoring and analysis.
基金supported in part by National Key Basic Research and Development (973) Program of China(Nos. 2011CB302805 and 2013CB228206)the National High-Tech Research and Development (863) Program of China (No. 2013BAH19F01)the National Natural Science Foundation of China (No. 61233016)
文摘Video streaming services are trending to be deployed on cloud. Cloud computing offers better stability and lower price than traditional IT facilities. Huge storage capacity is essential for video streaming service. More and more cloud providers appear so there are increasing cloud platforms to choose. A better choice is to use more than one data center, which is called multi-cloud. In this paper a closed-loop approach is proposed for optimizing Quality of Service (QoS) and cost. Modules of monitoring and controlling data centers are required as well as the application feedback such as video streaming services. An algorithm is proposed to help choose cloud providers and data centers in a multi-cloud environment as a video service manager. Performance with different video service workloads are evaluated. Compared with using only one cloud provider, dynamically deploying services in multi-cloud is better in aspects of both cost and QoS. If cloud service costs are different among data centers, the algorithm will help make choices to lower the cost and keep a high QoS.
文摘A new wave of networks labeled Peer-to-Peer(P2P) networks attracts more researchers and rapidly becomes one of the most popular applications.In order to matching P2 P logical overlay network with physical topology,the position-based topology has been proposed.The proposed topology not only focuses on non-functional characteristics such as scalability,reliability,fault-tolerance,selforganization,decentralization and fairness,but also functional characteristics are addressed as well.The experimental results show that the hybrid complex topology achieves better characteristics than other complex networks' models like small-world and scale-free models;since most of the real-life networks are both scale-free and small-world networks,it may perform well in mimicking the reality.Meanwhile,it reveals that the authors improve average distance,diameter and clustering coefficient versus Chord and CAN topologies.Finally,the authors show that the proposed topology is the most robust model,against failures and attacks for nodes and edges,versus small-world and scale-free networks.
基金Project supported by the Research Council of Norway, Norwegian University of Science and Technology (NTNU), and the Norwegian Resarch Network (UNINETT)
文摘In this work, we present an evaluation of the performance and error robustness of RTP-based broadcast streaming of high-quality high-definition (HD) H.264/AVC video. Using a fully controlled IP test bed (Hillestad et al., 2005), we broadcast high-definition video over RTP/UDP, and use an IP network emulator to introduce a varying amount of randomly distributed packet loss. A high-performance network interface monitoring card is used to capture the video packets into a trace file. Purpose-built software parses the trace file, analyzes the RTP stream and assembles the correctly received NAL units into an H.264/AVC Annex B byte stream file, which is subsequently decoded by JVT JM 10.1 reference software. The proposed measurement setup is a novel, practical and intuitive approach to perform error resilience testing of real-world H.264/AVC broadcast applications. Through a series of experiments, we evaluate some of the error resilience features of the H.264/AVC standard, and see how they perform at packet loss rates from 0.01% to 5%. The results confirmed that an appropriate slice partitioning scheme is essential to have a graceful degradation in received quality in the case of packet loss. While flexible macroblock ordering reduces the compression efficiency about 1 dB for our test material, reconstructed video quality is improved for loss rates above 0.25%.
基金supported by the National Natural Science Foundation of China ( 61371127,61671347)the 111 Project of China ( B08038 )+1 种基金the Fundamental Research Funds for the Central Universities ( 7214603701 )the Key Technology R&D Program of Henan Province ( 142102210572)
文摘A joint channel selection and power control scheme is developed for video streaming in device-to-device (D2D) communications based cognitive radio networks. In particular, physical queue and virtual queue models by applying 'M/G/1 queue' and 'M/G/1 queue with vacations' theories are built up, respectively, to evaluate the delays experienced by various video traffics. Such delays play a vital role in calculating the packet loss rate for video streaming, which reflects the video distortion. Based on the distortion model, a video distortion minimization problem is formulated, subject to the rate constraint, maximum power constraint, primary users' tolerant interference constraint, and secondary users' minimum data rate requirement constraint. The optimization problem turns out to be a mixed integer nonlinear programming (MINLP) , which is generally nondeterministic in polynomial time. A Lagrangian dual method is thus employed to reformulate the video distortion minimization problem, based on which the sub-gradient algorithm is used to determine a relaxed solution. Thereafter, applying the iterative user removal yields the optimal joint channel selection and power control solution to the original MINLP problem. Extensive simulations validate our proposed scheme and demonstrate that it significantly increases the peak signal- to-noise ratio (PSNR) compared with the existing schemes.
基金supported by the Fundao para a Ciência e a Tecnologia from Portugal under projects PEst-OE/EEI/LA0008/2013 and TURBO-PTDC/EEATEL/104358/2008by the European FIVER-FP7-ICT-2009-4-249142 project
文摘Real-time video streaming using ultra-wideband(UWB) technology is experimentally demonstrated along long-reach passive optical networks(LR-PONs) with different wired and wireless reaches. Experimental tests using external and direct modulation with UWB wireless radiation in the 10- and 60-GHz bands are performed. An ultra-bendable fiber is also considered for a last-mile distribution. The video quality at the output of the optical fiber infrastructure of the LR-PON is assessed using the error vector magnitude(EVM), and the link quality indicator(LQI) is used as a figure of merit after wireless radiation. An EVM below –17 dB is achieved for both externally and directly modulated LR-PONs comprising up to 125 km of optical fiber. EVM improvement is observed for longer LR-PONs when directly modulated lasers(DMLs) are used because of the amplitude gain provided by the combined effect of dispersion and DML's chirp. Compared with optical back-to-back operation, the LQI level degrades to the maximum around 20% for LR-PONs ranging between 75 and 125 km of fiber reach and with a wireless coverage of 2 m in the 10-GHz UWB band. The same level of LQI degradation is observed using the 60-GHz UWB band with a LR-PON integrating 101 km of access network, a last-mile distribution using ultra-bendable fiber, and a 5.2-m wireless link.
基金supported by the Postdoctoral Science Foundation of China (2017M610827)
文摘User interactive behaviors play a dual role during the hypertext transfer protocol (HTTP) video service: reflection and influence. However, they are seldom taken into account in practices. To this end, this paper puts forward the user interactive behaviors, as subjective factors of quality of experience (QoE) from viewer level, to structure a comprehensive multilayer evaluation model based on classic network quality of service (QoS) and application QoS. First, dual roles of user behaviors are studied and the characteristics are extracted where the user experience is correlated with user interactive behaviors. Furthermore, we categorize QoE factors into three dimensions and build the metric system. Then we perform the subjective tests and investigate the relationships among network path quality, user behaviors, and QoE. Ultimately, we employ the back propagation neural network (BPNN) to validate our analysis and model. Through the simulation experiment of mathematical and BPNN, the dual effects of user interaction behaviors on the reflection and influence of QoE in the video stream are analyzed, and the QoE metric system and evaluation model are established.
基金the National Natural Science Foundation of China (No. 60273008)
文摘Video streaming is one of the most important applications used in the best-effort Internet. This paper presents a new scheme for multiple source video streaming in which the traditional fine granular scal-able coding was rebuilt into a multiple sub-streams based transmission model. A peak signal to noise ratio based stream rate allocation algorithm was then developed based on the transmission model. In tests, the algorithm performance is about 1 dB higher than that of a uniform rate allocation algorithm. Therefore, this scheme can overcome bottlenecks along a single link and smooth jitter to achieve high quality and stable video.
文摘Reduced-reference (RR) video-quality estimators send a small signature to the receiver. This signature comprises the original video content as well as the video stream. RR quality estimation provides reliability and involves a small data payload. While significant in theory, RR estimators have only recently been used in practice for quality monitoring and adaptive system con- trol in streaming-video frameworks. In this paper, we classify RR algorithms according to whether they are based on a) model- ing the signal distortion, b) modeling the human visual system, or c) analyzing the video signal source. We review proposed RR techniques for monitoring and controlling quality in streaming video systems.
基金partially supported by the National Natural Science Foundation of China under Grants No. 610202380, No. 60932007Major Program of National Natural Science Foundation of China under Grant No. 60932007+2 种基金Tianjin Research Program of Application Foundation and Advanced Technology under Grant No. 12JCQNJC00300Research Fund for the Doctoral Program of Higher Education of China under Grant No. 20110032120029the Innovation Foundation of Tianjin University
文摘To achieve an optimal trade-off between video quality and energy efficiency in the uplink streaming of multi-user Scalable Video Coding (SVC) videos in relay-based Orthogonal Frequency Division Multiple Access (OFDMA) cellular networks, a cross-layer design framework that jointly selects the Transmission Policy (TP) for SVC video frames, assigns OFDMA subcarriers, and allocates power for each subcarrier is proposed. We apply the dual decomposition method to the problem, and obtain a TP selection subproblem for each SVC video adaptation and a resource allocation subproblem of Joint Subcarrier, Relay and Power Allocation (JSRPA). A second level of dual decomposition is used to divide the JSRPA problem into independent subcarrier subproblems. The proposed Crosslayer Trade-off Optimization (CTO) algorithm is sub-distributed with significantly low complexity. A performance evaluation with typical SVC video traces demonstrates that the proposed algorithm is able to converge and efficiently achieve the optimal trade-off between the video quality and energy consumption at the MSs for uplink SVC streaming.
基金The National High Technology Research and Development Program of China (863Program) (No.2003AA1Z2130)the Scienceand Technology Project of Zhejiang Province(No.2005C11001-02)
文摘A novel bandwidth prediction and control scheme is proposed for video transmission over an ad boc network. The scheme is based on cross-layer, feedback, and Bayesian network techniques. The impacts of video quality are formulized and deduced. The relevant factors are obtained by a cross-layer mechanism or Feedback method. According to these relevant factors, the variable set and the Bayesian network topology are determined. Then a Bayesian network prediction model is constructed. The results of the prediction can be used as the bandwidth of the mobile ad hoc network (MANET). According to the bandwidth, the video encoder is controlled to dynamically adjust and encode the right bit rates of a real-time video stream. Integrated simulation of a video streaming communication system is implemented to validate the proposed solution. In contrast to the conventional transfer scheme, the results of the experiment indicate that the proposed scheme can make the best use of the network bandwidth; there are considerable improvements in the packet loss and the visual quality of real-time video.K