期刊文献+
共找到10篇文章
< 1 >
每页显示 20 50 100
基于改进深度强化学习的倒立摆控制器设计 被引量:6
1
作者 王雨轩 陈思溢 黄辉先 《控制工程》 CSCD 北大核心 2022年第11期2018-2026,共9页
小车倒立摆系统是一种具有非线性、强耦合、多变量、欠驱动等特性的自然不稳定系统,倒立摆系统的稳定控制是控制理论中的典型问题。针对该种控制目标,提出了一种基于改进深度学习策略梯度算法的控制方法,控制机构采用强化学习算法作为... 小车倒立摆系统是一种具有非线性、强耦合、多变量、欠驱动等特性的自然不稳定系统,倒立摆系统的稳定控制是控制理论中的典型问题。针对该种控制目标,提出了一种基于改进深度学习策略梯度算法的控制方法,控制机构采用强化学习算法作为控制策略。其中,强化学习系统由策略神经网络和基线函数神经网络共同构成,同时神经网络激活函数采用了性能更优的Swish函数,并添加了基线函数以提高训练效率。将新的算法应用于小车倒立摆系统进行仿真实验,并与经典控制算法进行比较,试验结果证明了本文算法的有效性。 展开更多
关键词 强化学习 深度强化学习 策略梯度算法 激活函数 神经网络 基线函数
下载PDF
基于深度强化学习的码率自适应算法研究 被引量:3
2
作者 易令 李泽平 《电子学报》 EI CAS CSCD 北大核心 2022年第5期1192-1200,共9页
码率自适应(Adaptive BitRate,ABR)算法是视频客户端提高用户体验质量(Quality of Experience,QoE)的一种有效途径.针对现有ABR算法存在频繁缓冲、视频卡顿、画质较低和网络吞吐量预测不准确等问题,本文提出一种基于深度强化学习的码率... 码率自适应(Adaptive BitRate,ABR)算法是视频客户端提高用户体验质量(Quality of Experience,QoE)的一种有效途径.针对现有ABR算法存在频繁缓冲、视频卡顿、画质较低和网络吞吐量预测不准确等问题,本文提出一种基于深度强化学习的码率自适应(Deep Reinforcement Learning based ABR,DRLA)算法.DRLA用实际网络带宽数据训练神经网络,通过收集客户端缓冲区占用率和网络吞吐量向视频服务器请求最佳码率的视频.首先,DRLA用基线函数方法优化损失函数L,用熵随机探索方法防止损失函数局部收敛;其次利用约束条件限制新旧策略的散度更新幅度提高算法的鲁棒性;最后通过置信域(trust region)优化找到最优策略,使得QoE达到最优.与现有ABR算法对比的实验结果表明:DRLA减少了训练时间,能进一步提高算法的鲁棒性和用户的QoE,并在实际环境下验证了算法的有效性. 展开更多
关键词 码率自适应算法 体验质量 深度强化学习 基线函数 置信域
下载PDF
退出盯住汇率制度的动因:基于生存分析技术的实证研究
3
作者 何青 何治莉 杨晓光 《管理评论》 2006年第6期20-27,共8页
本文利用生存分析技术对盯住汇率制度的持续期进行了实证研究。本文收集了23个国家48个样本数据。考虑到不同退出模式下经济体的不同表现,将样本国家分成了两类:贬值性退出模式的国家和非贬值性退出模式的国家,然后使用非参数方法和半... 本文利用生存分析技术对盯住汇率制度的持续期进行了实证研究。本文收集了23个国家48个样本数据。考虑到不同退出模式下经济体的不同表现,将样本国家分成了两类:贬值性退出模式的国家和非贬值性退出模式的国家,然后使用非参数方法和半参数方法,对这两类样本群体分别估计了危险率、基线危险函数和影响因素的系数。实证结果表明,两种退出模式下的决定因素存在着明显的差异。盯住汇率的持续期是决定退出模式和退出可能性的重要决定因素:贬值性退出的可能性会随着时间的推移而不断的增加,而非贬值性退出更容易发生在盯住汇率制度的早期;此外,某些决定因素在两类退出中表现迥异,比如,金融深化程度的提高会增加贬值性退出的可能性,但却降低了非贬值性退出盯住汇率制度的可能性。 展开更多
关键词 盯住汇率制度 持续期 贬值性退出 非贬值性退出 危险率 基线危险函数
下载PDF
Application of Radial Basis Function Network in Sensor Failure Detection
4
作者 钮永胜 赵新民 《Journal of Beijing Institute of Technology》 EI CAS 1999年第2期70-76,共7页
Aim To detect sensor failure in control system using a single sensor signal. Methods A neural predictor was designed based on a radial basis function network(RBFN), and the neural predictor learned the sensor sig... Aim To detect sensor failure in control system using a single sensor signal. Methods A neural predictor was designed based on a radial basis function network(RBFN), and the neural predictor learned the sensor signal on line with a hybrid algorithm composed of n means clustering and Kalman filter and then gave the estimation of the sensor signal at the next step. If the difference between the estimation and the actural values of the sensor signal exceeded a threshold, the sensor could be declared to have a failure. The choice of the failure detection threshold depends on the noise variance and the possible prediction error of neural predictor. Results and Conclusion\ The computer simulation results show the proposed method can detect sensor failure correctly for a gyro in an automotive engine. 展开更多
关键词 sensor failure failure detection radial basis function network(BRFN) on line learning
下载PDF
A New Strategy of Integrated Control and On-line Optimization on High-purity Distillation Process 被引量:10
5
作者 吕文祥 朱鹰 +2 位作者 黄德先 江永亨 金以慧 《Chinese Journal of Chemical Engineering》 SCIE EI CAS CSCD 2010年第1期66-79,共14页
For high-purity distillation processes,it is difficult to achieve a good direct product quality control using traditional proportional-integral-differential(PID)control or multivariable predictive control technique du... For high-purity distillation processes,it is difficult to achieve a good direct product quality control using traditional proportional-integral-differential(PID)control or multivariable predictive control technique due to some difficulties,such as long response time,many un-measurable disturbances,and the reliability and precision issues of product quality soft-sensors.In this paper,based on the first principle analysis and dynamic simulation of a distillation process,a new predictive control scheme is proposed by using the split ratio of distillate flow rate to that of bottoms as an essential controlled variable.Correspondingly,a new strategy with integrated control and on-line optimization is developed,which consists of model predictive control of the split ratio,surrogate model based on radial basis function neural network for optimization,and modified differential evolution optimization algorithm. With the strategy,the process achieves its steady state quickly,so more profit can be obtained.The proposed strategy has been successfully applied to a gas separation plant for more than three years,which shows that the strategy is feasible and effective. 展开更多
关键词 distillation process control split ratio surrogate model optimization modified differential evolution
下载PDF
Application and comparison of RNN, RBFNN and MNLR approaches on prediction of flotation column performance 被引量:8
6
作者 Nakhaei Fardis Irannajad Mehdi 《International Journal of Mining Science and Technology》 SCIE EI CSCD 2015年第6期983-990,共8页
Evaluation of grade and recovery plays an important role in process control and plant profitability in mineral processing operations, especially flotation. The accurate measurement or estimation of these two parameter... Evaluation of grade and recovery plays an important role in process control and plant profitability in mineral processing operations, especially flotation. The accurate measurement or estimation of these two parameters, based on the secondary variables, is a critical issue. Data-driven modeling techniques, which entail comprehensive data analysis and implementation of machine learning methods for system forecast, provide an attractive alternative. In this paper, two types of artificial neural networks(ANNs),namely radial basis function neural network(RBFNN) and layer recurrent neural network(RNN), and also a multivariate nonlinear regression(MNLR) model were employed to predict metallurgical performance of the flotation column. The training capacity and the accuracy of these three above mentioned types of models were compared. In order to acquire data for the simulation, a case study was conducted at Sarcheshmeh copper complex pilot plant. Based on the root mean squared error and correlation coefficient values, at training and testing stages, the RNN forecasted the metallurgical performance of the flotation column better than RBF and MNLR models. The RNN could predict Cu grade and recovery with correlation coefficients of 0.92 and 0.9, respectively in testing process. 展开更多
关键词 Flotation columnRadial basis functionRecurrent neural networkMultivariate nonlinear regressionMetallurgical performance
下载PDF
抛物线类渠道断面收缩水深的计算通式 被引量:14
7
作者 赵延风 王正中 刘计良 《水力发电学报》 EI CSCD 北大核心 2013年第1期126-131,共6页
为了得到抛物线类渠道收缩水深的显函数计算通式,该文首次提出基线函数增量法的概念并采用该方法确定函数的参数。通过对抛物线型渠道断面收缩水深的基本方程进行恒等变换,取得简单快速收敛的迭代公式;根据无量纲收缩水深与已知量综合... 为了得到抛物线类渠道收缩水深的显函数计算通式,该文首次提出基线函数增量法的概念并采用该方法确定函数的参数。通过对抛物线型渠道断面收缩水深的基本方程进行恒等变换,取得简单快速收敛的迭代公式;根据无量纲收缩水深与已知量综合参数之间的数值分析,采用基线函数增量法确定迭代初值函数中的参数并应用迭代理论,得到了n次抛物线型渠道收缩水深的直接计算通式。误差分析及实例计算表明,在工程常用范围即无量纲收缩水深α∈(0,0.5]范围内,在工程常用抛物线指数n∈[1,5]范围,收缩水深最大相对误差小于0.52%,在抛物线指数n∈[1,100]范围内,收缩水深的最大相对误差小于0.85%,计算公式形式简单、精度高、通用性强。 展开更多
关键词 水力学 收缩水深 基线函数增量法 抛物线型断面
原文传递
High-Order Dispersion Coefficients for Alkali-metal Atoms
8
作者 KANG Shuai DING Chi-Kun +1 位作者 CHEN Chang-Yong WU Xue-Qing 《Communications in Theoretical Physics》 SCIE CAS CSCD 2013年第7期73-79,共7页
High-order dispersion coefficients C9, C11, C12, and C13 for the ground-state alkali-metals were calculated by combining the 1-dependent model potential of alkali-metal atoms and linear variation method based on B-spl... High-order dispersion coefficients C9, C11, C12, and C13 for the ground-state alkali-metals were calculated by combining the 1-dependent model potential of alkali-metal atoms and linear variation method based on B-spline basis functions. The results were compared. 展开更多
关键词 dispersion coefficient alkali-metal atom B-SPLINE POLARIZABILITY
原文传递
MINIMIZING A LINEAR FRACTIONAL FUNCTION SUBJECT TO A SYSTEM OF SUP-T EQUATIONS WITH A CONTINUOUS ARCHIMEDEAN TRIANGULAR NORM 被引量:1
9
作者 Pingke LI Edward P.Fitts Department of Industrial and Systems Engineering,North Carolina State University,Raleigh,NC 27695-7906,US Shu-Cherng FANG Edward P.Fitts Department of Industrial and Systems Engineering,North Carolina State University,Raleigh,NC 27695-7906,USA Department of Mathematical Sciences,Tsinghua University,Beijing 100084,China College of Management,Dalian University of Technology,Dalian 116024,China. 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2009年第1期49-62,共14页
This paper shows that the problem of minimizing a linear fractional function subject to asystem of sup-T equations with a continuous Archimedean triangular norm T can be reduced to a 0-1linear fractional optimization ... This paper shows that the problem of minimizing a linear fractional function subject to asystem of sup-T equations with a continuous Archimedean triangular norm T can be reduced to a 0-1linear fractional optimization problem in polynomial time.Consequently,parametrization techniques,e.g.,Dinkelbach's algorithm,can be applied by solving a classical set covering problem in each iteration.Similar reduction can also be performed on the sup-T equation constrained optimization problems withan objective function being monotone in each variable separately.This method could be extended aswell to the case in which the triangular norm is non-Archimedean. 展开更多
关键词 Fractional optimization fuzzy relational equations triangular norms.
原文传递
POINTED REPRESENTATIONS OFINFINITE DIMENSIONAL LIE ALGEBRAS
10
作者 XUXIANG 《Chinese Annals of Mathematics,Series B》 SCIE CSCD 1995年第2期255-260,共6页
A contravaried bilinear pairing X on every M(ρ) × M(ρθ) is determined and it is provedthat M(ρ)is irreducible if and only if K is left nondegellerate. It is also proved that every cyclicpointed module is a qu... A contravaried bilinear pairing X on every M(ρ) × M(ρθ) is determined and it is provedthat M(ρ)is irreducible if and only if K is left nondegellerate. It is also proved that every cyclicpointed module is a quotient of some Verma-like poillted module; moreover if it is irreduciblethen it is a quotieDt of the Vermarlike poiDted module by the left kernel of some bilinearpairing K. In case the mass fUnction is symmetric, there exists a bilinear form on M(ρ). It isproved that unitals pointed modules are integrable. In addition, a characterization of the massfunctions of Kac-Moody algebras is given, which is a generalization of the finite dimensionalLie algebras case. 展开更多
关键词 Pointed representation Primitive cycle Mass function Bilinear pairing.
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部