One way to reduce energy consumption in wireless sensor networks is to reduce the number of active nodes in the network. When sensors are redundantly deployed, a subset of sensors should be selected to actively monito...One way to reduce energy consumption in wireless sensor networks is to reduce the number of active nodes in the network. When sensors are redundantly deployed, a subset of sensors should be selected to actively monitor the field (referred to as a "cover"), whereas the rest of the sensors should be put to sleep to conserve their batteries. In this paper, a learning automata based algorithm for energy-efficient monitoring in wireless sensor networks (EEMLA) is proposed. Each node in EEMLA algorithm is equipped with a learning automaton which decides for the node to be active or not at any time during the operation of the network. Using feedback received from neighboring nodes, each node gradually learns its proper state during the operation of the network. Experimental results have shown that the proposed monitoring algorithm in comparison to other existing methods such as Tian and LUC can better prolong the network lifetime.展开更多
The mobile Ad Hoc network(MANET)is a self-organizing and self-configuring wireless network,consisting of a set of mobile nodes.The design of efficient routing protocols for MANET has always been an active area of rese...The mobile Ad Hoc network(MANET)is a self-organizing and self-configuring wireless network,consisting of a set of mobile nodes.The design of efficient routing protocols for MANET has always been an active area of research.In existing routing algorithms,however,the current work does not scale well enough to ensure route stability when the mobility and distribution of nodes vary with time.In addition,each node in MANET has only limited initial energy,so energy conservation and balance must be taken into account.An efficient routing algorithm should not only be stable but also energy saving and balanced,within the dynamic network environment.To address the above problems,we propose a stable and energy-efficient routing algorithm,based on learning automata(LA)theory for MANET.First,we construct a new node stability measurement model and define an effective energy ratio function.On that basis,we give the node a weighted value,which is used as the iteration parameter for LA.Next,we construct an LA theory-based feedback mechanism for the MANET environment to optimize the selection of available routes and to prove the convergence of our algorithm.The experiments show that our proposed LA-based routing algorithm for MANET achieved the best performance in route survival time,energy consumption,energy balance,and acceptable per-formance in end-to-end delay and packet delivery ratio.展开更多
It has been shown in recent economic and statistical studies that combining forecasts may produce more accurate forecasts than individual ones. However, the literature on combining forecasts has almost exclusively foc...It has been shown in recent economic and statistical studies that combining forecasts may produce more accurate forecasts than individual ones. However, the literature on combining forecasts has almost exclusively focused on linear combining forecasts. In this paper, a new nonlinear combination forecasting method based on fuzzy inference system is present to overcome the difficulties and drawbacks in linear combination modeling of non-stationary time series. Furthermore, the optimization algorithm based on a hierarchical structure of learning automata is used to identify the parameters of the fuzzy system. Experiment results related to numerical examples demonstrate that the new technique has excellent identification performances and forecasting accuracy superior to other existing linear combining forecasts.展开更多
The power market is a typical imperfectly competitive market where power suppliers gain higher profits through strategic bidding behaviors.Most existing studies assume that a power supplier is accessible to the suffic...The power market is a typical imperfectly competitive market where power suppliers gain higher profits through strategic bidding behaviors.Most existing studies assume that a power supplier is accessible to the sufficient market information to derive an optimal bidding strategy.However,this assumption may not be true in reality,particularly when a power market is newly launched.To help power suppliers bid with the limited information,a modified continuous action reinforcement learning automata algorithm is proposed.This algorithm introduces the discretization and Dyna structure into continuous action reinforcement learning automata algorithm for easy implementation in a repeated game.Simulation results verify the effectiveness of the proposed learning algorithm.展开更多
Software systems are present all around us and playing their vital roles in our daily life.The correct functioning of these systems is of prime concern.In addition to classical testing techniques,formal techniques lik...Software systems are present all around us and playing their vital roles in our daily life.The correct functioning of these systems is of prime concern.In addition to classical testing techniques,formal techniques like model checking are used to reinforce the quality and reliability of software systems.However,obtaining of behavior model,which is essential for model-based techniques,of unknown software systems is a challenging task.To mitigate this problem,an emerging black-box analysis technique,called Model Learning,can be applied.It complements existing model-based testing and verification approaches by providing behavior models of blackbox systems fully automatically.This paper surveys the model learning technique,which recently has attracted much attention from researchers,especially from the domains of testing and verification.First,we review the background and foundations of model learning,which form the basis of subsequent sections.Second,we present some well-known model learning tools and provide their merits and shortcomings in the form of a comparison table.Third,we describe the successful applications of model learning in multidisciplinary fields,current challenges along with possible future works,and concluding remarks.展开更多
Operant conditioning is one of the fundamental mechanisms of animal learning, which suggests that the behavior of all animals, from protists to humans, is guided by its consequences. We present a new stochastic learni...Operant conditioning is one of the fundamental mechanisms of animal learning, which suggests that the behavior of all animals, from protists to humans, is guided by its consequences. We present a new stochastic learning automaton called a Skinner au- tomaton that is a psychological model for formalizing the theory of operant conditioning. We identify animal operant learning with a thermodynamic process, and derive a so-called Skinner algorithm from Monte Carlo method as well as Metropolis algo- rithm and simulated annealing. Under certain conditions, we prove that the Skinner automaton is expedient, 6-optimal, optimal, and that the operant probabilities converge to the set of stable roots with probability of 1. The Skinner automaton enables ma- chines to autonomously learn in an animal-like way.展开更多
Dynamic power management (DPM) in wireless sensor nodes is a well-known technique for reducing idle energy consumption. DPM controls a node's operating mode by dynamically toggling the on/off status of its units ba...Dynamic power management (DPM) in wireless sensor nodes is a well-known technique for reducing idle energy consumption. DPM controls a node's operating mode by dynamically toggling the on/off status of its units based on predictions of event occurrences. However, since each mode change induces some overhead in its own right, guaranteeing DPM's eificiency is no mean feat in environments exhibiting non-determinism and uncertainty with unknown statistics. Our solution suite in this paper, collectively referred to as cognitive power management (CPM), is a principled attempt toward enabling DPM in statistically unknown settings and gives two different analytical guarantees. Our first design is based on learning automata and guarantees better-than-pure-chance DPM in the face of non-stationary event processes. Our second solution caters tor an even more general setting in which event occurrences may take on an adversarial character. In this case, we formulate the interaction of an individual mote with its environment in terms of a repeated zero-sum game in which the node relies on a no-external-regret procedure to learn its mini-max strategies in an online fashion. We conduct numerical experiments to measure the performance of our schemes in terms of network lifetime and event loss percentage.展开更多
基金supported by the Islamic Azad University Urmia Brach,Iran
文摘One way to reduce energy consumption in wireless sensor networks is to reduce the number of active nodes in the network. When sensors are redundantly deployed, a subset of sensors should be selected to actively monitor the field (referred to as a "cover"), whereas the rest of the sensors should be put to sleep to conserve their batteries. In this paper, a learning automata based algorithm for energy-efficient monitoring in wireless sensor networks (EEMLA) is proposed. Each node in EEMLA algorithm is equipped with a learning automaton which decides for the node to be active or not at any time during the operation of the network. Using feedback received from neighboring nodes, each node gradually learns its proper state during the operation of the network. Experimental results have shown that the proposed monitoring algorithm in comparison to other existing methods such as Tian and LUC can better prolong the network lifetime.
基金The work was supported by the National Natural Science Foundation of China(No.61772386)Guangdong provincial science and technology project(No.2015B010131007)。
文摘The mobile Ad Hoc network(MANET)is a self-organizing and self-configuring wireless network,consisting of a set of mobile nodes.The design of efficient routing protocols for MANET has always been an active area of research.In existing routing algorithms,however,the current work does not scale well enough to ensure route stability when the mobility and distribution of nodes vary with time.In addition,each node in MANET has only limited initial energy,so energy conservation and balance must be taken into account.An efficient routing algorithm should not only be stable but also energy saving and balanced,within the dynamic network environment.To address the above problems,we propose a stable and energy-efficient routing algorithm,based on learning automata(LA)theory for MANET.First,we construct a new node stability measurement model and define an effective energy ratio function.On that basis,we give the node a weighted value,which is used as the iteration parameter for LA.Next,we construct an LA theory-based feedback mechanism for the MANET environment to optimize the selection of available routes and to prove the convergence of our algorithm.The experiments show that our proposed LA-based routing algorithm for MANET achieved the best performance in route survival time,energy consumption,energy balance,and acceptable per-formance in end-to-end delay and packet delivery ratio.
基金Funded by the Excellent Young Teachers of MOE (350) and Chongqing Education Committee Foundation
文摘It has been shown in recent economic and statistical studies that combining forecasts may produce more accurate forecasts than individual ones. However, the literature on combining forecasts has almost exclusively focused on linear combining forecasts. In this paper, a new nonlinear combination forecasting method based on fuzzy inference system is present to overcome the difficulties and drawbacks in linear combination modeling of non-stationary time series. Furthermore, the optimization algorithm based on a hierarchical structure of learning automata is used to identify the parameters of the fuzzy system. Experiment results related to numerical examples demonstrate that the new technique has excellent identification performances and forecasting accuracy superior to other existing linear combining forecasts.
基金This work was supported by the National Natural Science Foundation of China(No.U1866206).
文摘The power market is a typical imperfectly competitive market where power suppliers gain higher profits through strategic bidding behaviors.Most existing studies assume that a power supplier is accessible to the sufficient market information to derive an optimal bidding strategy.However,this assumption may not be true in reality,particularly when a power market is newly launched.To help power suppliers bid with the limited information,a modified continuous action reinforcement learning automata algorithm is proposed.This algorithm introduces the discretization and Dyna structure into continuous action reinforcement learning automata algorithm for easy implementation in a repeated game.Simulation results verify the effectiveness of the proposed learning algorithm.
基金the National Natural Science Foundation of China(NSFC)(Grant Nos.61872016,61932007 and 61972013).
文摘Software systems are present all around us and playing their vital roles in our daily life.The correct functioning of these systems is of prime concern.In addition to classical testing techniques,formal techniques like model checking are used to reinforce the quality and reliability of software systems.However,obtaining of behavior model,which is essential for model-based techniques,of unknown software systems is a challenging task.To mitigate this problem,an emerging black-box analysis technique,called Model Learning,can be applied.It complements existing model-based testing and verification approaches by providing behavior models of blackbox systems fully automatically.This paper surveys the model learning technique,which recently has attracted much attention from researchers,especially from the domains of testing and verification.First,we review the background and foundations of model learning,which form the basis of subsequent sections.Second,we present some well-known model learning tools and provide their merits and shortcomings in the form of a comparison table.Third,we describe the successful applications of model learning in multidisciplinary fields,current challenges along with possible future works,and concluding remarks.
基金supported by the National Natural Science Foundation of China(Grant Nos.61075110,60774077,61375086)the National Basic Research Program of China("973" Project)(Grant No.2012CB720000)+3 种基金the National High-Tech Research and Development Program of China("863" Project)(Grant No.2007AA04Z226)the Beijing Natural Science Foundation(Grant No.4102011)the Key Project of S&T Plan of Beijing Municipal Commission of Education(Grant Nos.KM2008-10005016,KZ201210005001)the Specialized Research Fund for the Doctoral Program of Higher Education(Grant No.20101103110007)
文摘Operant conditioning is one of the fundamental mechanisms of animal learning, which suggests that the behavior of all animals, from protists to humans, is guided by its consequences. We present a new stochastic learning automaton called a Skinner au- tomaton that is a psychological model for formalizing the theory of operant conditioning. We identify animal operant learning with a thermodynamic process, and derive a so-called Skinner algorithm from Monte Carlo method as well as Metropolis algo- rithm and simulated annealing. Under certain conditions, we prove that the Skinner automaton is expedient, 6-optimal, optimal, and that the operant probabilities converge to the set of stable roots with probability of 1. The Skinner automaton enables ma- chines to autonomously learn in an animal-like way.
文摘Dynamic power management (DPM) in wireless sensor nodes is a well-known technique for reducing idle energy consumption. DPM controls a node's operating mode by dynamically toggling the on/off status of its units based on predictions of event occurrences. However, since each mode change induces some overhead in its own right, guaranteeing DPM's eificiency is no mean feat in environments exhibiting non-determinism and uncertainty with unknown statistics. Our solution suite in this paper, collectively referred to as cognitive power management (CPM), is a principled attempt toward enabling DPM in statistically unknown settings and gives two different analytical guarantees. Our first design is based on learning automata and guarantees better-than-pure-chance DPM in the face of non-stationary event processes. Our second solution caters tor an even more general setting in which event occurrences may take on an adversarial character. In this case, we formulate the interaction of an individual mote with its environment in terms of a repeated zero-sum game in which the node relies on a no-external-regret procedure to learn its mini-max strategies in an online fashion. We conduct numerical experiments to measure the performance of our schemes in terms of network lifetime and event loss percentage.