A Berry–Esseen bound is obtained for self-normalized martingales under the assumption of finite moments.The bound coincides with the classical Berry–Esseenboundforstandardizedmartingales.Anexampleisgiventoshowtheopt...A Berry–Esseen bound is obtained for self-normalized martingales under the assumption of finite moments.The bound coincides with the classical Berry–Esseenboundforstandardizedmartingales.Anexampleisgiventoshowtheoptimality of the bound.Applications to Student’s statistic and autoregressive process are also discussed.展开更多
Let 1≤q≤∞,b be a slowly varying function and letΦ:[0,∞)■[0,∞)be an increasing convex function withΦ(0)=0 and■Φ(r)=∞.In this paper,we present a new class of Doob’s maximal inequality on Orlicz-Lorentz-Karam...Let 1≤q≤∞,b be a slowly varying function and letΦ:[0,∞)■[0,∞)be an increasing convex function withΦ(0)=0 and■Φ(r)=∞.In this paper,we present a new class of Doob’s maximal inequality on Orlicz-Lorentz-Karamata spaces LΦ,q,b.The results are new,even for the Lorentz-Karamata spaces withΦ(t)=tp,the Orlicz-Lorentz spaces with b≡1,and weak Orlicz-Karamata spaces with q=∞in the framework of LΦ,q,b-Moreover,we obtain some even stronger qualitative results that can remove the△2-condition of Liu,Hou and Wang(Sci China Math,2010,53(4):905-916).展开更多
With the continuous growth of online news articles,there arises the necessity for an efficient abstractive summarization technique for the problem of information overloading.Abstractive summarization is highly complex...With the continuous growth of online news articles,there arises the necessity for an efficient abstractive summarization technique for the problem of information overloading.Abstractive summarization is highly complex and requires a deeper understanding and proper reasoning to come up with its own summary outline.Abstractive summarization task is framed as seq2seq modeling.Existing seq2seq methods perform better on short sequences;however,for long sequences,the performance degrades due to high computation and hence a two-phase self-normalized deep neural document summarization model consisting of improvised extractive cosine normalization and seq2seq abstractive phases has been proposed in this paper.The novelty is to parallelize the sequence computation training by incorporating feed-forward,the self-normalized neural network in the Extractive phase using Intra Cosine Attention Similarity(Ext-ICAS)with sentence dependency position.Also,it does not require any normalization technique explicitly.Our proposed abstractive Bidirectional Long Short Term Memory(Bi-LSTM)encoder sequence model performs better than the Bidirectional Gated Recurrent Unit(Bi-GRU)encoder with minimum training loss and with fast convergence.The proposed model was evaluated on the Cable News Network(CNN)/Daily Mail dataset and an average rouge score of 0.435 was achieved also computational training in the extractive phase was reduced by 59%with an average number of similarity computations.展开更多
Self-normalizing neural networks(SNN)regulate the activation and gradient flows through activation functions with the self-normalization property.As SNNs do not rely on norms computed from minibatches,they are more fr...Self-normalizing neural networks(SNN)regulate the activation and gradient flows through activation functions with the self-normalization property.As SNNs do not rely on norms computed from minibatches,they are more friendly to data parallelism,kernel fusion,and emerging architectures such as ReRAM-based accelerators.However,existing SNNs have mainly demonstrated their effectiveness on toy datasets and fall short in accuracy when dealing with large-scale tasks like ImageNet.They lack the strong normalization,regularization,and expression power required for wider,deeper models and larger-scale tasks.To enhance the normalization strength,this paper introduces a comprehensive and practical definition of the self-normalization property in terms of the stability and attractiveness of the statistical fixed points.It is comprehensive as it jointly considers all the fixed points used by existing studies:the first and second moment of forward activation and the expected Frobenius norm of backward gradient.The practicality comes from the analytical equations provided by our paper to assess the stability and attractiveness of each fixed point,which are derived from theoretical analysis of the forward and backward signals.The proposed definition is applied to a meta activation function inspired by prior research,leading to a stronger self-normalizing activation function named‘‘bi-scaled exponential linear unit with backward standardized’’(bSELU-BSTD).We provide both theoretical and empirical evidence to show that it is superior to existing studies.To enhance the regularization and expression power,we further propose scaled-Mixup and channel-wise scale&shift.With these three techniques,our approach achieves 75.23%top-1 accuracy on the ImageNet with Conv MobileNet V1,surpassing the performance of existing self-normalizing activation functions.To the best of our knowledge,this is the first SNN that achieves comparable accuracy to batch normalization on ImageNet.展开更多
From the point of view of the basic option model, enterprise investment decision making under uncertainty is studied based on the martingale method. The study shows that investment options and yields are increasing fu...From the point of view of the basic option model, enterprise investment decision making under uncertainty is studied based on the martingale method. The study shows that investment options and yields are increasing functions of time, and when the option equals the yield, the investment opportunity cost is the least, which is the appropriate time for the enterprise investment. Under the condition that the investment yield is an increasing function of time, the investment opportunity cost is also an increasing function of time after the time when the investment option equals the investment yield. So the investors should invest as soon as possible, otherwise they should stop investment forever in this project. It is impossible to acquire more investment yields by indefinitely delaying the investment. Meanwhile, the study also shows that the martingale method, used widely in financial investment theory, is a powerful tool for enterprise investment decision making.展开更多
Let 2≤p【∞ and let (f n) be a martingale. Using exponential bounds of the probabilities of the type P(|f n|】λ‖T(f n)‖ ∞) for some quasi-linear operators acting on martingales, we estimate upper bounds for t...Let 2≤p【∞ and let (f n) be a martingale. Using exponential bounds of the probabilities of the type P(|f n|】λ‖T(f n)‖ ∞) for some quasi-linear operators acting on martingales, we estimate upper bounds for the L p-norms of the maximal functions of martinglaes. Our result is the extension and improvements of the results obtained previously by HITCZENKO and ZENG .展开更多
In this article, the authors introduce some new Lorentz spaces for martingales, which are extensions of Hardy spaces of martingales. Then they discuss their basic properties, embedding relationships, and interpolation...In this article, the authors introduce some new Lorentz spaces for martingales, which are extensions of Hardy spaces of martingales. Then they discuss their basic properties, embedding relationships, and interpolation spaces between them, during which the use of rearrangement good-λ-inequality plays an important role.展开更多
In this article, several weak Hardy spaces of Banach-space-valued martingales are introduced, some atomic decomposition theorems for them are established and their duals are investigated. The results closely depend on...In this article, several weak Hardy spaces of Banach-space-valued martingales are introduced, some atomic decomposition theorems for them are established and their duals are investigated. The results closely depend on the geometrical properties of the Banach space in which the martingales take values.展开更多
In this article, some necessary and sufficient conditions are shown in order that the inequality of the form Ф1(λ)Pu(f^*〉λ)≤Ev (Ф2(C|f∞|)) holds with some constant C 〉 0 independent of martingale f...In this article, some necessary and sufficient conditions are shown in order that the inequality of the form Ф1(λ)Pu(f^*〉λ)≤Ev (Ф2(C|f∞|)) holds with some constant C 〉 0 independent of martingale f = (fn)n≥0 and λ 〉 0, where Фl and Ф2 are a pair of Young functions, f^*=sup n≥0|fn| adn f∞=lim n→∞ fn a.e.展开更多
A generalized Rosenthal's inequality for Banach-space-valued martingales is proved, which extends the corresponding results in the previous literatures and characterizes the p-uniform smoothness and q-uniform convexi...A generalized Rosenthal's inequality for Banach-space-valued martingales is proved, which extends the corresponding results in the previous literatures and characterizes the p-uniform smoothness and q-uniform convexity of the underlying Banach space. As an application of this inequality, the strong law of large numbers for Banach-space-valued martingales is also given.展开更多
In this paper we introduce the concept of two-parameterB-valued strong martingales and investigate some features of these strong martingales. We also characterizep-smoothable Banach spaces in terms of these strong mar...In this paper we introduce the concept of two-parameterB-valued strong martingales and investigate some features of these strong martingales. We also characterizep-smoothable Banach spaces in terms of these strong martingales.展开更多
In this paper, we apply function parameters to real interpolation of Lorentz- Orlicz martingale spaces. Some new interpolation theorems are formulated which generalize some known results in Lorentz spaces An introduce...In this paper, we apply function parameters to real interpolation of Lorentz- Orlicz martingale spaces. Some new interpolation theorems are formulated which generalize some known results in Lorentz spaces An introduced by Sharpley.展开更多
Let x (xn)≥1 be a martingale on a noncommutative probability space n (M, r) and (wn)n≥1 a sequence of positive numbers such that Wn = ∑ k=1^n wk →∞ as n →∞ We prove that x = (x.)n≥1 converges in E(M...Let x (xn)≥1 be a martingale on a noncommutative probability space n (M, r) and (wn)n≥1 a sequence of positive numbers such that Wn = ∑ k=1^n wk →∞ as n →∞ We prove that x = (x.)n≥1 converges in E(M) if and only if (σn(x)n≥1 converges in E(.hd), where E(A//) is a noncommutative rearrangement invariant Banach function space with the Fatou property and σn(x) is given by σn(x) = 1/Wn ∑k=1^n wkxk, n=1, 2, .If in addition, E(Ad) has absolutely continuous norm, then, (an(x))≥1 converges in E(.M) if and only if x = (Xn)n≥1 is uniformly integrable and its limit in measure topology x∞∈ E(M).展开更多
In this article the authors introduce the minimal operator on martingale spaces, discuss some one-weight and two-weight inequalities for the minimal operator and characterize the conditions which make the inequalities...In this article the authors introduce the minimal operator on martingale spaces, discuss some one-weight and two-weight inequalities for the minimal operator and characterize the conditions which make the inequalities hold.展开更多
The atomic decompositions of weak Hardy spaces of Banach-space-valued martingales are given. With the help of the atomic decompositions, some inequalities for B-valued martingales are established in the case 0〈r≤1. ...The atomic decompositions of weak Hardy spaces of Banach-space-valued martingales are given. With the help of the atomic decompositions, some inequalities for B-valued martingales are established in the case 0〈r≤1. Here the results are connected closely with the p-uniform smoothness and q-uniform convexity of Banach spaces which the martingales take values in.展开更多
文摘A Berry–Esseen bound is obtained for self-normalized martingales under the assumption of finite moments.The bound coincides with the classical Berry–Esseenboundforstandardizedmartingales.Anexampleisgiventoshowtheoptimality of the bound.Applications to Student’s statistic and autoregressive process are also discussed.
基金supported by the National Natural Science Foundation of China(11801001,12101223)the Scientific Research Fund of Hunan Provincial Education Department(20C0780)the Natural Science Foundation of Hunan Province(2022JJ40145,2022JJ40146)。
文摘Let 1≤q≤∞,b be a slowly varying function and letΦ:[0,∞)■[0,∞)be an increasing convex function withΦ(0)=0 and■Φ(r)=∞.In this paper,we present a new class of Doob’s maximal inequality on Orlicz-Lorentz-Karamata spaces LΦ,q,b.The results are new,even for the Lorentz-Karamata spaces withΦ(t)=tp,the Orlicz-Lorentz spaces with b≡1,and weak Orlicz-Karamata spaces with q=∞in the framework of LΦ,q,b-Moreover,we obtain some even stronger qualitative results that can remove the△2-condition of Liu,Hou and Wang(Sci China Math,2010,53(4):905-916).
文摘With the continuous growth of online news articles,there arises the necessity for an efficient abstractive summarization technique for the problem of information overloading.Abstractive summarization is highly complex and requires a deeper understanding and proper reasoning to come up with its own summary outline.Abstractive summarization task is framed as seq2seq modeling.Existing seq2seq methods perform better on short sequences;however,for long sequences,the performance degrades due to high computation and hence a two-phase self-normalized deep neural document summarization model consisting of improvised extractive cosine normalization and seq2seq abstractive phases has been proposed in this paper.The novelty is to parallelize the sequence computation training by incorporating feed-forward,the self-normalized neural network in the Extractive phase using Intra Cosine Attention Similarity(Ext-ICAS)with sentence dependency position.Also,it does not require any normalization technique explicitly.Our proposed abstractive Bidirectional Long Short Term Memory(Bi-LSTM)encoder sequence model performs better than the Bidirectional Gated Recurrent Unit(Bi-GRU)encoder with minimum training loss and with fast convergence.The proposed model was evaluated on the Cable News Network(CNN)/Daily Mail dataset and an average rouge score of 0.435 was achieved also computational training in the extractive phase was reduced by 59%with an average number of similarity computations.
基金National Key R&D Program of China(2018AAA0102600)National Natural Science Foundation of China(No.61876215,62106119)+1 种基金Beijing Academy of Artificial Intelligence(BAAI),ChinaChinese Institute for Brain Research,Beijing,and the Science and Technology Major Project of Guangzhou,China(202007030006).
文摘Self-normalizing neural networks(SNN)regulate the activation and gradient flows through activation functions with the self-normalization property.As SNNs do not rely on norms computed from minibatches,they are more friendly to data parallelism,kernel fusion,and emerging architectures such as ReRAM-based accelerators.However,existing SNNs have mainly demonstrated their effectiveness on toy datasets and fall short in accuracy when dealing with large-scale tasks like ImageNet.They lack the strong normalization,regularization,and expression power required for wider,deeper models and larger-scale tasks.To enhance the normalization strength,this paper introduces a comprehensive and practical definition of the self-normalization property in terms of the stability and attractiveness of the statistical fixed points.It is comprehensive as it jointly considers all the fixed points used by existing studies:the first and second moment of forward activation and the expected Frobenius norm of backward gradient.The practicality comes from the analytical equations provided by our paper to assess the stability and attractiveness of each fixed point,which are derived from theoretical analysis of the forward and backward signals.The proposed definition is applied to a meta activation function inspired by prior research,leading to a stronger self-normalizing activation function named‘‘bi-scaled exponential linear unit with backward standardized’’(bSELU-BSTD).We provide both theoretical and empirical evidence to show that it is superior to existing studies.To enhance the regularization and expression power,we further propose scaled-Mixup and channel-wise scale&shift.With these three techniques,our approach achieves 75.23%top-1 accuracy on the ImageNet with Conv MobileNet V1,surpassing the performance of existing self-normalizing activation functions.To the best of our knowledge,this is the first SNN that achieves comparable accuracy to batch normalization on ImageNet.
文摘From the point of view of the basic option model, enterprise investment decision making under uncertainty is studied based on the martingale method. The study shows that investment options and yields are increasing functions of time, and when the option equals the yield, the investment opportunity cost is the least, which is the appropriate time for the enterprise investment. Under the condition that the investment yield is an increasing function of time, the investment opportunity cost is also an increasing function of time after the time when the investment option equals the investment yield. So the investors should invest as soon as possible, otherwise they should stop investment forever in this project. It is impossible to acquire more investment yields by indefinitely delaying the investment. Meanwhile, the study also shows that the martingale method, used widely in financial investment theory, is a powerful tool for enterprise investment decision making.
文摘Let 2≤p【∞ and let (f n) be a martingale. Using exponential bounds of the probabilities of the type P(|f n|】λ‖T(f n)‖ ∞) for some quasi-linear operators acting on martingales, we estimate upper bounds for the L p-norms of the maximal functions of martinglaes. Our result is the extension and improvements of the results obtained previously by HITCZENKO and ZENG .
文摘In this article, the authors introduce some new Lorentz spaces for martingales, which are extensions of Hardy spaces of martingales. Then they discuss their basic properties, embedding relationships, and interpolation spaces between them, during which the use of rearrangement good-λ-inequality plays an important role.
基金Supported by the National Natural Foundation of China(10671147)
文摘In this article, several weak Hardy spaces of Banach-space-valued martingales are introduced, some atomic decomposition theorems for them are established and their duals are investigated. The results closely depend on the geometrical properties of the Banach space in which the martingales take values.
文摘In this article, some necessary and sufficient conditions are shown in order that the inequality of the form Ф1(λ)Pu(f^*〉λ)≤Ev (Ф2(C|f∞|)) holds with some constant C 〉 0 independent of martingale f = (fn)n≥0 and λ 〉 0, where Фl and Ф2 are a pair of Young functions, f^*=sup n≥0|fn| adn f∞=lim n→∞ fn a.e.
基金Supported by the Scientific Research Foundation of Hubei Province (D200613001)the National Natural Science Foundation of China (10371093)
文摘A generalized Rosenthal's inequality for Banach-space-valued martingales is proved, which extends the corresponding results in the previous literatures and characterizes the p-uniform smoothness and q-uniform convexity of the underlying Banach space. As an application of this inequality, the strong law of large numbers for Banach-space-valued martingales is also given.
文摘In this paper we introduce the concept of two-parameterB-valued strong martingales and investigate some features of these strong martingales. We also characterizep-smoothable Banach spaces in terms of these strong martingales.
基金supported by National Natural Science Foundation of China(Grant No.11201354)Hubei Province Key Laboratory of Systems Science in Metallurgical Process(Wuhan University of Science and Technology)(Y201321)National Natural Science Foundation of Pre-Research Item(2011XG005)
文摘In this paper, we apply function parameters to real interpolation of Lorentz- Orlicz martingale spaces. Some new interpolation theorems are formulated which generalize some known results in Lorentz spaces An introduced by Sharpley.
基金supported by the National Natural Science Foundation of China (11071190)
文摘Let x (xn)≥1 be a martingale on a noncommutative probability space n (M, r) and (wn)n≥1 a sequence of positive numbers such that Wn = ∑ k=1^n wk →∞ as n →∞ We prove that x = (x.)n≥1 converges in E(M) if and only if (σn(x)n≥1 converges in E(.hd), where E(A//) is a noncommutative rearrangement invariant Banach function space with the Fatou property and σn(x) is given by σn(x) = 1/Wn ∑k=1^n wkxk, n=1, 2, .If in addition, E(Ad) has absolutely continuous norm, then, (an(x))≥1 converges in E(.M) if and only if x = (Xn)n≥1 is uniformly integrable and its limit in measure topology x∞∈ E(M).
基金This work was supported by the NSF of China and the aid financial plan for the backbone of the young teachers of University of Henan
文摘In this article the authors introduce the minimal operator on martingale spaces, discuss some one-weight and two-weight inequalities for the minimal operator and characterize the conditions which make the inequalities hold.
基金Supported by the National Natural Science Foun-dation of China (10371093)
文摘The atomic decompositions of weak Hardy spaces of Banach-space-valued martingales are given. With the help of the atomic decompositions, some inequalities for B-valued martingales are established in the case 0〈r≤1. Here the results are connected closely with the p-uniform smoothness and q-uniform convexity of Banach spaces which the martingales take values in.