Abstract: This paper presents a synthetic estimation method for estimating reliability parameters from zero-failure data. For zero-failure data from a two-parameter exponential distribution, a hierarchical Bayesian estimate of the failure probability is given. After failure information is introduced, hierarchical Bayesian and synthetic estimates of the failure probability, as well as a synthetic estimate of the reliability, are derived. The method is applied to and analyzed on a practical problem in which the life distribution of an engine follows a two-parameter exponential distribution.
Funding: Microsoft Research Asia Internet Services in Academic Research Fund (No. FY07-RES-OPP-116); the Science and Technology Development Program of Tianjin (No. 06YFGZGX05900)
Abstract: To improve question answering (QA) performance on real-world web data sets, a new set of question classes and a general answer re-ranking model are defined. Using a pre-defined dictionary and grammatical analysis, the question classifier feeds both semantic and grammatical information into information retrieval and machine learning methods as training features, including the question word, the main verb of the question, the dependency structure, the position of the main auxiliary verb, the main noun of the question, and the top hypernym of the main noun. The QA query results are then re-ranked using the question class information. Experiments show that the classifier accurately classifies questions in real-world web data sets and that the QA results are clearly improved after re-ranking. The results indicate that combining semantic and grammatical information improves the performance of applications such as QA that are built on real-world web data sets.
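Abstract: As an emerging discipline, data science has drawn attention to the question of whether it is scientific, yet its scientific questions have not been stated explicitly. This paper examines the "scientific nature" of data science from four perspectives: research paradigm and methodology; falsifiability and reproducibility; scientific spirit and rapid iteration; and scientific research programme and theoretical system. It also answers the question of why data science is an emerging science. On this basis, drawing on the DIKW model (DIKW Pyramid or Hierarchy), the DMP (Data-Model-Problem) model, the statistical and machine learning methodologies of data science, and the processes and activities of data science, the paper proposes seven core scientific questions of data science: whether explanation comes before, after, or not at all; whether to align the problem with the data or the data with the problem; whether to trust the data or the model more; whether to prioritize performance or interpretability; how to split the data; how to use known data to solve problems about unknown data; and whether to keep the human in the loop or out of the loop. Finally, four recommendations for data science research are offered: focus on theoretical research on data science itself; promote further separation and specialization of the science, technology, and engineering of data; strengthen the theory and practice of data science empowered by artificial intelligence; and strengthen the linkage between Data Science as A Discipline and Data Science Within A Discipline.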
Funding: the Ningbo University of Technology Science Foundation and Ningbo Natural Science Foundation (No. 2013A610108)
Abstract: This paper introduces a new method, the E-Bayesian estimation method, for estimating reliability from zero-failure data. The E-Bayesian estimate of the reliability is defined. Based on this definition, formulas for the E-Bayesian and hierarchical Bayesian estimates of the reliability are provided, and a property of the E-Bayesian estimate, namely its relation to the hierarchical Bayesian estimate, is discussed. Calculations on practical problems show that the proposed method is feasible and easy to apply.
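The abstract does not state the prior or the formulas, so the following is only a minimal sketch of the general idea behind an E-Bayesian estimate: the Bayes estimate is averaged over a prior placed on the hyperparameter. Everything specific here is an assumption made for illustration, not the paper's stated setup: a Beta(1, b) prior on the failure probability p, a binomial likelihood with zero failures in s trials, squared-error loss, and b uniform on (1, c). The function names are hypothetical.

```python
import math


def bayes_failure_prob(s: int, b: float) -> float:
    """Bayes estimate of p for a fixed hyperparameter b.

    Assumed setup: prior Beta(1, b), zero failures in s trials, so the
    posterior is Beta(1, s + b) and its mean is 1 / (s + b + 1).
    """
    return 1.0 / (s + b + 1.0)


def e_bayes_failure_prob(s: int, c: float) -> float:
    """E-Bayesian estimate: the Bayes estimate averaged over b ~ Uniform(1, c).

    Closed form of (1 / (c - 1)) * integral_1^c db / (s + b + 1).
    """
    return math.log((s + c + 1.0) / (s + 2.0)) / (c - 1.0)


def e_bayes_numeric(s: int, c: float, steps: int = 100_000) -> float:
    """Midpoint-rule check of the same average, independent of the closed form."""
    width = (c - 1.0) / steps
    total = sum(
        bayes_failure_prob(s, 1.0 + (k + 0.5) * width) for k in range(steps)
    )
    return total * width / (c - 1.0)


def e_bayes_reliability(s: int, c: float) -> float:
    """Reliability estimate under the assumed setup, taking R = 1 - p."""
    return 1.0 - e_bayes_failure_prob(s, c)
```

Under these assumptions the E-Bayesian estimate always lies between the Bayes estimates obtained at the two ends of the hyperparameter range, b = 1 and b = c, which is one way to see why it is less sensitive to the choice of a single hyperparameter value.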
Abstract: For zero-failure data $(t_i, n_i)$ at time $t_i$, when the prior distribution of the failure probability $p_i = P\{T < t_i\}$ is the incomplete Fisher-Z distribution, Fisher-Z$(0, \lambda_i; a, b)$, the author gives the hierarchical Bayesian estimate of $p_i$ and obtains the corresponding estimate of reliability under the zero-failure condition. A practical calculation example using the theory is also given.