Two newly recorded species in the genus Mesaphorura Bomer, 1901 from China are described: Mesaphorura hylophila Rusek, 1982 and Mesaphorura pacifica Rusek, 1976. The important morphological characters of these Chines...Two newly recorded species in the genus Mesaphorura Bomer, 1901 from China are described: Mesaphorura hylophila Rusek, 1982 and Mesaphorura pacifica Rusek, 1976. The important morphological characters of these Chinese specimens are described in details. A key to Chinese Mesaphorura species is provided.展开更多
[Objective] Taking the knowledge of tea-science field as research object,an extraction method for the taxonomic relation of ontology conception was proposed in the paper.[Method] Through improving the rule based on la...[Objective] Taking the knowledge of tea-science field as research object,an extraction method for the taxonomic relation of ontology conception was proposed in the paper.[Method] Through improving the rule based on language mode,generalized suffix tree was constructed for the concept set of tea-science field,forming hierarchical structure and taxonomic relation among conceptions.[Result and Conclusion] Moreover,corresponding prototype system was developed based on above method,and test result indicating that the method was effective.展开更多
Two new species of Tullbergiidae, Paratullbergia qilianensis sp. nov. from Gansu and Pongeiella yinchuanensis sp. nov. from Ningxia, northwest China are described. Paratullbergia qilianensis is characterized by the pr...Two new species of Tullbergiidae, Paratullbergia qilianensis sp. nov. from Gansu and Pongeiella yinchuanensis sp. nov. from Ningxia, northwest China are described. Paratullbergia qilianensis is characterized by the presence of one pair of pseudocelli on thoracic segment Ⅰ, with two pairs of pseudocelli on each of thoracic segments Ⅱand Ⅲ, seta px present on abdominal segment Ⅳ, setae a2 and p4 on abdominal segment V as macrosetae, and a less differentiated sensillum p3 on abdominal segment Ⅴ.Pongeiella yinchuanensis is characterized by the pseudocelli of type Ⅲ, the presence of seta p3 on Th.Ⅱ and Ⅲ, five thickened sensilla on Ant.Ⅳ with four of them having distinct basal heels and seta oc2 on head as macroseta.展开更多
Objective: To discuss the relationship between the postoperative breast cancer with distant metastasis and the TCM syndromes classification. Methods: 160 postoperative 5-year breast cancer patients from 1995 to 2000 w...Objective: To discuss the relationship between the postoperative breast cancer with distant metastasis and the TCM syndromes classification. Methods: 160 postoperative 5-year breast cancer patients from 1995 to 2000 were tracked, summed up and analysized TCM syndromes as stagnation of hepatic qi, deficiency of spleen and pathogenic phlegm reten- tion, blood stasis and toxin stagnation, deficiencies of both blood and qi. Results: (1) For blood stasis and toxin stagnation TCM syndrome, the metastatic rate raised to 45% during 5 years. However, the metastatic rates of other three TCM syn- dromes are 15%, 17.5% and 22.5% respectively. The general distant metastasis rate was 27.5% (P<0.01). (2) Lymph node metastasis, tumor size, Her-2 and its receptor have no obvious relation with TCM syndromes classification (P>0.05). Conclu- sion: (1) TCM syndrome classification has close relation with breast cancer distant metastasis. Distant metastasis have close relationship with blood stasis and toxin stagnation syndrome. (2) Lymph node metastasis, tumor size, Her-2 and its receptor have no obvious relation with TCM syndromes classification, which suggested that metastatic ability has been programmed in the early stage of carcinoma initiation. (3) Significantly enlightening for predict the prognosis under the guide of TCM syn- drome classification and take right therapeutic strategy: attack pathogen and activate blood circulation against cancer.展开更多
Logistic regression is a fast classifier and can achieve higher accuracy on small training data.Moreover,it can work on both discrete and continuous attributes with nonlinear patterns.Based on these properties of logi...Logistic regression is a fast classifier and can achieve higher accuracy on small training data.Moreover,it can work on both discrete and continuous attributes with nonlinear patterns.Based on these properties of logistic regression,this paper proposed an algorithm,called evolutionary logistical regression classifier(ELRClass),to solve the classification of evolving data streams.This algorithm applies logistic regression repeatedly to a sliding window of samples in order to update the existing classifier,to keep this classifier if its performance is deteriorated by the reason of bursting noise,or to construct a new classifier if a major concept drift is detected.The intensive experimental results demonstrate the effectiveness of this algorithm.展开更多
Variable selection is one of the most fundamental problems in regression analysis. By sampling from the posterior distributions of candidate models, Bayesian variable selection via MCMC (Markov chain Monte-Carlo) is...Variable selection is one of the most fundamental problems in regression analysis. By sampling from the posterior distributions of candidate models, Bayesian variable selection via MCMC (Markov chain Monte-Carlo) is effective to overcome the computational burden of all-subset variable selection approaches. However, the convergence of the MCMC is often hard to determine and one is often not sure about if obtained samples are unbiased. This complication has limited the application of Bayesian variable selection in practice. Based on the idea of CFTP (coupling from the past), perfect sampling schemes have been developed to obtain independent samples from the posterior distribution for a variety of problems. Here the authors propose an efficient and effective perfect sampling algorithm for Bayesian variable selection of linear regression models, which independently and identically sample from the posterior distribution of the model space and can efficiently handle thousands of variables. The effectiveness of the authors' algorithm is illustrated by three simulation studies, which have up to thousands of variables, the authors' method is further illustrated in SNPs (single nucleotide polymorphisms) association study among RA (rheumatoid arthritis) patients.展开更多
AutoClass is an unsupervised Bayesian classification approach which seeks a maximum posterior probability classification for determining the optimal classes in large data sets. Using stellar photometric data from the ...AutoClass is an unsupervised Bayesian classification approach which seeks a maximum posterior probability classification for determining the optimal classes in large data sets. Using stellar photometric data from the Sloan Digital Sky Survey (SDSS) data release 7 (DR7), we utilize AutoClass to select non-stellar objects from this sample in order to build a pure stellar sample. For this purpose, the differences between PSF (point spread function) magnitudes and model magnitudes in five wavebands are taken as the input of AutoClass. Through clustering analysis of this sample by AutoClass, 617 non-stellar candidates are found. These candidates are identified by NED and SIMBAD databases. Most of the identified sources (13 from SIMBAD and 28 from NED respectively) are extragalactic sources (e.g., galaxies, HII, radio sources, infrared sources), some are peculiar stars (e.g., supernovas), and very few are normal stars. The extragalactic sources and peculiar stars of the identified objects occupy 94.1%. The result indicates that this method is an effective and robust clustering algorithm to find non-stellar objects and peculiar stars from the total stellar sample.展开更多
The cytology of 130 indeterminate nodules (Thy 3) was retrospectively reviewed according to the British Thyroid Association 2014 classification. Nodules were divided into Thy 3a (atypical features) and Thy 3f (fo...The cytology of 130 indeterminate nodules (Thy 3) was retrospectively reviewed according to the British Thyroid Association 2014 classification. Nodules were divided into Thy 3a (atypical features) and Thy 3f (follicular lesion) categories. Histology was available as a reference for 97 nodules. Pre-surgical evaluations comprised biochemical tests, color-Doppler ultrasonogrephy (US), semi-quantitative elastography-US (USE), contrast-enhanced US (CEUS), and mutation analysis from cytological slides. Thyroid malignancy was the final diagnosis for 19% of surgically- treated nodules. No statistically significant difference in the risk of malignancy was found between Thy 3a (26%) and Thy 3f (14%) nodules. Histology of the Thy 3a and Thy 3f nodules showed a higher incidence of Hurtle cell adenomas in Thy 3f (29%) than in Thy 3a (3%) nodules (P=0.01). The only pre-surgical difference concerned the BRAF V600E mutation, which was positive in some Thy 3a but not in any Thy 3f nodules (P=0.04). Receiver-operating characteristic (ROC) analysis was used to obtain cut-off values from US (score), USE (ELX 2/1 strain index), and CEUS (time-to- peak index and peak index) data. The cut-off values were similar for Thy 3a and Thy 3f nodules. Data showed that malignancy can be suspected if the US score is 〉2, ELX 1/2 strain index 〉1, time-to-peakindex 〉1, and peak index 〈1. In a sub-group of 24 revised nodules (12 Thy 3a and 12 Thy 3f) with histology as a reference, the diagnostic power of cumulative pre-surgical analysis by means of US, USE, and CEUS showed high positive and negative predictive values (83% and 100%, respectively) for the presence of malignancy in Thy 3a and Thy 3f nodules. In conclusion, in our series of revised Thy 3 nodules, malignancy was low and displayed no significant differences between Thy 3a and Thy 3f categories. The use of cut-offs based on histology as a reference could reduce surgery. Our data support the conviction that, in mutation-negative Thy 3a and Thy 3f nodules, observation should be the first choice when not all instrumental results are suspect.展开更多
基金sponsored by the Natural Science Foundation of Shanghai(17ZR1418700)the Forest Pest Investigation Project of Hebei Province
文摘Two newly recorded species in the genus Mesaphorura Bomer, 1901 from China are described: Mesaphorura hylophila Rusek, 1982 and Mesaphorura pacifica Rusek, 1976. The important morphological characters of these Chinese specimens are described in details. A key to Chinese Mesaphorura species is provided.
文摘[Objective] Taking the knowledge of tea-science field as research object,an extraction method for the taxonomic relation of ontology conception was proposed in the paper.[Method] Through improving the rule based on language mode,generalized suffix tree was constructed for the concept set of tea-science field,forming hierarchical structure and taxonomic relation among conceptions.[Result and Conclusion] Moreover,corresponding prototype system was developed based on above method,and test result indicating that the method was effective.
基金supported by the National Natural ScienceFoundation of China(31772509)the Natural Science Foundation of Shanghai(17ZR1418700)
文摘Two new species of Tullbergiidae, Paratullbergia qilianensis sp. nov. from Gansu and Pongeiella yinchuanensis sp. nov. from Ningxia, northwest China are described. Paratullbergia qilianensis is characterized by the presence of one pair of pseudocelli on thoracic segment Ⅰ, with two pairs of pseudocelli on each of thoracic segments Ⅱand Ⅲ, seta px present on abdominal segment Ⅳ, setae a2 and p4 on abdominal segment V as macrosetae, and a less differentiated sensillum p3 on abdominal segment Ⅴ.Pongeiella yinchuanensis is characterized by the pseudocelli of type Ⅲ, the presence of seta p3 on Th.Ⅱ and Ⅲ, five thickened sensilla on Ant.Ⅳ with four of them having distinct basal heels and seta oc2 on head as macroseta.
文摘Objective: To discuss the relationship between the postoperative breast cancer with distant metastasis and the TCM syndromes classification. Methods: 160 postoperative 5-year breast cancer patients from 1995 to 2000 were tracked, summed up and analysized TCM syndromes as stagnation of hepatic qi, deficiency of spleen and pathogenic phlegm reten- tion, blood stasis and toxin stagnation, deficiencies of both blood and qi. Results: (1) For blood stasis and toxin stagnation TCM syndrome, the metastatic rate raised to 45% during 5 years. However, the metastatic rates of other three TCM syn- dromes are 15%, 17.5% and 22.5% respectively. The general distant metastasis rate was 27.5% (P<0.01). (2) Lymph node metastasis, tumor size, Her-2 and its receptor have no obvious relation with TCM syndromes classification (P>0.05). Conclu- sion: (1) TCM syndrome classification has close relation with breast cancer distant metastasis. Distant metastasis have close relationship with blood stasis and toxin stagnation syndrome. (2) Lymph node metastasis, tumor size, Her-2 and its receptor have no obvious relation with TCM syndromes classification, which suggested that metastatic ability has been programmed in the early stage of carcinoma initiation. (3) Significantly enlightening for predict the prognosis under the guide of TCM syn- drome classification and take right therapeutic strategy: attack pathogen and activate blood circulation against cancer.
文摘Logistic regression is a fast classifier and can achieve higher accuracy on small training data.Moreover,it can work on both discrete and continuous attributes with nonlinear patterns.Based on these properties of logistic regression,this paper proposed an algorithm,called evolutionary logistical regression classifier(ELRClass),to solve the classification of evolving data streams.This algorithm applies logistic regression repeatedly to a sliding window of samples in order to update the existing classifier,to keep this classifier if its performance is deteriorated by the reason of bursting noise,or to construct a new classifier if a major concept drift is detected.The intensive experimental results demonstrate the effectiveness of this algorithm.
文摘Variable selection is one of the most fundamental problems in regression analysis. By sampling from the posterior distributions of candidate models, Bayesian variable selection via MCMC (Markov chain Monte-Carlo) is effective to overcome the computational burden of all-subset variable selection approaches. However, the convergence of the MCMC is often hard to determine and one is often not sure about if obtained samples are unbiased. This complication has limited the application of Bayesian variable selection in practice. Based on the idea of CFTP (coupling from the past), perfect sampling schemes have been developed to obtain independent samples from the posterior distribution for a variety of problems. Here the authors propose an efficient and effective perfect sampling algorithm for Bayesian variable selection of linear regression models, which independently and identically sample from the posterior distribution of the model space and can efficiently handle thousands of variables. The effectiveness of the authors' algorithm is illustrated by three simulation studies, which have up to thousands of variables, the authors' method is further illustrated in SNPs (single nucleotide polymorphisms) association study among RA (rheumatoid arthritis) patients.
基金supported by the National Natural Science Foundation of China (Grant Nos. 10778724 and 11033001)the Natural Science Foundation of Education Department of Hebei Province (GrantNo. ZD2010127) the Young Researcher Grant of National Astronomical Observatories, Chinese Academy of Sciences
文摘AutoClass is an unsupervised Bayesian classification approach which seeks a maximum posterior probability classification for determining the optimal classes in large data sets. Using stellar photometric data from the Sloan Digital Sky Survey (SDSS) data release 7 (DR7), we utilize AutoClass to select non-stellar objects from this sample in order to build a pure stellar sample. For this purpose, the differences between PSF (point spread function) magnitudes and model magnitudes in five wavebands are taken as the input of AutoClass. Through clustering analysis of this sample by AutoClass, 617 non-stellar candidates are found. These candidates are identified by NED and SIMBAD databases. Most of the identified sources (13 from SIMBAD and 28 from NED respectively) are extragalactic sources (e.g., galaxies, HII, radio sources, infrared sources), some are peculiar stars (e.g., supernovas), and very few are normal stars. The extragalactic sources and peculiar stars of the identified objects occupy 94.1%. The result indicates that this method is an effective and robust clustering algorithm to find non-stellar objects and peculiar stars from the total stellar sample.
文摘The cytology of 130 indeterminate nodules (Thy 3) was retrospectively reviewed according to the British Thyroid Association 2014 classification. Nodules were divided into Thy 3a (atypical features) and Thy 3f (follicular lesion) categories. Histology was available as a reference for 97 nodules. Pre-surgical evaluations comprised biochemical tests, color-Doppler ultrasonogrephy (US), semi-quantitative elastography-US (USE), contrast-enhanced US (CEUS), and mutation analysis from cytological slides. Thyroid malignancy was the final diagnosis for 19% of surgically- treated nodules. No statistically significant difference in the risk of malignancy was found between Thy 3a (26%) and Thy 3f (14%) nodules. Histology of the Thy 3a and Thy 3f nodules showed a higher incidence of Hurtle cell adenomas in Thy 3f (29%) than in Thy 3a (3%) nodules (P=0.01). The only pre-surgical difference concerned the BRAF V600E mutation, which was positive in some Thy 3a but not in any Thy 3f nodules (P=0.04). Receiver-operating characteristic (ROC) analysis was used to obtain cut-off values from US (score), USE (ELX 2/1 strain index), and CEUS (time-to- peak index and peak index) data. The cut-off values were similar for Thy 3a and Thy 3f nodules. Data showed that malignancy can be suspected if the US score is 〉2, ELX 1/2 strain index 〉1, time-to-peakindex 〉1, and peak index 〈1. In a sub-group of 24 revised nodules (12 Thy 3a and 12 Thy 3f) with histology as a reference, the diagnostic power of cumulative pre-surgical analysis by means of US, USE, and CEUS showed high positive and negative predictive values (83% and 100%, respectively) for the presence of malignancy in Thy 3a and Thy 3f nodules. In conclusion, in our series of revised Thy 3 nodules, malignancy was low and displayed no significant differences between Thy 3a and Thy 3f categories. The use of cut-offs based on histology as a reference could reduce surgery. Our data support the conviction that, in mutation-negative Thy 3a and Thy 3f nodules, observation should be the first choice when not all instrumental results are suspect.