摘要
The traces of immunoglobulin domain similarities were searched in sequences of higher plants using bioinformatic tools to look for possible early phylogenic structural relationships. 280 thousand sequence IDs, obtained by sixteen types of primary BLAST searches, were differently processed by seventeen selection procedures and an anti-redundant sequence-related approach using JavaScript, PHP, Windows programs and conserved domain searches by means CDD. The resulting seventeen sets of records describing conserved domain similarities of 1323 different sequence IDs yielded a set of next generation (final set) comprising forty-nine records containing superior (“non-refutable”) conserved immunoglobulin domain similarities. The selected sets and their subsets were mapped and subsequently statistically compared with respect to immunoglobulin-related as well as other reciprocal domain linkages. The list of frequently occurring conserved domain similarities concerned first of all domains important for plant and metazoan immunity, e.g. tyrosine kinases accompanying variable immunoglobulin domains in early Metazoa, toll-like receptors, lectin and leucine-rich repeat domains. Detailed description of immunoglobulin domain similarities occurring in the final set was completed by fold analysis of the restricted segments. The data were then discussed with respect to i) immunoglobulin fold evolution, ii) possible structural importance of domains cd14066 (IRAK) and PLN00113 (LRR-associated kinase) for deep evolution of catalytic serine/threonine/tyrosine kinase domains, iii) interatomic, structural and specificity standpoints and iv) traces of antibody-like phosphorylation sites described in our previous paper.
The traces of immunoglobulin domain similarities were searched in sequences of higher plants using bioinformatic tools to look for possible early phylogenic structural relationships. 280 thousand sequence IDs, obtained by sixteen types of primary BLAST searches, were differently processed by seventeen selection procedures and an anti-redundant sequence-related approach using JavaScript, PHP, Windows programs and conserved domain searches by means CDD. The resulting seventeen sets of records describing conserved domain similarities of 1323 different sequence IDs yielded a set of next generation (final set) comprising forty-nine records containing superior (“non-refutable”) conserved immunoglobulin domain similarities. The selected sets and their subsets were mapped and subsequently statistically compared with respect to immunoglobulin-related as well as other reciprocal domain linkages. The list of frequently occurring conserved domain similarities concerned first of all domains important for plant and metazoan immunity, e.g. tyrosine kinases accompanying variable immunoglobulin domains in early Metazoa, toll-like receptors, lectin and leucine-rich repeat domains. Detailed description of immunoglobulin domain similarities occurring in the final set was completed by fold analysis of the restricted segments. The data were then discussed with respect to i) immunoglobulin fold evolution, ii) possible structural importance of domains cd14066 (IRAK) and PLN00113 (LRR-associated kinase) for deep evolution of catalytic serine/threonine/tyrosine kinase domains, iii) interatomic, structural and specificity standpoints and iv) traces of antibody-like phosphorylation sites described in our previous paper.