This study analyzes 54,036 homologous families and identifies 5,326 candidate families. Intra- and inter-phylum families are examined, including the NFn, NFp and non-NF classes. Using MCL clustering (IF=1.8) and HMM screening against 8,296 Pfam A families on 214,159 EST polypeptides and full-length proteins, the analysis yields 1,593 novel families after redundancy removal (10% overlap). The findings contribute to the understanding of protein diversity and functional similarity among taxa.
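To make the clustering step concrete, here is a minimal sketch of Markov clustering (MCL) in plain Python. It is an illustration of the general technique only, not the authors' pipeline: the slide gives just the inflation factor (1.8), so the toy graph, the function names `mcl` and `normalise_columns`, the self-loop weighting, and the thresholds are all assumptions made for the example.

```python
def normalise_columns(m):
    """Rescale every column of a square matrix so that it sums to 1."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] for j in range(n)] for i in range(n)]


def mcl(adjacency, inflation=1.8, max_iter=100, tol=1e-9):
    """Toy Markov clustering on a small symmetric similarity matrix.

    Returns a sorted list of clusters, each a sorted list of node indices.
    """
    n = len(adjacency)
    # Add self-loops (assumed weight 1.0), then make the columns stochastic.
    m = [[float(adjacency[i][j]) + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    m = normalise_columns(m)
    for _ in range(max_iter):
        # Expansion: square the matrix (two-step random walks).
        expanded = [[sum(m[i][k] * m[k][j] for k in range(n)) for j in range(n)]
                    for i in range(n)]
        # Inflation: raise entries to a power and renormalise,
        # which strengthens strong links and weakens weak ones.
        inflated = normalise_columns([[x ** inflation for x in row]
                                      for row in expanded])
        delta = max(abs(inflated[i][j] - m[i][j])
                    for i in range(n) for j in range(n))
        m = inflated
        if delta < tol:
            break
    # Each row that keeps non-zero mass lists the members of one cluster.
    clusters = {tuple(j for j in range(n) if row[j] > 1e-6) for row in m}
    clusters.discard(())
    return sorted(list(c) for c in clusters)


# Hypothetical toy graph: two disconnected triangles of similar sequences.
toy_graph = [
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
]
print(mcl(toy_graph, inflation=1.8))  # [[0, 1, 2], [3, 4, 5]]
```

On real data the edge weights would come from sequence-similarity scores rather than 0/1 values, and a higher inflation value generally produces finer-grained clusters.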
Flowchart of the family-identification pipeline (labels recovered from the slide figure):
EST polypeptides + full-length proteins (214,159)
MCL (IF=1.8)
Homologous families: 54,036
NR-nonNem similarity? (1.0e-03): Yes / No
Candidate families: 5,326
HMM screening, Pfam A (8,296): Yes / No
Remove redundancy (10% overlap)
Novel families: 1,593
Intra-phylum: NFn 462, NFp 296, 3 sp.
Inter-phylum: non-NF (835)
Other counts shown in the figure: 3,387; 346
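The counts in the figure can be cross-checked with a few lines of arithmetic. The sketch below assumes that NFn, NFp and non-NF together partition the novel families. That is a reading of the figure layout rather than something the slide states, although the three numbers do add up to the reported total. The variable names are illustrative.

```python
# Counts taken from the flowchart labels.
homologous_families = 54_036
candidate_families = 5_326
novel_families = 1_593

# Assumed partition of the novel families (inferred from the figure layout).
novel_classes = {"NFn": 462, "NFp": 296, "non-NF": 835}

class_total = sum(novel_classes.values())
print(class_total == novel_families)  # True: 462 + 296 + 835 = 1,593

# Share of families surviving each stage, as percentages.
candidate_share = 100 * candidate_families / homologous_families
novel_share = 100 * novel_families / candidate_families
print(f"candidates / homologous: {candidate_share:.1f}%")
print(f"novel / candidates:      {novel_share:.1f}%")
```

Roughly one in ten homologous families is retained as a candidate, and about three in ten candidates remain as novel families.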