HomoloGene is a system for automated detection of homologs among the annotated genes of several completely sequenced eukaryotic genomes.
HomoloGene Release 67 Statistics
Initial numbers of genes from complete genomes, numbers of genes placed in a homology group, and the numbers of groups for each species.
Species Number of Genes HomoloGene
Input Grouped groups
M.oryzae 12,832
7,330 6,403
M.mulatta 22,566
17,151 16,706
H.sapiens 19,217
18,867 18,473
P.troglodytes 19,655
17,367 17,078
C.lupus 19,176
17,800 17,186
B.taurus 21,121
18,771 17,458
M.musculus 21,432
* 20,744 18,858
R.norvegicus 21,943
19,868 17,882
G.gallus 16,731
14,171 13,149
D.rerio 26,326
20,504 14,183
D.melanogaster 13,555
9,252 7,713
A.gambiae 12,460
8,827 7,500
C.elegans 20,185
8,698 4,811
S.pombe 5,018
3,238 2,946
S.cerevisiae 5,882
4,854 4,373
K.lactis 4,978
4,404 4,338
E.gossypii 4,722
3,932 3,888
N.crassa 9,821
6,301 6,158
A.thaliana 27,388
19,965 11,078
O.sativa 28,456
17,086 10,431
'*' indicates organisms where new genome annotation data is used in this build.
Last updated on: Thu Dec 13 2012
We have recently adopted a new build procedure that makes use of amino acid sequence searching (blastp) to find more distant relationships, but the procedure still refers to the DNA sequence for computation of some of the statistics. The matching strategy is guided by the taxonomic tree such that more closely related organisms are compared first. Moreover, HomoloGene entries now include paralogs in addition to orthologs.
What's New
HomoloGene 67 includes genes from the the latest mouse genome release (GRCm38).
Related Resources
Protein Clusters
List of Genomes
Taxonomy