HomoloGene

HomoloGene comes from the National Center for Biotechnology Information, and is described by its creators thusly:

HomoloGene is a resource of curated and calculated orthologs for genes as represented by UniGene or by annotation of genomic sequences.
[online; accessed 6/26/2003; http://www.ncbi.nlm.nih.gov/homoloGene/]

Schema in CLSD

HomoloGene tables are found under the 'homologene' schema. The Match table contains the homology information. The Organism table contains statically-defined identifiers for the species represented in the Match table.

To facilitate searching for matches between specific organism pairs, the original HomoloGene data was updated so that the order of any particular homologous pair will always include the lower-numbered taxonomy_id first.

TableFieldTypeDescription
MATCH taxonomy_id1BIGINT 
taxonomy_id2BIGINT 
match_typeCHAR(1) 
locus_id1BIGINT 
homologene_id1BIGINT 
acc1VARCHAR(30) 
locus_id2BIGINT 
homologene_id2BIGINT 
acc2VARCHAR(30) 
percentage_matchVARCHAR(10) 
urlVARCHAR(500) 
ORGANISM taxonomy_idBIGINT 
nameVARCHAR(30)