Uniprot: selected protein information from the Uniprot Knowledge Base
- The Uniprot data is downloaded from the Uniprot web site, www.uniprot.org.
Only one table is included within the Uniprot schema within CLSD: MAPPINGS.
It is derived from the file idmapping_selected.tab, downloaded from the directory
databases/uniprot/current_release/knowledgebase/idmapping/ on ftp://ftp.uniprot.org/pub/
The Uniprot "idmapping" data files are updated in conjunction with the UniProt Knowledgebase (UniProtKB). Whenever available, the mappings are extracted from the UniProtKB records.

Structure of the Uniprot Mappings table
- UNIPROT_AC
- UNIPROT_ID
- ENTREZGENE
- REFSEQ
- GIID
- PDB
- PFAM
- GO
- PIRSF
- IPI
- UNIREF_100
- UNIREF_90
- UNIREF_50
- UNIPARC
- PIR_PSD
- TAXON_ID
- OMIM
- UNIGENE
- ENSEMBLE_ID
- PMID
- EMBL_DNA_AC
- EMBL_PROTEIN_AC

The mappings table is included to assist in mapping among the various gene/protein ID systems. Two fields, GIID and EMBL_PROTEIN_AC, were deemed to hold too much information for the intended purpose here. Both return only "N/A" values.
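As a rough illustration of how the MAPPINGS table might be used to translate between ID systems, the Python sketch below runs a lookup against a stand-in table. Several things here are assumptions and not part of CLSD itself: an in-memory SQLite database replaces the real CLSD connection, only six of the columns are created, all columns are stored as text, the single row is typed in by hand for illustration, and the helper name map_id is hypothetical. The sketch also skips "N/A" values, since GIID and EMBL_PROTEIN_AC return nothing else.

```python
import sqlite3

# Assumption: an in-memory SQLite table stands in for the CLSD MAPPINGS table.
# Only a subset of the documented columns is created, all as TEXT.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE MAPPINGS ("
    "UNIPROT_AC TEXT, UNIPROT_ID TEXT, ENTREZGENE TEXT, "
    "TAXON_ID TEXT, GIID TEXT, EMBL_PROTEIN_AC TEXT)"
)

# One hand-typed illustrative row; GIID and EMBL_PROTEIN_AC hold "N/A".
conn.execute(
    "INSERT INTO MAPPINGS VALUES (?, ?, ?, ?, ?, ?)",
    ("P04637", "P53_HUMAN", "7157", "9606", "N/A", "N/A"),
)

# Column names cannot be bound as SQL parameters, so they are checked
# against a fixed whitelist before being placed in the query text.
KNOWN_COLUMNS = {
    "UNIPROT_AC", "UNIPROT_ID", "ENTREZGENE",
    "TAXON_ID", "GIID", "EMBL_PROTEIN_AC",
}


def map_id(connection, source_column, value, target_column):
    """Hypothetical helper: return target IDs for a source ID, skipping "N/A"."""
    if source_column not in KNOWN_COLUMNS or target_column not in KNOWN_COLUMNS:
        raise ValueError("unknown MAPPINGS column")
    query = f"SELECT {target_column} FROM MAPPINGS WHERE {source_column} = ?"
    rows = connection.execute(query, (value,))
    return [row[0] for row in rows if row[0] != "N/A"]


entrez_ids = map_id(conn, "UNIPROT_AC", "P04637", "ENTREZGENE")
gi_ids = map_id(conn, "UNIPROT_AC", "P04637", "GIID")
```

With the illustrative row above, the first lookup yields the Entrez Gene ID for the accession, while the GIID lookup comes back empty because the only stored value is "N/A". Against the real CLSD table, the same pattern would apply with whatever database driver and connection details CLSD actually uses.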




