UniProt
UniProt ( uni versal prot ein database) is the largest bioinformatic database for proteins of all living beings and viruses , and contains information about protein function and structure as well as links to other relevant databases. It combines the data from Swiss-Prot , TrEMBL and Protein Information Resource (PIR) and is published at regular intervals.
What is UniProt made of?
UniProt is a consortium that came together in 2002 from the following components:
- the European Bioinformatics Institute (EBI)
- the Swiss Institute of Bioinformatics (SIB)
- of the Protein Information Resource (PIR)
The EBI has a great source bioinformatic data SIB houses the server ( ExPASy ) ( Ex pert P rotein A nalysis Sy stem) which essential information for proteomics provide. PIR, operated by the National Biomedical Research Foundation (NBRF), is derived from the oldest protein sequence database ( Margaret Oakley Dayhoff's Atlas of Protein sequence and structure ).
The UniProt databases
Each member of the UniProt consortium "maintains" the databases. Until recently, EBI and SIB produced Swiss-Prot and TrEMBL together . The PIR provided the PIR-PSD (Protein Sequence Database) database.
Swiss-Prot is probably the best-known protein database due to its extensive cross-references, literature citations, the integration of other databases and its minimal redundancy. TrEMBL (Translated EMBL Nucleotide Sequence Data Library) is a computer-annotated addition to the Swiss-Prot database that contains all translations of EMBL nucleotide entries that are not yet integrated in Swiss-Prot. This enables fast data provision.
Organization of the UniProt databases
UniProt includes three elements that are specialized for a specific use:
- The UniProt Knowledgebase ( UniProtKB ) is the central database for protein sequences. It provides information about the function and classifications of proteins and creates cross-references.
- The UniProt Archive ( UniParc ) stores the entirety of all publicly available protein sequence data.
- The UniProt Reference Clusters ( UniRef ) are databases that enable the user to search faster by preventing redundant links from appearing of available sequences. Among other things, identical sequences and pre-fragments (from different organisms) are combined in one data entry.
Web links
Individual evidence
- ^ The UniProt Consortium (2007): The Universal Protein Resource (UniProt). In: Nucleic Acids Res. Vol. 35, pp. D193-D197. PMID 17142230 doi : 10.1093 / nar / gkl929
- ↑ UniProt background information