UniProt

from Wikipedia, the free encyclopedia

UniProt ( uni versal prot ein database) is the largest bioinformatic database for proteins of all living beings and viruses , and contains information about protein function and structure as well as links to other relevant databases. It combines the data from Swiss-Prot , TrEMBL and Protein Information Resource (PIR) and is published at regular intervals.

What is UniProt made of?

UniProt is a consortium that came together in 2002 from the following components:

The EBI has a great source bioinformatic data SIB houses the server ( ExPASy ) ( Ex pert P rotein A nalysis Sy stem) which essential information for proteomics provide. PIR, operated by the National Biomedical Research Foundation (NBRF), is derived from the oldest protein sequence database ( Margaret Oakley Dayhoff's Atlas of Protein sequence and structure ).

The UniProt databases

Each member of the UniProt consortium "maintains" the databases. Until recently, EBI and SIB produced Swiss-Prot and TrEMBL together . The PIR provided the PIR-PSD (Protein Sequence Database) database.

Swiss-Prot is probably the best-known protein database due to its extensive cross-references, literature citations, the integration of other databases and its minimal redundancy. TrEMBL (Translated EMBL Nucleotide Sequence Data Library) is a computer-annotated addition to the Swiss-Prot database that contains all translations of EMBL nucleotide entries that are not yet integrated in Swiss-Prot. This enables fast data provision.

Organization of the UniProt databases

UniProt includes three elements that are specialized for a specific use:

  • The UniProt Knowledgebase ( UniProtKB ) is the central database for protein sequences. It provides information about the function and classifications of proteins and creates cross-references.
  • The UniProt Archive ( UniParc ) stores the entirety of all publicly available protein sequence data.
  • The UniProt Reference Clusters ( UniRef ) are databases that enable the user to search faster by preventing redundant links from appearing of available sequences. Among other things, identical sequences and pre-fragments (from different organisms) are combined in one data entry.

Web links

Individual evidence

  1. ^ The UniProt Consortium (2007): The Universal Protein Resource (UniProt). In: Nucleic Acids Res. Vol. 35, pp. D193-D197. PMID 17142230 doi : 10.1093 / nar / gkl929
  2. UniProt background information