Gene Ontology (GO) is an international bioinformatics initiative to standardize part of the vocabulary of the life sciences. Its result is the ontology database of the same name, which is now used worldwide by many biological databases and is constantly evolving. The initiative also assigns GO terms to individual genes and their proteins (annotation) and provides software for working with the ontology.

The majority of the institutions participating in GO are American and are supported by governments and one company (AstraZeneca). GO is primarily in English, is species-neutral and is freely available. It is part of a larger project, the Open Biomedical Ontologies.

Database and terms

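GO is a biomedical ontology that covers three areas: “Cellular Component”, “Biological Process” and “Molecular Function”. Each term consists of a name, a numeric identifier and associated data. The ontology has the topology of a directed acyclic graph: a term may have more than one parent term, but following parent links never leads back to the starting term. The entry for lactase activity serves as an example: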


id: GO:0000016
name: lactase activity
namespace: molecular_function
def: "Catalysis of the reaction: lactose + H2O = D-glucose + D-galactose." [EC:]
synonym: "lactase-phlorizin hydrolase activity" BROAD [EC:]
synonym: "lactose galactohydrolase activity" EXACT [EC:]
xref: EC:
xref: MetaCyc:LACTASE-RXN
xref: Reactome:20536
is_a: GO:0004553 ! hydrolase activity, hydrolyzing O-glycosyl compounds

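An entry like the one above can be read by a program with very little code. The following is a minimal sketch, not the official GO tooling. It assumes the usual OBO flat-file layout in which each entry is introduced by a `[Term]` line (not shown in the excerpt above), and it handles only simple `tag: value` lines. The second stanza is a stub whose name is taken from the `is_a` comment of the lactase entry; the function names `parse_obo_terms` and `ancestors` are illustrative.

```python
from collections import defaultdict

# Shortened copy of the lactase entry plus a stub for its parent term.
OBO_TEXT = """\
[Term]
id: GO:0000016
name: lactase activity
namespace: molecular_function
is_a: GO:0004553 ! hydrolase activity, hydrolyzing O-glycosyl compounds

[Term]
id: GO:0004553
name: hydrolase activity, hydrolyzing O-glycosyl compounds
namespace: molecular_function
"""


def parse_obo_terms(text):
    """Return {term_id: {tag: [values]}} for every [Term] stanza in text."""
    terms = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("["):
            # A new stanza begins; only [Term] stanzas are collected.
            current = defaultdict(list) if line == "[Term]" else None
            continue
        if not line or current is None:
            continue
        tag, _, value = line.partition(":")
        tag = tag.strip()
        # Simplification: drop a trailing "! comment" and ignore quoting rules.
        value = value.split(" ! ")[0].strip()
        current[tag].append(value)
        if tag == "id":
            terms[value] = current
    return terms


def ancestors(terms, term_id):
    """Collect every term reachable from term_id by following is_a links."""
    seen = set()
    stack = [term_id]
    while stack:
        for parent in terms.get(stack.pop(), {}).get("is_a", []):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


terms = parse_obo_terms(OBO_TEXT)
print(terms["GO:0000016"]["name"])            # ['lactase activity']
print(sorted(ancestors(terms, "GO:0000016")))  # ['GO:0004553']
```

Because a term can have several parents, `ancestors` keeps a set of visited terms so that a parent reached along two different paths is reported only once.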


The Gene Ontology, like other ontologies, is an attempt to represent biological knowledge in a clear, structured way. Such a representation, provided it aims to be optimal, has many uses beyond standardizing language, including in publishing and library science. The structured representation also allows software that draws on biological and clinical knowledge to answer questions and analyze experimental data (logical reasoning, data mining).

The most important tools for browsing GO entries are the ontology editor OBO-Edit and the browser AmiGO, which is available as a website. Besides displaying the ontology, OBO-Edit provides tools for querying and filtering its contents.

When an experiment yields a large number of values, each assigned to an individual gene, data mining with different objectives and algorithms can combine those values with the Gene Ontology to draw nontrivial conclusions from the experiment. For example, cluster analysis algorithms can be used to determine which biological processes in cells are most affected by certain environmental toxins: the results of the corresponding microarray experiments are analyzed using the GO annotations of all genes in the organism concerned.
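One common way to connect a gene cluster with GO annotations is an over-representation test: for each term, ask whether the cluster contains more annotated genes than chance would predict. The sketch below uses a one-sided hypergeometric test. This is one possible technique, not necessarily the one used in any particular study, and it does not perform the clustering itself. The gene names, the term labels (`TERM:stress`, `TERM:transport`) and the function names are hypothetical, and no multiple-testing correction is applied.

```python
from math import comb


def hypergeom_tail(k, K, n, N):
    """P(X >= k) when n genes are drawn without replacement from N genes,
    K of which carry the term."""
    upper = min(K, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / comb(N, n)


def enrichment(cluster, annotations):
    """Rank terms by over-representation in the cluster.

    annotations maps each gene to its set of term ids; the annotated genes
    form the background. Returns (p_value, term, overlap) tuples sorted by
    ascending p-value.
    """
    background = set(annotations)
    cluster = set(cluster) & background
    N, n = len(background), len(cluster)

    # Invert the mapping: term -> set of genes annotated with it.
    term_genes = {}
    for gene, gene_terms in annotations.items():
        for term in gene_terms:
            term_genes.setdefault(term, set()).add(gene)

    results = []
    for term, genes in term_genes.items():
        k = len(genes & cluster)
        if k:
            results.append((hypergeom_tail(k, len(genes), n, N), term, k))
    return sorted(results)


# Hypothetical toy data: eight genes, two terms, one cluster of three genes.
annotations = {
    "g1": {"TERM:stress"},
    "g2": {"TERM:stress"},
    "g3": {"TERM:stress", "TERM:transport"},
    "g4": {"TERM:stress"},
    "g5": {"TERM:transport"},
    "g6": {"TERM:transport"},
    "g7": {"TERM:transport"},
    "g8": {"TERM:transport"},
}
cluster = {"g1", "g2", "g3"}

results = enrichment(cluster, annotations)
for p_value, term, overlap in results:
    print(f"{term}: overlap={overlap}, p={p_value:.4f}")
```

In this toy data all three cluster genes carry `TERM:stress`, so that term ranks first. With real annotations, the p-values would normally be corrected for the number of terms tested.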

