PostgreSQL: Difference between revisions

From Wikipedia, the free encyclopedia

Revision as of 20:37, 20 July 2007

PostgreSQL
Developer(s): PostgreSQL Global Development Group
Stable release: 8.2.4 / April 23, 2007
Operating system: Cross-platform
Type: ORDBMS
License: BSD
Website: www.postgresql.org

PostgreSQL is a free software object-relational database management system (ORDBMS), released under a BSD-style license. It offers an alternative to other database systems. Similar to other free software projects such as Apache, GNU/Linux, and MediaWiki, PostgreSQL is not controlled by any single company, but relies on a global community of developers and companies to develop it.

PostgreSQL's unusual-looking name makes some readers pause when trying to pronounce it, especially those who pronounce SQL as "sequel". PostgreSQL's developers pronounce it /poːst ɡɹɛs kjuː ɛl/. It is also common to hear it abbreviated as simply "postgres", which was its original name. The name refers to the project's origins as a "post-Ingres" database, the original authors having also developed the Ingres database.

History

PostgreSQL has had a lengthy evolution, starting with the Ingres project at UC Berkeley. The project leader, Michael Stonebraker, had left Berkeley to commercialize Ingres in 1982, but eventually returned to academia. After returning to Berkeley in 1985, Stonebraker started a post-Ingres project to address the problems with contemporary database systems that had become increasingly clear during the early 1980s. While they share many of the same ideas, the code bases of PostgreSQL and Ingres started (and remain) completely separated.

The resulting project, named POSTGRES, aimed to introduce the minimum number of features needed to add complete support for types. These features included the ability to define types, but also the ability to fully describe relationships – something used widely before this time but maintained entirely by the user. In POSTGRES the database "understood" relationships, and could retrieve information in related tables in a natural way using rules.

Starting in 1986 the team released a number of papers describing the basis of the system, and by 1988 the project had a prototype version up and running. The team released version 1 to a small number of users in June 1989, followed by version 2 with a re-written rules system in June 1990. Version 3, released in 1991, re-wrote the rules system again and also added support for multiple storage managers and an improved query engine. By 1993 a huge number of users existed and began to overwhelm the project with requests for support and features. After releasing version 4, primarily as a cleanup, the project ended.

Although the POSTGRES project had officially ended, the BSD license (under which Berkeley had released POSTGRES) enabled open-source developers to obtain copies and to develop the system further. In 1994 two UC Berkeley graduate students, Andrew Yu and Jolly Chen, added a SQL language interpreter to replace the earlier Ingres-based QUEL system, creating Postgres95. The code was subsequently released to the web to find its own way in the world.

In July 1996, Marc Fournier at Hub.Org Networking Services provided the first non-university development server for the open source development effort. Together with Bruce Momjian and Vadim B. Mikheev, he began work to stabilize the code inherited from UC Berkeley, and the first open source version was released on August 1, 1996.

In 1996 the project was renamed: to reflect the database's new SQL query language, Postgres95 became PostgreSQL. The first release under the PostgreSQL name was version 6.0, in January 1997. Since then, a group of database developers and volunteers from around the world, coordinating via the Internet, have maintained the software.

Although the license allowed for the commercialization of Postgres, the Postgres code did not develop commercially with the same rapidity as Ingres — somewhat surprisingly considering the advantages Postgres offered. The main offshoot originated when Paula Hawthorn (an original Ingres team member who moved from Ingres) and Michael Stonebraker formed Illustra Information Technologies to commercialize Postgres.

In 2000, former Red Hat investors put together a company known as Great Bridge to commercialize PostgreSQL and compete against commercial database vendors. Great Bridge sponsored several PostgreSQL developers and donated many resources back to the community, but by late 2001 the company had closed its doors, citing tough competition from companies like Red Hat as well as poor market conditions.

In 2001, Command Prompt, Inc. released Mammoth PostgreSQL, the oldest surviving commercial PostgreSQL distribution. They continue to actively support the PostgreSQL community through developer sponsorships and projects including PL/Perl, PL/php, and hosting of community projects such as the PostgreSQL Build Farm.

In January 2005, PostgreSQL received backing from another database vendor. Pervasive Software, well known for its Btrieve product, which was ubiquitous on the Novell NetWare platform, announced commercial support and community participation. Although it achieved success for a time, Pervasive left the PostgreSQL support market in July 2006.[1]

In mid-2005 two other companies announced plans to commercialize PostgreSQL with focus on separate niche markets. EnterpriseDB announced plans to focus on adding functionality to allow applications written to work with Oracle to be more readily run atop PostgreSQL. Greenplum contributed enhancements directed at data warehouse and business intelligence applications, notably including the BizGres project.

In October 2005, John Loiacono, executive vice-president of software at Sun Microsystems, commented that "We're not going to OEM Microsoft but we are looking at PostgreSQL right now,"[2] although no specifics were released at that time. By November 2005, Sun Microsystems had announced support for PostgreSQL.[3] As of June 2006, Sun Solaris 10 6/06 ships PostgreSQL.

As for the PostgreSQL project itself, it continues to make yearly major releases and minor "bugfix" releases, all available under the BSD license and based on contributions from commercial vendors, support companies, and open source hackers at large.

Features

Functions

Functions allow blocks of code to be executed by the server. Although these blocks can be written in SQL, the lack of basic programming operations, such as branching and looping, has driven the adoption of other languages inside of functions. Some of these languages can even execute inside of triggers. Functions in PostgreSQL can be written in a number of languages, including the native PL/pgSQL and others such as PL/Perl.

PostgreSQL supports row-returning functions, where the output of the function is a set of values which can be treated much like a table within queries.

Functions can be defined to execute with the privileges of either the caller or the user who defined the function. Functions are sometimes referred to as stored procedures, although there is a slight technical distinction between the two.

Indices

User-defined index methods can be created, or the built-in B-tree, hash and GiST indexes can be used. Indexes in PostgreSQL also support the following features:

  • PostgreSQL is capable of scanning indexes backwards when needed; a separate index is never needed to support ORDER BY field DESC.
  • Expression indexes can be created, which index the result of an expression or function instead of simply the value of a column.
  • Partial indexes, which only index part of a table, can be created by adding a WHERE clause to the end of the CREATE INDEX statement. This allows a smaller index to be created.
  • Bitmap index scans are supported as of version 8.1. A bitmap scan reads one or more indexes, builds a bitmap of the tuples that match each condition, and combines the bitmaps to find the tuples that satisfy the selection criteria as a whole. This provides a way of composing indexes together: on a table with 20 columns there are, in principle, 20! indexes that could be defined, which is far too many to actually use. If one index is created on each column, bitmap scans can combine those indexes at query time for whichever columns the query constrains.
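
The way bitmap scans compose single-column indexes can be sketched in a few lines of Python. This is a toy model under stated assumptions, not PostgreSQL's implementation: the table, the column names (city, status) and the helper functions build_index and bitmap_for are all hypothetical, and row positions stand in for tuple identifiers.

```python
# Toy model of a bitmap index scan; not PostgreSQL's actual implementation.
# Rows are identified by their position, a stand-in for tuple IDs.
rows = [
    {"city": "Oslo", "status": "active"},
    {"city": "Bergen", "status": "active"},
    {"city": "Oslo", "status": "closed"},
    {"city": "Oslo", "status": "active"},
]


def build_index(rows, column):
    """Map each value of `column` to the list of row positions holding it."""
    index = {}
    for pos, row in enumerate(rows):
        index.setdefault(row[column], []).append(pos)
    return index


def bitmap_for(index, value):
    """Return an integer whose bit i is set when row i matches `value`."""
    bitmap = 0
    for pos in index.get(value, []):
        bitmap |= 1 << pos
    return bitmap


# One single-column index per column, as described in the text.
city_index = build_index(rows, "city")
status_index = build_index(rows, "status")

# WHERE city = 'Oslo' AND status = 'active': AND the two bitmaps together.
combined = bitmap_for(city_index, "Oslo") & bitmap_for(status_index, "active")

# Visit only the rows whose bit is set, in physical order.
matches = [pos for pos in range(len(rows)) if (combined >> pos) & 1]
print(matches)  # [0, 3]
```

Using OR instead of AND on the two bitmaps would model a query that accepts either condition, which is why one index per column can serve many different column combinations.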

Triggers

Triggers are actions fired in response to SQL statements. For example, an INSERT statement might activate a trigger that checks whether the values being inserted are valid. Most triggers are activated only by INSERT or UPDATE statements.

Triggers are fully supported and can be attached to tables but not to views. Views can have rules, though. When several triggers are defined for the same event on the same table, they are fired in alphabetical order by name. In addition to calling functions written in the native PL/pgSQL, triggers can also invoke functions written in other languages like PL/Perl.
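
The validation example and the alphabetical firing order can be illustrated with a small Python model. It is only a sketch: the Table class, the trigger names and the row layout are hypothetical, and a real trigger would be declared with CREATE TRIGGER and run inside the database server.

```python
# Toy model of trigger dispatch; class and trigger names are hypothetical.
class Table:
    def __init__(self):
        self.rows = []
        self.insert_triggers = {}  # trigger name -> callable(row) -> row

    def create_trigger(self, name, func):
        self.insert_triggers[name] = func

    def insert(self, row):
        # Fire triggers in alphabetical order of their names.
        for name in sorted(self.insert_triggers):
            row = self.insert_triggers[name](row)
        self.rows.append(row)


fired = []


def check_price(row):
    """Validation trigger: reject rows with a negative price."""
    fired.append("a_check_price")
    if row["price"] < 0:
        raise ValueError("price must not be negative")
    return row


def stamp_currency(row):
    """Second trigger: add a default currency to the row."""
    fired.append("b_stamp_currency")
    return {**row, "currency": "USD"}


items = Table()
# Registered out of order on purpose; dispatch still sorts by name.
items.create_trigger("b_stamp_currency", stamp_currency)
items.create_trigger("a_check_price", check_price)

items.insert({"name": "widget", "price": 5})
print(fired)  # ['a_check_price', 'b_stamp_currency']

try:
    items.insert({"name": "broken", "price": -1})
except ValueError as exc:
    print("rejected:", exc)
```

Because the validation trigger sorts first, the invalid row is rejected before any later trigger runs and before the row is stored.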

MVCC

PostgreSQL manages concurrency through a system known as Multi-Version Concurrency Control (MVCC), which gives each user a "snapshot" of the database, allowing changes to be made without being visible to other users until a transaction is committed. This largely eliminates the need for read locks, and ensures the database maintains the ACID principles in an efficient manner.
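
The snapshot idea can be sketched with a simplified Python model. The names used here (RowVersion, created_by, deleted_by, visible, read) are assumptions made for illustration, and a snapshot is modelled simply as the set of transactions that had committed when it was taken; PostgreSQL's real visibility rules track considerably more state.

```python
# Simplified model of MVCC visibility; real PostgreSQL tracks more state
# (in-progress transaction lists, command IDs, and so on).
from dataclasses import dataclass
from typing import Optional


@dataclass
class RowVersion:
    value: str
    created_by: int                   # transaction ID that wrote this version
    deleted_by: Optional[int] = None  # transaction ID that superseded it


committed = set()  # IDs of committed transactions


def visible(version, snapshot):
    """Visible if the creator had committed when the snapshot was taken
    and the deleter (if any) had not."""
    created = version.created_by in snapshot
    deleted = version.deleted_by is not None and version.deleted_by in snapshot
    return created and not deleted


def read(versions, snapshot):
    return [v.value for v in versions if visible(v, snapshot)]


# Transaction 1 inserts a row and commits.
versions = [RowVersion("balance=100", created_by=1)]
committed.add(1)

# A reader takes a snapshot: the set of transactions committed so far.
reader_snapshot = frozenset(committed)

# Transaction 2 updates the row: the old version is marked as superseded
# and a new version is added. The reader is never blocked.
versions[0].deleted_by = 2
versions.append(RowVersion("balance=80", created_by=2))

print(read(versions, reader_snapshot))  # ['balance=100']

committed.add(2)
print(read(versions, reader_snapshot))       # ['balance=100']
print(read(versions, frozenset(committed)))  # ['balance=80']
```

The old snapshot keeps returning the original value even after transaction 2 commits, while a snapshot taken afterwards sees only the new version; neither reader needed a read lock.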

Rules

Rules allow the "query tree" of an incoming query to be rewritten. One common usage is to implement updatable views.

Data types

A wide variety of native data types are supported.

In addition, users can create their own data types which can usually be made fully indexable via PostgreSQL's GiST infrastructure.

Examples of these are the Geographic information system (GIS) data types from the PostGIS project for PostgreSQL.

User-defined objects

New types of almost all objects inside the database can be created.

Inheritance

Tables can be set to inherit their characteristics from a "parent" table. Data is shared between "parent" and "child" tables: rows inserted into or deleted from a child table correspondingly appear in or disappear from queries against the parent table. Adding a column to the parent table also causes that column to appear in the child table. This feature is not fully supported yet; in particular, some table constraints, such as foreign keys, do not apply across the inheritance hierarchy. This means that attempting to insert the id of a row from a child table into a table that has a foreign key constraint referencing the parent table will fail, because Postgres doesn't recognize that the id from the child table is also a valid id in the parent table.

Inheritance provides a way to map the features of generalization hierarchies depicted in Entity Relationship Diagrams (ERD) directly into the PostgreSQL database.

Other features

Add-ons

Benchmarks

Many informal performance studies of PostgreSQL have been done,[4] but the first industry-standard and peer-validated benchmark was completed in June 2007 using the Sun Java System Application Server 9.0 Platform Edition and PostgreSQL 8.2.[5] This result of 778.14 SPECjAppServer2004 JOPS@Standard compares favourably with the 874 JOPS@Standard achieved with Oracle 10 on an Itanium-based HP-UX system, reaching about 89% of that score.[4]

Prominent users

References

  • Matthew, Neil. Beginning Databases with PostgreSQL (Second ed.). ISBN 1-59059-478-9.
  • Gilmore, W. Jason. Beginning PHP and PostgreSQL 8: From Novice to Professional. ISBN 1-59059-547-5.
  • Worsley, John C. Practical PostgreSQL. ISBN 1-56592-846-6.
  • Douglas, Korry. PostgreSQL. ISBN 0-672-32756-2.

Notes

  1. John Farr (2006-07-25). "Open letter to the PostgreSQL Community". Pervasive Software. Retrieved 2007-02-13.
  2. Rodney Gedda (2005-10-05). "Sun's software chief eyes databases, groupware". Computerworld. Retrieved 2007-02-13.
  3. "Sun Announces Support for Postgres Database on Solaris 10" (Press release). Sun Microsystems. 2005-11-17. Retrieved 2007-02-13.
  4. Josh Berkus (2007-07-06). "PostgreSQL publishes first real benchmark". Retrieved 2007-07-10.
  5. "SPECjAppServer®2004 Result". SPEC. 2007-07-06. Retrieved 2007-07-10.

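External links

About PostgreSQL

  • PostgreSQL Universe (http://www.pervasivepostgres.com/postgresql/pgd/index.asp): comprehensive link directory
  • PostgreSQL at the Open Directory Project (dmoz category Computers/Software/Databases/PostgreSQL)
  • Test_PGC (http://www.kenjennings.cc/st/stprgux.html#test_pgc): example embedded SQL/C program for PostgreSQL showing database operations and SQLSTATE testing

External PostgreSQL-related projects

The developers of PostgreSQL try to keep the system itself down to a set of "core" features, rather than encouraging extensions to be rolled into the main system. Here are places where "secondary" projects are managed:

  • SourceForge PostgreSQL-related projects (http://sourceforge.net/softwaremap/trove_list.php?form_cat=525)

PostgreSQL documentation

  • PostgreSQL FAQ (http://www.postgresql.org/docs/faqs.FAQ.html): Frequently Asked Questions
  • PostgreSQL Website (http://www.postgresql.org/)

Performance tuning documentation

  • PostgreSQL Performance Tuning (http://www.revsys.com/writings/postgresql-performance.html)
  • Tuning PostgreSQL for performance (http://www.varlena.com/varlena/GeneralBits/Tidbits/perf.html)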