X-SAMPA
X-SAMPA [ ˈɛksˌsæmpə ] (Abbreviation for Extended Speech Assessment Methods Phonetic Alphabet [ ɪkˈstɛndɪd spiːtʃ əˈsɛsmənt ˌmɛθədz fəʊˈnɛtɪk ˈælfəbɛt ]) is a further development of SAMPA , a phonetic alphabet that can be represented by 7-bit ASCII . X-SAMPA was developed in 1995 by the British phonetician John C. Wells in order to bring the different SAMPA variants for different languages to a common denominator, as well as to be able to represent all characters that occur in the International Phonetic Alphabet . Despite the spread of Unicode for the purpose of creating and archiving print products, X-SAMPA is increasingly being used in phonetic and language technology contexts around the world, especially when it comes to the automatic processing of large amounts of transcriptions in local languages for statistical purposes, speech input / output To develop systems and to create lexicographic databases.
literature
Complete definitions and application examples of SAMPA and X-SAMPA can be found in the following manuals:
- Gibbon, Dafydd, Roger Moore and Richard Winski (Eds.): Handbook of Standards and Resources for Spoken Language Systems. Mouton de Gruyter, Berlin 1997
- Gibbon, Dafydd, Inge Mertins and Roger Moore (Eds.): Handbook of Multimodal and Spoken Language Systems: Resources, Terminology and Product Evaluation. Kluwer Academic Publishers, Boston, etc. 2000
Web links
- Publication of X-SAMPA (English)
- X-Sampa online conversion to Unicode IPA (by Henrik Theiling) (English and partly German)
- IPA-SAMPA table with sound recordings from the Phonetic Laboratory of the University of Turin, Italy (English and Italian)