Diacritic

Diacritical marks

accent

acute accent ( ˊ )
double acute accent ( ˝ )
grave accent ( ˋ )

breve ( ˘ )
caron / háček ( ˇ )
cedilla ( ¸ )
circumflex ( ˆ )
diaeresis ( ¨ )
dot ( · )

anunaasika / superdot ( ˙ )
anusvaara / subdot (  ̣ )

hook / dấu hỏi (  ̉ )
macron ( ˉ )
ogonek / "Polish hook" ( ˛ )
ring / kroužek ( ˚ )
spiritus asper ( ʽ )
spiritus lenis (  ʼ )
umlaut ( ¨ )

Marks sometimes used as diacritics

apostrophe ( ' )
bar ( | )
colon ( : )
comma ( , )
hyphen ( ˗ )
tilde ( ˜ )
titlo (  ҃ )

A diacritical mark or diacritic, sometimes called an accent mark, is a mark added to a letter to alter a word's pronunciation or to distinguish between similar words. The word derives from the Greek word διακρητικός (distinguishing). Note that diacritic is a noun and diacritical is the corresponding adjective.

A diacritical mark can appear above or below the letter to which it is added, or in some other position; however, note that not all such marks are diacritical. For example, in English, the tittle (dot) on the letters i and j is not a diacritical mark, but rather part of the letter itself. Further, a mark may be diacritical in one language, but not in another; for example, in French, e and ë are considered the same letter, while in German, they are considered to be the separate letters. (In the former case, the mark is a diaeresis, while in the latter, it is an umlaut.)

The main usage of a diacritic is to change the phonetic meaning of the letter, but the term is also used in a more general sense of changing the meaning of the letter or even the whole word. Examples are writing numerals in numeral systems, such as early Greek numerals and marking abbreviations with the titlo in old Slavic texts.

Contents

Types of diacritic

Marks that are sometimes diacritics, but also have other uses, are:

Usage

Non-diacritic usage

In all these cases they are not seen as additional marks over the vowel, but are actually a necessary part of these characters, as they represent entirely different sounds to the basic forms, like also for instance the Estonian "õ".

Non-alphabetic scripts

Some non-alphabetic scripts also employ symbols that function essentially as diacritics.

Alphabetization or collation

Different languages use different rules to put diacritic characters in alphabetical order. French treats letters with diacritical marks the same as the underlying letter for purposes of ordering and dictionaries. The same is true in German, and in cases where two words differ only by an umlaut, the word without it is sorted first in German dictionaries (eg "schon" and then "schön", or "fallen" and then "fällen"). However, when names are concerned (eg in phone books or in author catalogues in libraries), umlauts are often treated as combinations of the vowel with a suffixed 'e'; Austrian phone books now treat umlauts as separate letters (immediately following the underlying letter).

The Scandinavian languages, by contrast, treat the diacritic characters ä, ö and å as new and separate letters of the alphabet, and sort them after z. Usually ä is sorted as equal to æ (ash) and ö is sorted as equal to ø (o-slash). Other diacritically marked letters are treated as variants of the underlying letter.

Other languages treat diacritically marked letters as variants of the underlying letter, but alphabetize them following the unmarked letter. In Spanish ñ is considered a new letter different from n and placed between n and o, however, acute accents and diaeresis are ignored.

The technical term for alphabetization is collation.

See also: Alphabet, Latin alphabet

Generation with computers

Depending on the keyboard layout, which differs amongst countries, it is more or less easy to enter letters with diacritics on computers and typewriters. Some have their own keys, some are created by first pressing the key with the diacritic mark followed by the letter to place it on. Such a key is sometimes referred to as a dead key, as it produces no output of its own, but modifies the output of the key pressed after it.

On computers with the Microsoft Windows operating system, one can also enter each character of the current codepage, e.g. windows-1252, by holding the Alt key and entering the respective decimal position on the Num pad, e.g. Alt+0210 is Ò. Additionally, on Windows XP, it is possible to enter any Unicode character from the Basic Multilingual Plane (i.e. up to U+FFFF) by pressing Alt and then, with Alt still pressed, the plus sign and the digits of the Unicode number each after the other. Alt with plus, D and 2 yields U+00D2: Ò.

In modern Microsoft Windows operating systems, the keyboard layout US International allows one to type almost all diacritics directly: "+e gives ë, ~+o gives õ etc.. In addition to this, the layout provides many 'special characters' behind the AltGr modifier: AltGr+t is þ, AltGr+z is æ, etc..

Using the Microsoft Keyboard Layout Creator (MSKLC) people using Windows 2000, Windows XP or Windows Server 2003 can edit or create any keyboard layout.

On Apple Macintosh computers, there are keyboard shortcuts for the most common diacritics:

On computers it is also a matter of available codepages, whether you can use certain diacritics. Unicode tries to solve this problem, among others.

In GNOME applications (found on many GNU/Linux and UNIX computers) abritrary Unicode characters may be entered by holding down the ctrl and shift keys while typeing the hexadecimal codepoint. After releasing ctrl-shift the digits will be converted into the symbol. For example ctrl-shift 1E3 produces ǣ.

With Unicode it is also possible to combine diacritical marks with most characters.

See also

External links

See also: Diacritic, Abjad, Abugida, Acute, Acute accent, Alphabet, Angstrom