Text normalization

Text normalization is a process by which text is transformed in some way to make it consistent in some way which it may not have been before. Text normalization is often performed before a text is processed in some way, such as generating synthesized speech, automated language translation, and storage in a database.

Examples of text normalization:

While this may be done manually, and usually is in the case of ad hoc and personal documents, many programming languages support mechanisms which enable text normalization.

External links

Missing image
IPA_lezh.PNG


This language-related article is a stub. You can help Wikipedia by expanding it.

See also: Text normalization, Automated language translation, Database, Language, Programming language, Speech synthesis