Package pology :: Package lang :: Package sr :: Module reduce

Module reduce

Reductions of Serbian text convenient in various special uses.

Author: Chusslove Illich (Часлав Илић) <caslav.ilic@gmx.net>

License: GPLv3

Functions

words_ec(text)
Reduce text to space-separated Ekavian Cyrillic words [type F1A hook].

words_ec_lw(text)
Reduce text to space-separated Ekavian Cyrillic words, in lower case [type F1A hook].

words_ic(text)
Reduce text to space-separated Ijekavian Cyrillic words [type F1A hook].

words_ic_lw(text)
Reduce text to space-separated Ijekavian Cyrillic words, in lower case [type F1A hook].

words_ic_lw_dlc(text)
Reduce text to space-separated Ijekavian Cyrillic words containing at least three letters, one of which is 'е' or 'и', in lower case [type F1A hook].

Variables
	__package__ = `'pology.lang.sr'`

Function Details

words_ec(text)

Reduce text to space-separated Ekavian Cyrillic words [type F1A hook].

Words containing only Serbian Cyrillic characters are extracted, sorted, and joined by spaces into a string. In case the text contains dialect and script hybridization, it is passed through hictoec() to resolve it into clean Ekavian Cyrillic. In case the text contains accent marks, it is passed through remove_accents() to remove them.

words_ec_lw(text)

Reduce text to space-separated Ekavian Cyrillic words, in lower case [type F1A hook].

Like words_ec, but the result is lowercased.

words_ic(text)

Reduce text to space-separated Ijekavian Cyrillic words [type F1A hook].

Like words_ec, but if the text was hybrid it is resolved into clean Ijekavian Cyrillic (see hictoic()).

words_ic_lw(text)

Reduce text to space-separated Ijekavian Cyrillic words, in lower case [type F1A hook].

Like words_ic, but the result is lowercased.

words_ic_lw_dlc(text)

Reduce text to space-separated Ijekavian Cyrillic words containing at least three letters, one of which is 'е' or 'и', in lower case [type F1A hook].

Like words_ic, but the result is lowercased.