Semantic markup of nouns and adjectives for the Electronic corpus of texts in Tuvan language

Bajlak Ch. Oorzhak, Arzhaana B. Khertek, Marija A. Kuzhuget, Valentina S. Ondar

Novye Issledovaniâ Tuvy, Vol 0, Iss 4 (2016)

Saved in:

Title

Authors

Bajlak Ch. Oorzhak, Arzhaana B. Khertek, Marija A. Kuzhuget, Valentina S. Ondar

Publication Year

2016

Source

Novye Issledovaniâ Tuvy, Vol 0, Iss 4 (2016)

Description

The article examines the progress of semantic markup of the Electronic corpus of texts in Tuvan language (ECTTL), which is another stage of adding Tuvan texts to the database and marking up the corpus. ECTTL is a collaborative project by researchers from Tuvan State University (Research and Education Center of Turkic Studies and Department of Information Technologies). Semantic markup of Tuvan lexis will come as a search engine and reference system which will help users find text snippets containing words with desired meanings in ECTTL. The first stage of this process is setting up databases of basic lexemes of Tuvan language. All meaningful lexemes were classified into the following semantic groups: humans, animals, objects, natural objects and phenomena, and abstract concepts. All Tuvan object nouns, as well as both descriptive and relative adjectives, were assigned to one of these lexico-semantic classes. Each class, sub-class and descriptor is tagged in Tuvan, Russian and English; these tags, in turn, will help automatize searching. The databases of meaningful lexemes of Tuvan language will also outline their lexical combinations. The automatized system will contain information on semantic combinations of adjectives with nouns, adverbs with verbs, nouns with verbs, as well as on the combinations which are semantically incompatible.

Document Type

article

Language

Russian

Publisher Information

Novye Issledovaniâ Tuvy, 2016.

Subject Terms

тувинский язык, электронная база данных, автоматизированная система поиска, лексический фонд, лексико-семантические классы и подклассы, дескрипторы, тэги, имя существительное, имя прилагательное, лексическая сочетаемость, Communities. Classes. Races, HT51-1595

Get it

Holdings