А corpus-based frequency statistic of kazakh language

Show simple item record

dc.contributor.author Altenbek, G.
dc.contributor.author Wang, X.L.
dc.date.accessioned 2018-02-01T06:26:21Z
dc.date.available 2018-02-01T06:26:21Z
dc.date.issued 2017-06-30
dc.identifier.citation Altenbek G. А corpus-based frequency statistic of kazakh language/G. Altenbek, X.L. Wang//Қарағанды универисетінің хабаршысы. ФИЛОЛОГИЯ Сериясы.=Вестник Карагандинского университета. Серия ФИЛОЛОГИЯ.=Bulletin of the Karaganda University. PHILOLOGY Series.-2017. №2. Р.14-23. ru_RU
dc.identifier.uri http://rep.ksu.kz/handle/data/2181
dc.description.abstract Kazakh language is an agglutinative language and it belongs to the Turkish Language group. Kazakh is a lowresource language by Arabic script in China, there are still many serious challenges in these research areas by natural language processing. This paper standardized the processing coding and storage scheme of Kazakh corpus, then constructed Kazakh Language Corpus (KzLC), which lay the foundation for further research on syntactic analysis etc. of Kazakh language processing. Aiming at frequency issue of Kazakh language, this paper focused on relation of Zipf's law of power law in Kazakh word, which is based on frequency statistic of the word. On the basis of frequency statistics of Kazakh words from Kazakh textbooks, this research came up worth word information analysis and statistic method based on corpus, which revealed language rule and phenomenon among Kazakh words information. ru_RU
dc.language.iso en ru_RU
dc.publisher KSU Publ. ru_RU
dc.relation.ispartofseries Қарағанды универисетінің хабаршысы. ФИЛОЛОГИЯ Сериясы.=Вестник Карагандинского университета. Серия ФИЛОЛОГИЯ.=Bulletin of the Karaganda University. PHILOLOGY Series.;
dc.subject Kazakh language ru_RU
dc.subject statistics ru_RU
dc.subject corpus linguistics ru_RU
dc.subject word frequency ru_RU
dc.subject Information retrieval ru_RU
dc.subject morphological analysis ru_RU
dc.title А corpus-based frequency statistic of kazakh language ru_RU
dc.title.alternative Қазақша сөздерді қолдану жиілігінің статистикалық зерттеулері ru_RU
dc.title.alternative Статистические исследования частотности употребления казахских слов ru_RU
dc.type Article ru_RU


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account