APPLICATION OF THE N-GRAM MODEL TO THE KARAKALPAK LANGUAGE

Toxirov F.J.

Abstract

Most automatic speech recognition and text processing systems use statistical models called n-grams that specify the probability of occurrence for different sequences of words in a language. This article discusses the application of the n-gram model to the text in the Karakalpak language in order to analyze individual Karakalpak words or phrases and in which part of the sentence a given word occurs.