Skip to content

mcanalesmayo/LanguageModel-NGram

Repository files navigation

LanguageModel-NGram

Description

N-gram generalized solutions have O(K^N) storage complexity, where K = number of words, and N = complexity of the Markov chain. Therefore:

  • The ideal solution has O(K^K), which makes it computationally impossible to work
  • Bigrams have O(K^2) complexity
  • Trigrams have O(K^3) complexity
  • ... and so on

The aim of this program is to try the performance of bigrams and trigrams in language modeling problems.

References

If you use this code, please reference: Javier Civera - [email protected]

About

Measuring the performance of bigrams and trigrams in language modeling problems

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages