You can find theoretical background on using cross-entropy with language models in various textbooks, for example "Speech and Language Processing" by Jurafsky and Martin, pages 116-118 in the second edition. As for concrete usage, most language modeling tools do not report cross-entropy directly but rather perplexity, which is the exponential of the cross-entropy (2^H when the cross-entropy H is measured in bits). Perplexity, in turn, can be used to classify documents. See, for example, the documentation for the "evallm" command in the SLM toolkit, the Carnegie Mellon University language modeling tools (http://www.speech.cs.cmu.edu/SLM/toolkit_documentation.html).
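To make the idea concrete, here is a minimal sketch in Python. It does not use the CMU toolkit or its "evallm" command; it is a toy unigram model with add-one smoothing, and the function names, the two class labels and the training sentences are all made up for illustration. It trains one model per class, computes the cross-entropy of a new document under each model, turns that into perplexity, and picks the class whose model is least "surprised" by the document.

```python
import math
from collections import Counter

def train_unigram(tokens):
    """Return (token counts, total token count) for a unigram model."""
    counts = Counter(tokens)
    return counts, sum(counts.values())

def cross_entropy(model, tokens, vocab_size):
    """Average negative log2 probability per token, in bits.

    Uses add-one (Laplace) smoothing so unseen tokens get nonzero probability.
    """
    counts, total = model
    log_prob = 0.0
    for tok in tokens:
        p = (counts[tok] + 1) / (total + vocab_size)
        log_prob += math.log2(p)
    return -log_prob / len(tokens)

def perplexity(model, tokens, vocab_size):
    """Perplexity is 2 raised to the cross-entropy measured in bits."""
    return 2 ** cross_entropy(model, tokens, vocab_size)

def classify(models, tokens, vocab_size):
    """Pick the label whose model gives the document the lowest perplexity."""
    return min(models, key=lambda label: perplexity(models[label], tokens, vocab_size))

# Hypothetical toy training data, one short "corpus" per class.
training = {
    "sports": "the team won the match and the fans cheered".split(),
    "finance": "the bank raised the interest rate and the market fell".split(),
}

# Shared vocabulary size, plus one slot reserved for unknown tokens.
V = len({tok for toks in training.values() for tok in toks}) + 1

models = {label: train_unigram(toks) for label, toks in training.items()}

doc = "the market fell".split()
for label, model in models.items():
    print(label, perplexity(model, doc, V))
print("predicted:", classify(models, doc, V))
```

With real data you would want a proper n-gram model and better smoothing than add-one, but the decision rule stays the same: the class whose model has the lowest perplexity on the document wins.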
good luck :)