I am trying to apply linear discriminant analysis to a problem with two classes. As I understand it, the LDA assumes that both classes have the same covariance matrix, and then model probability as a Gaussian distribution in various ways.
Another classifier I tried is naive Bayesian. It ignores any correlation between predictor variables.
Now I don’t understand what happens when I apply a PCA to a dataset. By its very definition, the PCA algorithm rotates the data so that the covariance matrix is diagonal. Nothing is lost in rotation, but since the covariance matrix is now diagonal, will the naive Bayesian language not be as good as the LDA, or even better, since the LDA will have much more parameters for evaluation? However, to my knowledge, Naive Bayes is superior to the LDA with or without a PCA.
Indeed, the data is exactly the same as long as I use all the basic components that tell me that the result really should be what it is. However, the covariance matrix is diagonal ... brain crisis
Can someone explain this to me? I hope that I have formulated my question quite clearly. Thank!
source
share