I tried the naive Bayes classifier, and it works very poorly. SVM works a little better, but it is still awful. Most of the articles I have read use SVM or naive Bayes with some variation (n-grams, POS tags, etc.), but when I try them they all give results close to 50%. The authors of those articles report 80% and higher, but I can't get the same accuracy on real data.
Are there any more powerful methods besides lexical analysis? As I used them, SVM and naive Bayes treat words as independent features. This representation is called bag of words. What if we assume that the words are related?
For example: use the Apriori algorithm to find that if a sentence contains both "bad" and "terrible", then it is negative with a probability of 70%. We could also use the distance between words, etc.
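To make the idea concrete, here is a rough sketch of what I mean, not a real implementation. The sentences and labels are made up, the names `rule_confidence`, `mine_pair_rules` and `word_distance` are my own, and it only covers the first two Apriori levels (frequent single words, then pairs built from them).

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data, invented purely for illustration.
labeled = [
    ("the food was bad and the service terrible", "neg"),
    ("terrible plot and bad acting", "neg"),
    ("not bad at all far from terrible", "pos"),
    ("bad weather but a great trip", "pos"),
    ("a terrible bad idea", "neg"),
]

def rule_confidence(data, itemset, label):
    """Return (confidence, support) of the rule: all words present -> label."""
    itemset = set(itemset)
    covered = [lab for text, lab in data if itemset <= set(text.split())]
    if not covered:
        return 0.0, 0
    return covered.count(label) / len(covered), len(covered)

def mine_pair_rules(data, label, min_support=2, min_conf=0.6):
    """Keep word pairs that are frequent and predict `label` often enough."""
    docs = [set(text.split()) for text, _ in data]
    word_counts = Counter(w for d in docs for w in d)
    # Apriori pruning: a pair can only be frequent if both words are frequent.
    frequent = sorted(w for w, c in word_counts.items() if c >= min_support)
    rules = {}
    for pair in combinations(frequent, 2):
        conf, support = rule_confidence(data, pair, label)
        if support >= min_support and conf >= min_conf:
            rules[pair] = (conf, support)
    return rules

def word_distance(text, w1, w2):
    """Smallest gap in tokens between w1 and w2, or None if either is missing."""
    tokens = text.split()
    pos1 = [i for i, t in enumerate(tokens) if t == w1]
    pos2 = [i for i, t in enumerate(tokens) if t == w2]
    if not pos1 or not pos2:
        return None
    return min(abs(i - j) for i in pos1 for j in pos2)

rules = mine_pair_rules(labeled, "neg")
print(rules[("bad", "terrible")])  # (confidence, support) on the toy data
```

On this toy data the pair "bad" + "terrible" covers four sentences, three of them negative. The third sentence ("not bad at all ...") shows where such a rule goes wrong, which is why I am also thinking about word distance and similar features.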
Is this a good idea or am I reinventing the wheel?
Neir0