I have been given a .CSV file containing hundreds of thousands of inconsistently formatted values. The structure may resemble:
A, B, C, D
E, F
G, H, I, J, K, L, M, N, O
P, Q, R, S
Etc.
All I need to do is: a) list the unique values and b) count the instances of each. I would be happy to do this in R, Excel or any other recommended tool.
I usually use something like Google Docs' =UNIQUE and =COUNT functions, but the spreadsheet is too large to load there. Oddly enough, I did not find exact equivalents in Excel.
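To make the goal concrete, here is a minimal sketch in Python of the result I am after. It assumes the values are comma-separated, that surrounding spaces are not significant, and that empty cells can be ignored. The function name `count_values` and the sample rows are made up for illustration.

```python
import csv
from collections import Counter

def count_values(lines):
    """Count every comma-separated value across rows of uneven length."""
    counts = Counter()
    # skipinitialspace drops the blank after each comma ("A, B" -> "A", "B")
    for row in csv.reader(lines, skipinitialspace=True):
        for value in row:
            value = value.strip()
            if value:  # ignore empty cells, e.g. from trailing commas
                counts[value] += 1
    return counts

# Hypothetical sample standing in for the real file
sample = ["A, B, C, D", "E, F", "A, B, A"]
counts = count_values(sample)

# The keys are the unique values; the numbers are their counts
for value, n in counts.most_common():
    print(value, n)
```

For the real file, the same function could presumably be fed an open file handle instead of the list, for example `with open("values.csv", newline="") as f: counts = count_values(f)`, where `values.csv` is a placeholder name. Reading row by row this way should avoid loading the whole spreadsheet at once.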
Any help is appreciated.