I am writing a hashtag scraper for facebook, and every regular expression that I come across to get hashtags seems to include punctuation marks as well as alphanumeric characters. Here is an example of what I would like:
Hello World! I am a # m4king scraper #fac_book and would like a nice regular #expression.
I would like to match world, m4king, facand expression(note that I would like it to be turned off if it has reached the punctuation, including spaces). It would be nice if it did not include a hash symbol, but this is not very important.
Just make it important, I will use the ruby string scan method to capture more than one tag.
Thanks for the heaps in advance!
source
share