I have text input from a mobile device. It contains emoji. In C #, I have text like
Text 🍫🌐 text
Simply put, I want the output to be
Text text
I am trying to just remove all such emojis from text with rejex .. except, I'm not sure how to convert this emoji to a unicode sequence. How should I do it?
edit:
I am trying to save user login in mysql. It seems that mysql UTF8 does not really support Unicode characters, and the correct way to do this would be to change the schema , but I don't think this is an option for me. Therefore, I am trying to simply delete all emoji characters before storing it in the database.
This is my schema for the corresponding column:

I use Nhibernate as my ORM, and the generated insert request looks like this:
Insert into `Content` (ContentTypeId, Comments, DateCreated)
values (?p0, ?p1, ?p2);
?p0 = 4 [Type: Int32 (0)]. ?p1 = 'Text 🍫🌐 text' [Type: String (20)], ?p2 = 19/01/2015 10:38:23 [Type: DateTime (0)]
mysql, :
1 warning(s): 1366 Incorrect string value: '\xF0\x9F\x98\x80 t...' for column 'Comments' at row 1 0.000 sec
, , .
