I have this string in my text file: ├░┬č┬Ź┬ć
What I know is that it was an emoji, or at least a character stored as a surrogate pair, i.e. a JavaScript string of length 2 or 4.
For some reason it ended up in this form.
(It was read from a MySQL database that uses utf8_general_ci, through a node.js/mysql2 connection with charset latin1_swedish_ci.)
How can I find what emoji it was? Is it possible?
Other examples:
├░┬č┬ĺ┬Ž
├░┬č┬ś┬ł
├░┬č┬ą┬Á
An algorithm written in JS would be the best option.
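It's double mojibake: judging by the byte patterns, the emoji's UTF-8 bytes were decoded as Latin-1, the result was encoded as UTF-8 again, and those bytes were then read as cp852. (Sorry, I can only show this in Python, not JavaScript.)
Possible repair (although prevention is better than cure): undo those steps in reverse order, i.e. encode as cp852, decode as UTF-8, encode as Latin-1, decode as UTF-8.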
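A minimal Python sketch of both directions follows. It assumes the encoding chain is UTF-8 → Latin-1 → UTF-8 → cp852, which is inferred from the sample strings rather than confirmed by the database setup; the function names and the `SAMPLES` list are made up for illustration.

```python
import unicodedata


def make_double_mojibake(text: str) -> str:
    """Reproduce the damage (assumed chain, inferred from the samples)."""
    # Step 1: the UTF-8 bytes are wrongly decoded as Latin-1.
    once = text.encode("utf-8").decode("latin-1")
    # Step 2: that wrong text is encoded as UTF-8 and shown as cp852.
    return once.encode("utf-8").decode("cp852")


def repair_double_mojibake(garbled: str) -> str:
    """Undo the two wrong decodings in reverse order."""
    # Normalize first so letters such as 'č' are single code points
    # that the cp852 codec can encode.
    garbled = unicodedata.normalize("NFC", garbled)
    once = garbled.encode("cp852").decode("utf-8")
    return once.encode("latin-1").decode("utf-8")


# The four garbled strings from the question.
SAMPLES = ["├░┬č┬Ź┬ć", "├░┬č┬ĺ┬Ž", "├░┬č┬ś┬ł", "├░┬č┬ą┬Á"]

# One recovered character per sample.
repaired = [repair_double_mojibake(s) for s in SAMPLES]
```

With the samples above, `repaired` holds the code points U+1F346, U+1F4A6, U+1F608 and U+1F975. If a string went through a different chain, one of the `encode`/`decode` calls will most likely raise a `UnicodeError` instead of silently returning a wrong result, which makes a failed guess easy to spot.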
FYI, those emojis are (column CodePoint contains Unicode (U+hhhh) and UTF-8 bytes; column Description contains surrogate pairs in parentheses):
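As a sketch of how such a listing can be produced, the snippet below computes the code point, UTF-8 bytes, UTF-16 surrogate pair and Unicode name for each character. The `EMOJIS` list assumes the four strings from the question decode to U+1F346, U+1F4A6, U+1F608 and U+1F975; `describe` and its dictionary keys are hypothetical names, not part of any library.

```python
import unicodedata

# Assumed results of repairing the four strings from the question.
EMOJIS = ["\U0001F346", "\U0001F4A6", "\U0001F608", "\U0001F975"]


def describe(ch: str) -> dict:
    """Return code point, UTF-8 bytes, UTF-16 code units and name of one character."""
    utf8 = " ".join(f"{b:02X}" for b in ch.encode("utf-8"))
    # Big-endian UTF-16 without a BOM: two bytes per code unit, so a
    # character outside the BMP yields exactly one surrogate pair.
    utf16 = ch.encode("utf-16-be")
    units = " ".join(utf16[i:i + 2].hex().upper() for i in range(0, len(utf16), 2))
    return {
        "code_point": f"U+{ord(ch):04X}",
        "utf8": utf8,
        "surrogates": units,
        # The fallback avoids a ValueError on Pythons with older Unicode data.
        "name": unicodedata.name(ch, "<unknown>"),
    }


rows = [describe(ch) for ch in EMOJIS]
```

Each surrogate pair is what JavaScript sees as a string of length 2, which matches the length mentioned in the question.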