Sunday, July 18, 2010

sed strip unicode out of file

e2 80 8b is the hex utf8 for unicode code point U+200b
sed -e "s/\xe2\x80\x8b//g" input.u8 >output.u8

No comments: