Substantial overhaul of the UTF-8 decoder. It now uses 26 bits of state