Using Body Cleanup Regexps to remove 4 byte characters?

On our wiki we previously suggested using a particular regular expression to clean up problematic 4 byte characters from email bodies to avoid problems with certain MySQL database setups.  It has come to our attention that the previous regular expression would compile differently to what was expected, and it would incorrectly remove hyphens from the email body.  Please use the following body cleanup regular expression instead:

[\x{10000}-\x{10FFFF}]+

The subject cleaner pre-processing task mentioned on the above wiki page does not make use of regular expressions and is therefore not affected.

Related link:

Handling UTF-8 multi-byte characters with a MySQL database