Message174170
> I suggest apply patch A to 3.3 as it fixes performance > regression (2x) and is very simple. ASCII and UTF-8 are the two most common codecs in the world, so it's justified to have heavily optimized encoders and decoders. I don't know any application using UTF-32-LE or UTF-32-BE. So I don't want to waste Python memory/code size with a heavily optimized decoder. The patch A looks to be enough. -- 32 bit units is commonly used with wchar_t, but this format already has a fast decoder, PyUnicode_FromWideChar(), which uses memcpy() or _PyUnicode_CONVERT_BYTES(). | |
| Date | User | Action | Args | | 2012-10-30 00:59:28 | vstinner | set | recipients: + vstinner, georg.brandl, pitrou, ezio.melotti, Arfrever, asvetlov, serhiy.storchaka | | 2012-10-30 00:59:28 | vstinner | set | messageid: <1351558768.81.0.0171848183601.issue14625@psf.upfronthosting.co.za> | | 2012-10-30 00:59:28 | vstinner | link | issue14625 messages | | 2012-10-30 00:59:28 | vstinner | create | | |