> Yuck, then you’re saddled with endianness issues.

Good, that'll shake out the last of the big-endian systems.

> Plus null bytes can then be part of the data, so most
> charset-oblivious software breaks.

I thought breaking 8-bit-only software was a good thing.

> Not worth it, considering that 99.99% of text processing
> is either gluing strings together without looking inside,
> or processing them character-by-character.

Processing them character by character in UCS-4 is so much easier
than doing it in UTF-8. So is gluing them together.

> Blindly indexing into a string without having scanned it
> previously is so rare it doesn’t merit consideration.

Blindly indexing into a file without having scanned it previously is
so common that you don't even remark on it happening. A file,
remember, is a string.
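
For anyone who wants to poke at the trade-offs argued above, here is a
minimal Python sketch. It is not from the thread: the sample string and
the helper names (ucs4_char_at, utf8_char_at, utf8_seq_len) are
assumptions made up for illustration, and Python's "utf-32-le" and
"utf-32-be" codecs stand in for UCS-4. It shows the byte-order
difference, the null bytes, and direct offset indexing versus a scan
from the start of the buffer.

```python
# Hypothetical sample text: 7 code points, mixing 1-, 2- and 3-byte UTF-8 sequences.
text = "naïve ☃"

utf8 = text.encode("utf-8")
ucs4_le = text.encode("utf-32-le")  # little-endian, no byte-order mark
ucs4_be = text.encode("utf-32-be")  # big-endian, no byte-order mark


def ucs4_char_at(buf: bytes, index: int, byteorder: str = "little") -> str:
    """Fetch code point number `index` by jumping straight to byte offset index * 4."""
    start = index * 4
    return chr(int.from_bytes(buf[start:start + 4], byteorder))


def utf8_seq_len(lead: int) -> int:
    """Length in bytes of the UTF-8 sequence that starts with lead byte `lead`."""
    if lead < 0x80:
        return 1
    if lead >> 5 == 0b110:
        return 2
    if lead >> 4 == 0b1110:
        return 3
    if lead >> 3 == 0b11110:
        return 4
    raise ValueError("not a UTF-8 lead byte")


def utf8_char_at(buf: bytes, index: int) -> str:
    """Fetch code point number `index` by walking the buffer from the start."""
    pos = 0
    for _ in range(index):
        pos += utf8_seq_len(buf[pos])
    return buf[pos:pos + utf8_seq_len(buf[pos])].decode("utf-8")


if __name__ == "__main__":
    print(len(utf8), len(ucs4_le))             # encoded sizes in bytes
    print(ucs4_le == ucs4_be)                  # same text, different bytes
    print(b"\x00" in utf8, b"\x00" in ucs4_le) # null bytes in the data?
    print(ucs4_char_at(ucs4_le, 6), utf8_char_at(utf8, 6))
```

Both lookups return the same character. The difference is that the
UCS-4 one is a single multiplication, while the UTF-8 one has to step
over every sequence before the target. Whether that matters in practice
is exactly what the two posters disagree about.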
Generated at 20:00 on 17 Oct 2005 by mariachi 0.52