README.multibyte

Fri Jun  3 12:20:17 IDT 2005
============================

As noted in the NEWS file, as of 3.1.5, gawk uses character values instead
of byte values for `index', `length', `substr' and `match'.  This works
in multibyte and unicode locales.

Wed Jun 18 16:47:31 IDT 2003
============================

Multibyte locales can cause occasional weirdness, in particular with
ranges inside brackets: /[....]/.  Something that works great for ASCII
will choke for, e.g., en_US.UTF-8.  One such program is test/gsubtst5.awk.

By default, the test suite runs with LC_ALL=C and LANG=C. You
can change this by doing (from a Bourne-style shell):

	$ GAWKLOCALE=some_locale make check

Then the test suite will set LC_ALL and LANG to the given locale.

As of this writing, this works for en_US.UTF-8, and all tests
pass except gsubtst5.

For the normal case of RS = "\n", the locale is largely irrelevant.
For other single byte record separators, using LC_ALL=C will give you
much better performance when reading records.  Otherwise, gawk has to
make several function calls, *per input character* to find the record
terminator.  You have been warned.
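
To make the character-versus-byte distinction from the 2005 note concrete, here is a minimal sketch from a Bourne-style shell.  The helper name awk_length and the fallback to plain awk are assumptions made for illustration; neither is part of gawk or its test suite.  The input is the two-byte UTF-8 encoding of a single accented letter, and the second call shows a character count only if gawk 3.1.5 or later is in use and the en_US.UTF-8 locale is installed.

```shell
# Prefer gawk; fall back to plain awk if gawk is not installed.
# (Assumption for portability: only gawk is expected to count
# characters rather than bytes in a multibyte locale.)
if command -v gawk >/dev/null 2>&1; then AWK=gawk; else AWK=awk; fi

# awk_length LOCALE
# Hypothetical helper: feed one line holding the bytes 0xC3 0xA9
# (UTF-8 for "e with acute accent") and print what length() reports
# when awk runs under the given locale.
awk_length() {
    printf '\303\251\n' | LC_ALL="$1" "$AWK" '{ print length($0) }'
}

awk_length C             # C locale: length() counts the 2 bytes
awk_length en_US.UTF-8   # multibyte locale: see the caveats above
```

The same input can therefore give different answers for `length', and likewise shift the positions used by `index', `substr' and `match', depending only on the locale in effect.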