Changeset 915 in tests for trunk/data/formatting/utf-8/README
- Timestamp:
- 07/19/2012 02:41:52 PM (13 years ago)
- Location:
- trunk/data/formatting/utf-8
- Files:
-
- 1 added
- 1 copied
-
. (added)
-
README (copied) (copied from trunk/data/jacob/README) (1 diff)
Legend:
- Unmodified
- Added
- Removed
-
trunk/data/formatting/utf-8/README
r909 r915 2 2 support is much, much, much, much better than PHP's. 3 3 4 * `generate_remove_accents_tests.py` generates all of the 5 remove_accents_from_* files. 6 7 Call it with `python generate_remove_accents_tests.py` 8 9 * `urlencode.py`, `u-urlencode.py` and `entitize.py` process UTF-8 4 * `utf-8/urlencode.py`, `utf-8/u-urlencode.py` and `utf-8/entitize.py` process UTF-8 10 5 into a few different formats (%-encoding, %u-encoding, &#decimal;) 11 6 and are used like normal UNIXy pipes. 12 7 13 8 Try: 14 15 `python urlencode.py < utf-8.txt > utf-8-urlencoded.txt`16 `python u-urlencode.py < utf-8.txt > utf-8-u-urlencoded.txt`17 `python entitize.py < utf-8.txt > utf-8-entitized.txt`18 19 * I think `windows-1252.py` converts Windows-only smart-quotes20 and things into their unicode &#decimal reference; equivalents.21 9 10 `python urlencode.py < utf-8.txt > urlencoded.txt` 11 `python u-urlencode.py < utf-8.txt > u-urlencoded.txt` 12 `python entitize.py < utf-8.txt > entitized.txt` 22 13 23 24 25 14 * `windows-1252.py` converts Windows-only smart-quotes and things 15 into their unicode &#decimal reference; equivalents.
Note: See TracChangeset
for help on using the changeset viewer.