Age | Commit message (Collapse) | Author |
|
|
|
* it is not the responsibility of a codepoint iterator to cache the resolved codepoint for reuse
** if this is required by the user of this class can iterate it better in the context it is required
** e.g. implement a "CachedIterator" template
|
|
* utility.h and utility.cc now contain the UTF8-codepoint and unit bitmasks and read / write functions
* Modified users of these functions and unions accordingly
* Added the new compilation unit to the Makefile
* Changed bitmask specification from plain integer literals to shift expressions for better readability
|
|
|
|
* CodepointIterator is a simple C++ iterator class which iterates through unicode codepoints in a UTF8-encoded string
* It is derived from std::iterator and implements the std::bidirectional_iterator_tag
* Dereferencing an instance of the class provides the codepoint as char32_t
* Tests require Google Test and use UTF8-samples from http://www.columbia.edu/~fdc/utf8/
|