#include <mi32/ucstring.h>
Public Member Functions | |
| const MIUNICODE * | GetComponants () const |
Static Public Member Functions | |
| static const UCDATA * | GetCharData (MIUNICODE ucval) |
Public Attributes | |
| MIUNICODE | altcase |
| UINT8 | biditag |
| UINT8 | category |
| UINT8 | decomptag |
| MIUNICODE | firstcomp |
| MIUNICODE | nextcomp |
| UINT8 | numcomp |
| MIUNICODE | ucval |
(See www.unicode.org.) Note: three of the values in this structure could be enums, but I want them to take as little space as possible, and I can't guarantee that an enum will be one byte on all platforms. The size has to be certain because this struct is read out of a file.
Definition at line 596 of file ucstring.h.
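The space concern in the note above can be illustrated with a minimal sketch. It assumes a hypothetical `DemoRecord` with a byte-wide tag; `DemoCategory`, `DemoRecord`, `GetCategory` and `SetCategory` are illustration names only and are not part of ucstring.h.

```cpp
#include <cstdint>

// Hypothetical stand-in for one of the tag enums; the real enumerators are
// declared in ucstring.h and are not reproduced here.
enum DemoCategory { DEMO_Lu = 0, DEMO_Ll = 1, DEMO_Lo = 2 };

// Hypothetical record: the tag is stored in a fixed one-byte field instead of
// an enum-typed field, whose size is chosen by the compiler.
struct DemoRecord {
    std::uint16_t ucval;     // character value (16-bit width assumed for the sketch)
    std::uint8_t  category;  // holds a DemoCategory value in exactly one byte
};

// Convert between the byte on disk and the enum used in code.
inline DemoCategory GetCategory(const DemoRecord& rec) {
    return static_cast<DemoCategory>(rec.category);
}

inline void SetCategory(DemoRecord& rec, DemoCategory cat) {
    rec.category = static_cast<std::uint8_t>(cat);
}
```

The casts keep the enum available to callers while the stored field stays one byte wide regardless of how a given compiler sizes `DemoCategory`.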
Return the UCDATA for a given Unicode character.
May return NULL if no data exists for a given Unicode char.
| const MIUNICODE* UCDATA::GetComponants | ( | ) | const [inline] |
Get the list of components for this character. If numcomp > 0, the Unicode character can be decomposed into that many components.
This method returns that list, assuming this UCDATA was obtained by calling UCDATA::GetCharData().
Definition at line 654 of file ucstring.h.
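A caller would typically combine the lookup, the NULL check and the numcomp check described above. The following is a sketch only: `DemoCharData` is a hypothetical mock with a two-entry table, `DemoUnicode` stands in for MIUNICODE with an assumed 16-bit width, and the `comps` pointer is mock storage that says nothing about how the real UCDATA locates its component list.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

typedef std::uint16_t DemoUnicode;  // stand-in for MIUNICODE (width assumed)

// Hypothetical mock of UCDATA with only the members this sketch needs.
struct DemoCharData {
    DemoUnicode ucval;
    std::uint8_t numcomp;
    const DemoUnicode* comps;  // mock storage, not the real layout

    const DemoUnicode* GetComponants() const { return comps; }

    // Mock lookup; returns a null pointer when no data exists.
    static const DemoCharData* GetCharData(DemoUnicode ucval);
};

namespace {
const DemoUnicode kAGraveParts[] = { 0x0061, 0x0300 };  // a + combining grave
const DemoCharData kTable[] = {
    { 0x0061, 0, nullptr },
    { 0x00E0, 2, kAGraveParts },
};
}  // namespace

const DemoCharData* DemoCharData::GetCharData(DemoUnicode ucval) {
    for (const DemoCharData& entry : kTable)
        if (entry.ucval == ucval) return &entry;
    return nullptr;
}

// Returns the components of ch, or ch itself when it has no data or no components.
std::vector<DemoUnicode> Decompose(DemoUnicode ch) {
    std::vector<DemoUnicode> out;
    const DemoCharData* data = DemoCharData::GetCharData(ch);
    if (data == nullptr || data->numcomp == 0) {
        out.push_back(ch);
        return out;
    }
    const DemoUnicode* comps = data->GetComponants();
    for (std::size_t i = 0; i < data->numcomp; ++i)
        out.push_back(comps[i]);
    return out;
}
```

In this mock, `Decompose(0x00E0)` yields 0x0061 followed by 0x0300, while a character with no entry is returned unchanged.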
If category == UCCAT_Lu (Letter, Uppercase), this will be the Unicode value of the lowercase version of the same letter.
If category == UCCAT_Ll (Letter, Lowercase), this will be the Unicode value of the uppercase version of the same letter.
Definition at line 604 of file ucstring.h.
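The two rules above suggest how case conversion could be built on altcase. This is a rough sketch under stated assumptions: `DemoCharData` is a hypothetical mock, the `DEMO_UCCAT_*` values are invented for illustration, and altcase is assumed to be meaningful only when the category is Lu or Ll.

```cpp
#include <cstdint>

typedef std::uint16_t DemoUnicode;  // stand-in for MIUNICODE (width assumed)

// Hypothetical enumerator values; the real UCCAT enums are in ucstring.h.
enum { DEMO_UCCAT_Lu = 1, DEMO_UCCAT_Ll = 2, DEMO_UCCAT_Lo = 3 };

// Hypothetical mock of UCDATA with only the members this sketch needs.
struct DemoCharData {
    DemoUnicode  ucval;
    DemoUnicode  altcase;
    std::uint8_t category;
};

// Uppercase letters map to altcase; everything else is returned unchanged.
DemoUnicode ToLower(const DemoCharData& d) {
    return d.category == DEMO_UCCAT_Lu ? d.altcase : d.ucval;
}

// Lowercase letters map to altcase; everything else is returned unchanged.
DemoUnicode ToUpper(const DemoCharData& d) {
    return d.category == DEMO_UCCAT_Ll ? d.altcase : d.ucval;
}
```

Checking the category first matters because the documentation only defines altcase for the Lu and Ll categories.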
Bidirectional category tag.
The enums are defined below and explained in Chapter 3 of the Unicode Standard.
Definition at line 634 of file ucstring.h.
General character category.
This is one of the UCCAT enums defined below. The enum is based on the two letter code from UnicodeData.txt, so if UnicodeData.txt says a character is Lo (Letter, Other), the enum is UCCAT_Lo.
Definition at line 630 of file ucstring.h.
Unicode value of the first composite character that starts with this Unicode value.
For example, Unicode value 0x0627 (ARABIC LETTER ALEF) might have a firstcomp of 0x0623 (ARABIC LETTER ALEF WITH HAMZA ABOVE). Due to the recursive nature of composition, this can't be done as a single linked list. ARABIC LETTER ALEF WITH HAMZA ABOVE might have a nextcomp of ARABIC LETTER ALEF WITH HAMZA BELOW (next in the chain of things that started with ARABIC LETTER ALEF), but it might also have a firstcomp of ARABIC LETTER ALEF WITH HAMZA ABOVE FINAL FORM (beginning a chain of things starting with ARABIC LETTER ALEF WITH HAMZA ABOVE).
Definition at line 618 of file ucstring.h.
Unicode value of the next composite character in the chain that this character belongs to.
For example, Unicode value 0x00E0 (LATIN SMALL LETTER A WITH GRAVE) might have a nextcomp of 0x00E1 (LATIN SMALL LETTER A WITH ACUTE).
Definition at line 624 of file ucstring.h.
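The firstcomp and nextcomp links form a first-child / next-sibling structure, which can be walked as in the sketch below. Everything here is an assumption made for illustration: `DemoLinks`, `DemoTable`, `DirectComposites` and `MakeDemoTable` are hypothetical names, the table is a small mock, and the value 0 is assumed to mean "no link" (the documentation does not state the real terminator).

```cpp
#include <cstdint>
#include <map>
#include <vector>

typedef std::uint16_t DemoUnicode;  // stand-in for MIUNICODE (width assumed)

// Hypothetical mock holding only the two link members.
struct DemoLinks {
    DemoUnicode firstcomp;  // first composite starting with this character (0 = none, assumed)
    DemoUnicode nextcomp;   // next composite in the same chain (0 = none, assumed)
};

typedef std::map<DemoUnicode, DemoLinks> DemoTable;

// Collects the composites that start with base: firstcomp, then each nextcomp in turn.
std::vector<DemoUnicode> DirectComposites(const DemoTable& table, DemoUnicode base) {
    std::vector<DemoUnicode> out;
    DemoTable::const_iterator it = table.find(base);
    if (it == table.end()) return out;
    DemoUnicode cur = it->second.firstcomp;
    while (cur != 0) {
        out.push_back(cur);
        it = table.find(cur);
        if (it == table.end()) break;  // no data for this composite
        cur = it->second.nextcomp;
    }
    return out;
}

// Mock data mirroring the ARABIC LETTER ALEF example.
DemoTable MakeDemoTable() {
    DemoTable t;
    t[0x0627] = DemoLinks{0x0623, 0};       // ALEF: chain starts at ALEF WITH HAMZA ABOVE
    t[0x0623] = DemoLinks{0xFE84, 0x0625};  // own chain (FINAL FORM), then sibling HAMZA BELOW
    t[0x0625] = DemoLinks{0, 0};            // ALEF WITH HAMZA BELOW
    t[0xFE84] = DemoLinks{0, 0};            // ALEF WITH HAMZA ABOVE FINAL FORM
    return t;
}
```

With this mock table, the composites of 0x0627 are 0x0623 and 0x0625, and the composites of 0x0623 are just 0xFE84. The sketch does not guard against cyclic links, which a real table reader might want to do.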
Number of components if this is a composite glyph or a variant of the glyph.
Definition at line 641 of file ucstring.h.