#include <ucstring.h>
Public Member Functions | |
| const UNICODE * | GetComponants () const |
Static Public Member Functions | |
| const UCDATA * | GetCharData (UNICODE ucval) |
Public Attributes | |
| UNICODE | ucval |
| UNICODE | altcase |
| UNICODE | firstcomp |
| UNICODE | nextcomp |
| UINT8 | category |
| UINT8 | biditag |
| UINT8 | decomptag |
| UINT8 | numcomp |
(see www.unicode.org). Note, 3 of the values in this structure could be enums, but I want them to take as little space as possible, and I can't guarentee that enum will be a byte on all platforms. (have to be sure, because this struct is read out of a file.
Definition at line 601 of file ucstring.h.
|
|
Return the CHARDATA for a given Unicode character. May return NULL if no data exists for a given Unicode char. |
|
|
Get the list of componants for this character If numcomp > 0, the unicode character can be decomposed into that many componants. This method will get the list of componants, assuming you got the componant by calling UCDATA::GetCharData() Definition at line 659 of file ucstring.h. |
|
|
If cagegory == UCCAT_Lu, (Letter, Uppercase), this will be the Unicode value of the lowercase version of the same letter. If category == UCCAT_Ll (Letter, Lowercase) this will be the uppercase version of the same letter Definition at line 609 of file ucstring.h. |
|
|
Bidirectional category tag. Enums defined below and explained in Chapter 3 of the Unicode Standard Definition at line 639 of file ucstring.h. |
|
|
General character category. This is one of the UCCAT enums defined below. The enum is based on the two letter code from UnicodeData.txt, so if UnicodeData.txt says a character is Lo (Letter, Other), the enum is UCCAT_Lo. Definition at line 635 of file ucstring.h. |
|
|
Decomposition tag.
Definition at line 642 of file ucstring.h. |
|
|
Unicode value of the first composite character that starts with this Unicode value. For example, Unicode value 0x0627 (ARABIC LETTER ALEF) might have a firstcomp of 0x0626 (ARABIC LETTER ALEF WITH HAMZA ABOVE). Due to the recursive nature of composition, this can't be done as a single linked list. ARABIC LETTER ALEF WITH HAMZA ABOVE might have a nextcomp of ARABIC LETTER ALEF WITH HAMZA BELOW (next in the chain of things that started with ARABIC LETTER ALEF. But it might have a firstcomp of ARABIC LETTER ALEF WITH HAMZA ABOVE FINAL FORM (beginning a chain of things starting with ARABIC LETTER ALEF WITH HAMZA ABOVE) Definition at line 623 of file ucstring.h. |
|
|
Unicode value of the first composite character that starts with this Unicode value. For example, Unicode value 0x00E0 (ARABIC SMALL LETTER A WITH GRAVE) might have a nextcomp of 0x00E1 (ARABIC SMALL LETTER A WITH ACUTE). Definition at line 629 of file ucstring.h. |
|
|
Number of componants if this is a composite glyph or a variant of the glyph.
Definition at line 646 of file ucstring.h. |
|
|
The actual Unicode value.
Definition at line 603 of file ucstring.h. |
1.3.4-20031026