UCDATA Struct Reference

Structure representing "cooked" entry from UnicodeData.txt. More...

#include <mi32/ucstring.h>

List of all members.

Public Member Functions

const MIUNICODEGetComponants () const

Static Public Member Functions

static const UCDATAGetCharData (MIUNICODE ucval)

Public Attributes

MIUNICODE altcase
UINT8 biditag
UINT8 category
UINT8 decomptag
MIUNICODE firstcomp
MIUNICODE nextcomp
UINT8 numcomp
MIUNICODE ucval


Detailed Description

Structure representing "cooked" entry from UnicodeData.txt.

(see www.unicode.org). Note, 3 of the values in this structure could be enums, but I want them to take as little space as possible, and I can't guarentee that enum will be a byte on all platforms. (have to be sure, because this struct is read out of a file.

Definition at line 596 of file ucstring.h.


Member Function Documentation

static const UCDATA* UCDATA::GetCharData ( MIUNICODE  ucval  )  [static]

Return the CHARDATA for a given Unicode character.

May return NULL if no data exists for a given Unicode char.

const MIUNICODE* UCDATA::GetComponants (  )  const [inline]

Get the list of componants for this character If numcomp > 0, the unicode character can be decomposed into that many componants.

This method will get the list of componants, assuming you got the componant by calling UCDATA::GetCharData()

Definition at line 654 of file ucstring.h.


Member Data Documentation

MIUNICODE UCDATA::altcase

If cagegory == UCCAT_Lu, (Letter, Uppercase), this will be the Unicode value of the lowercase version of the same letter.

If category == UCCAT_Ll (Letter, Lowercase) this will be the uppercase version of the same letter

Definition at line 604 of file ucstring.h.

UINT8 UCDATA::biditag

Bidirectional category tag.

Enums defined below and explained in Chapter 3 of the Unicode Standard

Definition at line 634 of file ucstring.h.

UINT8 UCDATA::category

General character category.

This is one of the UCCAT enums defined below. The enum is based on the two letter code from UnicodeData.txt, so if UnicodeData.txt says a character is Lo (Letter, Other), the enum is UCCAT_Lo.

Definition at line 630 of file ucstring.h.

UINT8 UCDATA::decomptag

Decomposition tag.

Definition at line 637 of file ucstring.h.

MIUNICODE UCDATA::firstcomp

Unicode value of the first composite character that starts with this Unicode value.

For example, Unicode value 0x0627 (ARABIC LETTER ALEF) might have a firstcomp of 0x0626 (ARABIC LETTER ALEF WITH HAMZA ABOVE). Due to the recursive nature of composition, this can't be done as a single linked list. ARABIC LETTER ALEF WITH HAMZA ABOVE might have a nextcomp of ARABIC LETTER ALEF WITH HAMZA BELOW (next in the chain of things that started with ARABIC LETTER ALEF. But it might have a firstcomp of ARABIC LETTER ALEF WITH HAMZA ABOVE FINAL FORM (beginning a chain of things starting with ARABIC LETTER ALEF WITH HAMZA ABOVE)

Definition at line 618 of file ucstring.h.

MIUNICODE UCDATA::nextcomp

Unicode value of the first composite character that starts with this Unicode value.

For example, Unicode value 0x00E0 (ARABIC SMALL LETTER A WITH GRAVE) might have a nextcomp of 0x00E1 (ARABIC SMALL LETTER A WITH ACUTE).

Definition at line 624 of file ucstring.h.

UINT8 UCDATA::numcomp

Number of componants if this is a composite glyph or a variant of the glyph.

Definition at line 641 of file ucstring.h.

MIUNICODE UCDATA::ucval

The actual Unicode value.

Definition at line 598 of file ucstring.h.


The documentation for this struct was generated from the following file:
Generated on Thu Apr 26 04:47:42 2007 for TNTsdk by  doxygen 1.5.2