UCDATA Struct Reference

Structure representing "cooked" entry from UnicodeData.txt. More...

#include <ucstring.h>

List of all members.

Public Member Functions

const UNICODEGetComponants () const

Static Public Member Functions

const UCDATAGetCharData (UNICODE ucval)

Public Attributes

UNICODE ucval
UNICODE altcase
UNICODE firstcomp
UNICODE nextcomp
UINT8 category
UINT8 biditag
UINT8 decomptag
UINT8 numcomp


Detailed Description

Structure representing "cooked" entry from UnicodeData.txt.

(see www.unicode.org). Note, 3 of the values in this structure could be enums, but I want them to take as little space as possible, and I can't guarentee that enum will be a byte on all platforms. (have to be sure, because this struct is read out of a file.

Definition at line 601 of file ucstring.h.


Member Function Documentation

const UCDATA* UCDATA::GetCharData UNICODE  ucval  )  [static]
 

Return the CHARDATA for a given Unicode character.

May return NULL if no data exists for a given Unicode char.

const UNICODE* UCDATA::GetComponants  )  const [inline]
 

Get the list of componants for this character If numcomp > 0, the unicode character can be decomposed into that many componants.

This method will get the list of componants, assuming you got the componant by calling UCDATA::GetCharData()

Definition at line 659 of file ucstring.h.


Member Data Documentation

UNICODE UCDATA::altcase
 

If cagegory == UCCAT_Lu, (Letter, Uppercase), this will be the Unicode value of the lowercase version of the same letter.

If category == UCCAT_Ll (Letter, Lowercase) this will be the uppercase version of the same letter

Definition at line 609 of file ucstring.h.

UINT8 UCDATA::biditag
 

Bidirectional category tag.

Enums defined below and explained in Chapter 3 of the Unicode Standard

Definition at line 639 of file ucstring.h.

UINT8 UCDATA::category
 

General character category.

This is one of the UCCAT enums defined below. The enum is based on the two letter code from UnicodeData.txt, so if UnicodeData.txt says a character is Lo (Letter, Other), the enum is UCCAT_Lo.

Definition at line 635 of file ucstring.h.

UINT8 UCDATA::decomptag
 

Decomposition tag.

Definition at line 642 of file ucstring.h.

UNICODE UCDATA::firstcomp
 

Unicode value of the first composite character that starts with this Unicode value.

For example, Unicode value 0x0627 (ARABIC LETTER ALEF) might have a firstcomp of 0x0626 (ARABIC LETTER ALEF WITH HAMZA ABOVE). Due to the recursive nature of composition, this can't be done as a single linked list. ARABIC LETTER ALEF WITH HAMZA ABOVE might have a nextcomp of ARABIC LETTER ALEF WITH HAMZA BELOW (next in the chain of things that started with ARABIC LETTER ALEF. But it might have a firstcomp of ARABIC LETTER ALEF WITH HAMZA ABOVE FINAL FORM (beginning a chain of things starting with ARABIC LETTER ALEF WITH HAMZA ABOVE)

Definition at line 623 of file ucstring.h.

UNICODE UCDATA::nextcomp
 

Unicode value of the first composite character that starts with this Unicode value.

For example, Unicode value 0x00E0 (ARABIC SMALL LETTER A WITH GRAVE) might have a nextcomp of 0x00E1 (ARABIC SMALL LETTER A WITH ACUTE).

Definition at line 629 of file ucstring.h.

UINT8 UCDATA::numcomp
 

Number of componants if this is a composite glyph or a variant of the glyph.

Definition at line 646 of file ucstring.h.

UNICODE UCDATA::ucval
 

The actual Unicode value.

Definition at line 603 of file ucstring.h.


The documentation for this struct was generated from the following file:
Generated on Thu Aug 12 06:27:35 2004 for TNTsdk by doxygen 1.3.4-20031026