Added in API level 24

UProperty


public interface UProperty

android.icu.lang.UProperty


Selection constants for Unicode properties.

These constants are used in functions like UCharacter.hasBinaryProperty(int) to select one of the Unicode properties.

The properties APIs are intended to reflect Unicode properties as defined in the Unicode Character Database (UCD) and Unicode Technical Reports (UTR).

For details about the properties see UAX #44: Unicode Character Database.

Important: If ICU is built with UCD files from Unicode versions below 3.2, then properties marked with "new" are either unavailable or only partially available. Check UCharacter.getUnicodeVersion() to be sure.

Summary

Nested classes

interface UProperty.NameChoice

Selector constants for UCharacter.getPropertyName() and UCharacter.getPropertyValueName(). 

Constants

int AGE

String property Age.

int ALPHABETIC

Binary property Alphabetic.

int ASCII_HEX_DIGIT

Binary property ASCII_Hex_Digit (0-9 A-F a-f).

int BASIC_EMOJI

Binary property of strings Basic_Emoji.

int BIDI_CLASS

Enumerated property Bidi_Class.

int BIDI_CONTROL

Binary property Bidi_Control.

int BIDI_MIRRORED

Binary property Bidi_Mirrored.

int BIDI_MIRRORING_GLYPH

String property Bidi_Mirroring_Glyph.

int BIDI_PAIRED_BRACKET

String property Bidi_Paired_Bracket (new in Unicode 6.3).

int BIDI_PAIRED_BRACKET_TYPE

Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3).

int BINARY_START

First constant for binary Unicode properties.

int BLOCK

Enumerated property Block.

int CANONICAL_COMBINING_CLASS

Enumerated property Canonical_Combining_Class.

int CASED

Binary property Cased.

int CASE_FOLDING

String property Case_Folding.

int CASE_IGNORABLE

Binary property Case_Ignorable.

int CASE_SENSITIVE

Binary property Case_Sensitive.

int CHANGES_WHEN_CASEFOLDED

Binary property Changes_When_Casefolded.

int CHANGES_WHEN_CASEMAPPED

Binary property Changes_When_Casemapped.

int CHANGES_WHEN_LOWERCASED

Binary property Changes_When_Lowercased.

int CHANGES_WHEN_NFKC_CASEFOLDED

Binary property Changes_When_NFKC_Casefolded.

int CHANGES_WHEN_TITLECASED

Binary property Changes_When_Titlecased.

int CHANGES_WHEN_UPPERCASED

Binary property Changes_When_Uppercased.

int DASH

Binary property Dash.

int DECOMPOSITION_TYPE

Enumerated property Decomposition_Type.

int DEFAULT_IGNORABLE_CODE_POINT

Binary property Default_Ignorable_Code_Point (new).

int DEPRECATED

Binary property Deprecated (new).

int DIACRITIC

Binary property Diacritic.

int DOUBLE_START

First constant for double Unicode properties.

int EAST_ASIAN_WIDTH

Enumerated property East_Asian_Width.

int EMOJI

Binary property Emoji.

int EMOJI_COMPONENT

Binary property Emoji_Component.

int EMOJI_KEYCAP_SEQUENCE

Binary property of strings Emoji_Keycap_Sequence.

int EMOJI_MODIFIER

Binary property Emoji_Modifier.

int EMOJI_MODIFIER_BASE

Binary property Emoji_Modifier_Base.

int EMOJI_PRESENTATION

Binary property Emoji_Presentation.

int EXTENDED_PICTOGRAPHIC

Binary property Extended_Pictographic.

int EXTENDER

Binary property Extender.

int FULL_COMPOSITION_EXCLUSION

Binary property Full_Composition_Exclusion.

int GENERAL_CATEGORY

Enumerated property General_Category.

int GENERAL_CATEGORY_MASK

Bitmask property General_Category_Mask.

int GRAPHEME_BASE

Binary property Grapheme_Base (new).

int GRAPHEME_CLUSTER_BREAK

Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1).

int GRAPHEME_EXTEND

Binary property Grapheme_Extend (new).

int GRAPHEME_LINK

Binary property Grapheme_Link (new).

int HANGUL_SYLLABLE_TYPE

Enumerated property Hangul_Syllable_Type, new in Unicode 4.

int HEX_DIGIT

Binary property Hex_Digit.

int HYPHEN

Binary property Hyphen.

int IDEOGRAPHIC

Binary property Ideographic.

int IDS_BINARY_OPERATOR

Binary property IDS_Binary_Operator (new).

int IDS_TRINARY_OPERATOR

Binary property IDS_Trinary_Operator (new).

int IDS_UNARY_OPERATOR

Binary property IDS_Unary_Operator.

int ID_COMPAT_MATH_CONTINUE

Binary property ID_Compat_Math_Continue.

int ID_COMPAT_MATH_START

Binary property ID_Compat_Math_Start.

int ID_CONTINUE

Binary property ID_Continue.

int ID_START

Binary property ID_Start.

int INDIC_POSITIONAL_CATEGORY

Enumerated property Indic_Positional_Category.

int INDIC_SYLLABIC_CATEGORY

Enumerated property Indic_Syllabic_Category.

int INT_START

First constant for enumerated/integer Unicode properties.

int JOINING_GROUP

Enumerated property Joining_Group.

int JOINING_TYPE

Enumerated property Joining_Type.

int JOIN_CONTROL

Binary property Join_Control.

int LEAD_CANONICAL_COMBINING_CLASS

Enumerated property Lead_Canonical_Combining_Class.

int LINE_BREAK

Enumerated property Line_Break.

int LOGICAL_ORDER_EXCEPTION

Binary property Logical_Order_Exception (new).

int LOWERCASE

Binary property Lowercase.

int LOWERCASE_MAPPING

String property Lowercase_Mapping.

int MASK_START

First constant for bit-mask Unicode properties.

int MATH

Binary property Math.

int NAME

String property Name.

int NFC_INERT

Binary property NFC_Inert.

int NFC_QUICK_CHECK

Enumerated property NFC_Quick_Check.

int NFD_INERT

Binary property NFD_Inert.

int NFD_QUICK_CHECK

Enumerated property NFD_Quick_Check.

int NFKC_INERT

Binary property NFKC_Inert.

int NFKC_QUICK_CHECK

Enumerated property NFKC_Quick_Check.

int NFKD_INERT

Binary property NFKD_Inert.

int NFKD_QUICK_CHECK

Enumerated property NFKD_Quick_Check.

int NONCHARACTER_CODE_POINT

Binary property Noncharacter_Code_Point.

int NUMERIC_TYPE

Enumerated property Numeric_Type.

int NUMERIC_VALUE

Double property Numeric_Value.

int OTHER_PROPERTY_START

First constant for Unicode properties with unusual value types.

int PATTERN_SYNTAX

Binary property Pattern_Syntax (new in Unicode 4.1).

int PATTERN_WHITE_SPACE

Binary property Pattern_White_Space (new in Unicode 4.1).

int POSIX_ALNUM

Binary property alnum (a C/POSIX character class).

int POSIX_BLANK

Binary property blank (a C/POSIX character class).

int POSIX_GRAPH

Binary property graph (a C/POSIX character class).

int POSIX_PRINT

Binary property print (a C/POSIX character class).

int POSIX_XDIGIT

Binary property xdigit (a C/POSIX character class).

int PREPENDED_CONCATENATION_MARK

Binary property Prepended_Concatenation_Mark.

int QUOTATION_MARK

Binary property Quotation_Mark.

int RADICAL

Binary property Radical (new).

int REGIONAL_INDICATOR

Binary property Regional_Indicator.

int RGI_EMOJI

Binary property of strings RGI_Emoji.

int RGI_EMOJI_FLAG_SEQUENCE

Binary property of strings RGI_Emoji_Flag_Sequence.

int RGI_EMOJI_MODIFIER_SEQUENCE

Binary property of strings RGI_Emoji_Modifier_Sequence.

int RGI_EMOJI_TAG_SEQUENCE

Binary property of strings RGI_Emoji_Tag_Sequence.

int RGI_EMOJI_ZWJ_SEQUENCE

Binary property of strings RGI_Emoji_ZWJ_Sequence.

int SCRIPT

Enumerated property Script.

int SCRIPT_EXTENSIONS

Miscellaneous property Script_Extensions (new in Unicode 6.0).

int SEGMENT_STARTER

Binary property Segment_Starter.

int SENTENCE_BREAK

Enumerated property Sentence_Break (new in Unicode 4.1).

int SIMPLE_CASE_FOLDING

String property Simple_Case_Folding.

int SIMPLE_LOWERCASE_MAPPING

String property Simple_Lowercase_Mapping.

int SIMPLE_TITLECASE_MAPPING

String property Simple_Titlecase_Mapping.

int SIMPLE_UPPERCASE_MAPPING

String property Simple_Uppercase_Mapping.

int SOFT_DOTTED

Binary property Soft_Dotted (new).

int STRING_START

First constant for string Unicode properties.

int S_TERM

Binary property STerm (new in Unicode 4.0.1).

int TERMINAL_PUNCTUATION

Binary property Terminal_Punctuation.

int TITLECASE_MAPPING

String property Titlecase_Mapping.

int TRAIL_CANONICAL_COMBINING_CLASS

Enumerated property Trail_Canonical_Combining_Class.

int UNIFIED_IDEOGRAPH

Binary property Unified_Ideograph (new).

int UPPERCASE

Binary property Uppercase.

int UPPERCASE_MAPPING

String property Uppercase_Mapping.

int VARIATION_SELECTOR

Binary property Variation_Selector (new in Unicode 4.0.1).

int VERTICAL_ORIENTATION

Enumerated property Vertical_Orientation.

int WHITE_SPACE

Binary property White_Space.

int WORD_BREAK

Enumerated property Word_Break (new in Unicode 4.1).

int XID_CONTINUE

Binary property XID_Continue.

int XID_START

Binary property XID_Start.

Constants

AGE

Added in API level 24
public static final int AGE

String property Age. Corresponds to UCharacter.getAge(int).

Constant Value: 16384 (0x00004000)

ALPHABETIC

Added in API level 24
public static final int ALPHABETIC

Binary property Alphabetic.

Property for UCharacter.isUAlphabetic(), different from the property in UCharacter.isalpha().

Lu + Ll + Lt + Lm + Lo + Nl + Other_Alphabetic.

Constant Value: 0 (0x00000000)

ASCII_HEX_DIGIT

Added in API level 24
public static final int ASCII_HEX_DIGIT

Binary property ASCII_Hex_Digit (0-9 A-F a-f).

Constant Value: 1 (0x00000001)
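The character set named in the description (0-9 A-F a-f) can be written out as a plain range check. The following is a minimal sketch for illustration only: the class and method names are hypothetical, and it does not call the ICU property lookup.

```java
// Hypothetical helper, not part of android.icu: spells out the ASCII_Hex_Digit set.
final class AsciiHexDigitSketch {
    private AsciiHexDigitSketch() {}

    /** Returns true only for the 22 code points 0-9, A-F and a-f. */
    static boolean isAsciiHexDigit(int codePoint) {
        return (codePoint >= '0' && codePoint <= '9')
                || (codePoint >= 'A' && codePoint <= 'F')
                || (codePoint >= 'a' && codePoint <= 'f');
    }
}
```

Code points outside ASCII, such as fullwidth digits, are rejected by this check.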

BASIC_EMOJI

Added in API level 34
public static final int BASIC_EMOJI

Binary property of strings Basic_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets

Constant Value: 65 (0x00000041)

BIDI_CLASS

Added in API level 24
public static final int BIDI_CLASS

Enumerated property Bidi_Class. Same as UCharacter.getDirection(int), returns UCharacterDirection values.

Constant Value: 4096 (0x00001000)

BIDI_CONTROL

Added in API level 24
public static final int BIDI_CONTROL

Binary property Bidi_Control.

Format controls which have specific functions in the Bidi Algorithm.

Constant Value: 2 (0x00000002)

BIDI_MIRRORED

Added in API level 24
public static final int BIDI_MIRRORED

Binary property Bidi_Mirrored.

Characters that may change display in RTL text.

Property for UCharacter.isMirrored().

See Bidi Algorithm; UTR 9.

Constant Value: 3 (0x00000003)

BIDI_MIRRORING_GLYPH

Added in API level 24
public static final int BIDI_MIRRORING_GLYPH

String property Bidi_Mirroring_Glyph. Corresponds to UCharacter.getMirror(int).

Constant Value: 16385 (0x00004001)

BIDI_PAIRED_BRACKET

Added in API level 24
public static final int BIDI_PAIRED_BRACKET

String property Bidi_Paired_Bracket (new in Unicode 6.3). Corresponds to UCharacter.getBidiPairedBracket.

Constant Value: 16397 (0x0000400d)

BIDI_PAIRED_BRACKET_TYPE

Added in API level 24
public static final int BIDI_PAIRED_BRACKET_TYPE

Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3). Used in UAX #9: Unicode Bidirectional Algorithm (http://www.unicode.org/reports/tr9/). Returns UCharacter.BidiPairedBracketType values.

Constant Value: 4117 (0x00001015)

BINARY_START

Added in API level 24
public static final int BINARY_START

First constant for binary Unicode properties.

Constant Value: 0 (0x00000000)

BLOCK

Added in API level 24
public static final int BLOCK

Enumerated property Block. Same as UCharacter.UnicodeBlock.of(int), returns UCharacter.UnicodeBlock values.

Constant Value: 4097 (0x00001001)

CANONICAL_COMBINING_CLASS

Added in API level 24
public static final int CANONICAL_COMBINING_CLASS

Enumerated property Canonical_Combining_Class. Same as UCharacter.getCombiningClass(int), returns 8-bit numeric values.

Constant Value: 4098 (0x00001002)

CASED

Added in API level 24
public static final int CASED

Binary property Cased. For Lowercase, Uppercase and Titlecase characters.

Constant Value: 49 (0x00000031)

CASE_FOLDING

Added in API level 24
public static final int CASE_FOLDING

String property Case_Folding. Corresponds to UCharacter.foldCase(String, boolean).

Constant Value: 16386 (0x00004002)

CASE_IGNORABLE

Added in API level 24
public static final int CASE_IGNORABLE

Binary property Case_Ignorable. Used in context-sensitive case mappings.

Constant Value: 50 (0x00000032)

CASE_SENSITIVE

Added in API level 24
public static final int CASE_SENSITIVE

Binary property Case_Sensitive.

Characters that are either the source or the target of a case mapping. Not the same as the general category Cased_Letter.

Constant Value: 34 (0x00000022)

CHANGES_WHEN_CASEFOLDED

Added in API level 24
public static final int CHANGES_WHEN_CASEFOLDED

Binary property Changes_When_Casefolded.

Constant Value: 54 (0x00000036)

CHANGES_WHEN_CASEMAPPED

Added in API level 24
public static final int CHANGES_WHEN_CASEMAPPED

Binary property Changes_When_Casemapped.

Constant Value: 55 (0x00000037)

CHANGES_WHEN_LOWERCASED

Added in API level 24
public static final int CHANGES_WHEN_LOWERCASED

Binary property Changes_When_Lowercased.

Constant Value: 51 (0x00000033)

CHANGES_WHEN_NFKC_CASEFOLDED

Added in API level 24
public static final int CHANGES_WHEN_NFKC_CASEFOLDED

Binary property Changes_When_NFKC_Casefolded.

Constant Value: 56 (0x00000038)

CHANGES_WHEN_TITLECASED

Added in API level 24
public static final int CHANGES_WHEN_TITLECASED

Binary property Changes_When_Titlecased.

Constant Value: 53 (0x00000035)

CHANGES_WHEN_UPPERCASED

Added in API level 24
public static final int CHANGES_WHEN_UPPERCASED

Binary property Changes_When_Uppercased.

Constant Value: 52 (0x00000034)

DASH

Added in API level 24
public static final int DASH

Binary property Dash.

Variations of dashes.

Constant Value: 4 (0x00000004)

DECOMPOSITION_TYPE

Added in API level 24
public static final int DECOMPOSITION_TYPE

Enumerated property Decomposition_Type. Returns UCharacter.DecompositionType values.

Constant Value: 4099 (0x00001003)

DEFAULT_IGNORABLE_CODE_POINT

Added in API level 24
public static final int DEFAULT_IGNORABLE_CODE_POINT

Binary property Default_Ignorable_Code_Point (new).

Indicates that a code point is ignorable in most processing.

Codepoints (2060..206F, FFF0..FFFB, E0000..E0FFF) + Other_Default_Ignorable_Code_Point + (Cf + Cc + Cs - White_Space)

Constant Value: 5 (0x00000005)

DEPRECATED

Added in API level 24
public static final int DEPRECATED

Binary property Deprecated (new).

The usage of deprecated characters is strongly discouraged.

Constant Value: 6 (0x00000006)

DIACRITIC

Added in API level 24
public static final int DIACRITIC

Binary property Diacritic.

Characters that linguistically modify the meaning of another character to which they apply.

Constant Value: 7 (0x00000007)

DOUBLE_START

Added in API level 24
public static final int DOUBLE_START

First constant for double Unicode properties.

Constant Value: 12288 (0x00003000)

EAST_ASIAN_WIDTH

Added in API level 24
public static final int EAST_ASIAN_WIDTH

Enumerated property East_Asian_Width. See http://www.unicode.org/reports/tr11/. Returns UCharacter.EastAsianWidth values.

Constant Value: 4100 (0x00001004)

EMOJI

Added in API level 28
public static final int EMOJI

Binary property Emoji. See http://www.unicode.org/reports/tr51/#Emoji_Properties

Constant Value: 57 (0x00000039)

EMOJI_COMPONENT

Added in API level 28
public static final int EMOJI_COMPONENT

Binary property Emoji_Component. See http://www.unicode.org/reports/tr51/#Emoji_Properties

Constant Value: 61 (0x0000003d)

EMOJI_KEYCAP_SEQUENCE

Added in API level 34
public static final int EMOJI_KEYCAP_SEQUENCE

Binary property of strings Emoji_Keycap_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets

Constant Value: 66 (0x00000042)

EMOJI_MODIFIER

Added in API level 28
public static final int EMOJI_MODIFIER

Binary property Emoji_Modifier. See http://www.unicode.org/reports/tr51/#Emoji_Properties

Constant Value: 59 (0x0000003b)

EMOJI_MODIFIER_BASE

Added in API level 28
public static final int EMOJI_MODIFIER_BASE

Binary property Emoji_Modifier_Base. See http://www.unicode.org/reports/tr51/#Emoji_Properties

Constant Value: 60 (0x0000003c)

EMOJI_PRESENTATION

Added in API level 28
public static final int EMOJI_PRESENTATION

Binary property Emoji_Presentation. See http://www.unicode.org/reports/tr51/#Emoji_Properties

Constant Value: 58 (0x0000003a)

EXTENDED_PICTOGRAPHIC

Added in API level 29
public static final int EXTENDED_PICTOGRAPHIC

Binary property Extended_Pictographic. See http://www.unicode.org/reports/tr51/#Emoji_Properties

Constant Value: 64 (0x00000040)

EXTENDER

Added in API level 24
public static final int EXTENDER

Binary property Extender.

Extend the value or shape of a preceding alphabetic character, e.g. length and iteration marks.

Constant Value: 8 (0x00000008)

FULL_COMPOSITION_EXCLUSION

Added in API level 24
public static final int FULL_COMPOSITION_EXCLUSION

Binary property Full_Composition_Exclusion.

CompositionExclusions.txt + Singleton Decompositions + Non-Starter Decompositions.

Constant Value: 9 (0x00000009)

GENERAL_CATEGORY

Added in API level 24
public static final int GENERAL_CATEGORY

Enumerated property General_Category. Same as UCharacter.getType(int), returns UCharacterCategory values.

Constant Value: 4101 (0x00001005)

GENERAL_CATEGORY_MASK

Added in API level 24
public static final int GENERAL_CATEGORY_MASK

Bitmask property General_Category_Mask. This is the General_Category property returned as a bit mask. When used in UCharacter.getIntPropertyValue(c), returns bit masks for UCharacterCategory values where exactly one bit is set. When used with UCharacter.getPropertyValueName() and UCharacter.getPropertyValueEnum(), a multi-bit mask is used for sets of categories like "Letters".

Constant Value: 8192 (0x00002000)
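The single-bit versus multi-bit distinction described above can be sketched with plain integer arithmetic. This sketch assumes that the mask for a category with numeric value n is 1 << n, and it uses local copies of the numeric values of the five letter categories (Lu, Ll, Lt, Lm, Lo). The class and its members are hypothetical and do not call UCharacter.

```java
// Hypothetical sketch of category bit masks; it does not use android.icu.
final class GeneralCategoryMaskSketch {
    private GeneralCategoryMaskSketch() {}

    // Local copies of the numeric category values for Lu, Ll, Lt, Lm, Lo.
    static final int UPPERCASE_LETTER = 1;
    static final int LOWERCASE_LETTER = 2;
    static final int TITLECASE_LETTER = 3;
    static final int MODIFIER_LETTER = 4;
    static final int OTHER_LETTER = 5;

    /** Assumed mapping: one category value becomes a mask with exactly one bit set. */
    static int maskOf(int category) {
        return 1 << category;
    }

    /** Multi-bit mask covering the whole "Letters" group. */
    static final int LETTER_MASK = maskOf(UPPERCASE_LETTER) | maskOf(LOWERCASE_LETTER)
            | maskOf(TITLECASE_LETTER) | maskOf(MODIFIER_LETTER) | maskOf(OTHER_LETTER);

    /** Tests group membership with a single AND instead of five comparisons. */
    static boolean isLetterCategory(int category) {
        return (maskOf(category) & LETTER_MASK) != 0;
    }
}
```

With a single-bit mask per character and a multi-bit mask per group, membership in any set of categories reduces to one bitwise AND.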

GRAPHEME_BASE

Added in API level 24
public static final int GRAPHEME_BASE

Binary property Grapheme_Base (new).

For programmatic determination of grapheme cluster boundaries. [0..10FFFF]-Cc-Cf-Cs-Co-Cn-Zl-Zp-Grapheme_Link-Grapheme_Extend-CGJ

Constant Value: 10 (0x0000000a)

GRAPHEME_CLUSTER_BREAK

Added in API level 24
public static final int GRAPHEME_CLUSTER_BREAK

Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/). Returns UCharacter.GraphemeClusterBreak values.

Constant Value: 4114 (0x00001012)

GRAPHEME_EXTEND

Added in API level 24
public static final int GRAPHEME_EXTEND

Binary property Grapheme_Extend (new).

For programmatic determination of grapheme cluster boundaries.

Me+Mn+Mc+Other_Grapheme_Extend-Grapheme_Link-CGJ

Constant Value: 11 (0x0000000b)

GRAPHEME_LINK

Added in API level 24
public static final int GRAPHEME_LINK

Binary property Grapheme_Link (new).

For programmatic determination of grapheme cluster boundaries.

Constant Value: 12 (0x0000000c)

HANGUL_SYLLABLE_TYPE

Added in API level 24
public static final int HANGUL_SYLLABLE_TYPE

Enumerated property Hangul_Syllable_Type, new in Unicode 4. Returns UCharacter.HangulSyllableType values.

Constant Value: 4107 (0x0000100b)

HEX_DIGIT

Added in API level 24
public static final int HEX_DIGIT

Binary property Hex_Digit.

Characters commonly used for hexadecimal numbers.

Constant Value: 13 (0x0000000d)

HYPHEN

Added in API level 24
public static final int HYPHEN

Binary property Hyphen.

Dashes used to mark connections between pieces of words, plus the Katakana middle dot.

Constant Value: 14 (0x0000000e)

IDEOGRAPHIC

Added in API level 24
public static final int IDEOGRAPHIC

Binary property Ideographic.

CJKV ideographs.

Constant Value: 17 (0x00000011)

IDS_BINARY_OPERATOR

Added in API level 24
public static final int IDS_BINARY_OPERATOR

Binary property IDS_Binary_Operator (new).

For programmatic determination of Ideographic Description Sequences.

Constant Value: 18 (0x00000012)

IDS_TRINARY_OPERATOR

Added in API level 24
public static final int IDS_TRINARY_OPERATOR

Binary property IDS_Trinary_Operator (new).

For programmatic determination of Ideographic Description Sequences.

Constant Value: 19 (0x00000013)

IDS_UNARY_OPERATOR

public static final int IDS_UNARY_OPERATOR

Binary property IDS_Unary_Operator. For programmatic determination of Ideographic Description Sequences.

Constant Value: 72 (0x00000048)

ID_COMPAT_MATH_CONTINUE

public static final int ID_COMPAT_MATH_CONTINUE

Binary property ID_Compat_Math_Continue.

Used in mathematical identifier profile in UAX #31.

Constant Value: 74 (0x0000004a)

ID_COMPAT_MATH_START

public static final int ID_COMPAT_MATH_START

Binary property ID_Compat_Math_Start.

Used in mathematical identifier profile in UAX #31.

Constant Value: 73 (0x00000049)

ID_CONTINUE

Added in API level 24
public static final int ID_CONTINUE

Binary property ID_Continue.

Characters that can continue an identifier.

ID_Start+Mn+Mc+Nd+Pc

Constant Value: 15 (0x0000000f)

ID_START

Added in API level 24
public static final int ID_START

Binary property ID_Start.

Characters that can start an identifier.

Lu+Ll+Lt+Lm+Lo+Nl

Constant Value: 16 (0x00000010)
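The two category formulas above (ID_Start as Lu+Ll+Lt+Lm+Lo+Nl, ID_Continue as ID_Start+Mn+Mc+Nd+Pc) can be approximated with the general categories exposed by java.lang.Character. This is a sketch only: the class and method names are hypothetical, the JDK's Unicode data may differ from ICU's, and the real properties include further adjustments that these formulas do not show.

```java
// Hypothetical approximation of the ID_Start / ID_Continue formulas using JDK categories.
final class IdentifierSketch {
    private IdentifierSketch() {}

    /** Lu + Ll + Lt + Lm + Lo + Nl. */
    static boolean isIdStartApprox(int codePoint) {
        switch (Character.getType(codePoint)) {
            case Character.UPPERCASE_LETTER:
            case Character.LOWERCASE_LETTER:
            case Character.TITLECASE_LETTER:
            case Character.MODIFIER_LETTER:
            case Character.OTHER_LETTER:
            case Character.LETTER_NUMBER:
                return true;
            default:
                return false;
        }
    }

    /** ID_Start + Mn + Mc + Nd + Pc. */
    static boolean isIdContinueApprox(int codePoint) {
        if (isIdStartApprox(codePoint)) {
            return true;
        }
        switch (Character.getType(codePoint)) {
            case Character.NON_SPACING_MARK:
            case Character.COMBINING_SPACING_MARK:
            case Character.DECIMAL_DIGIT_NUMBER:
            case Character.CONNECTOR_PUNCTUATION:
                return true;
            default:
                return false;
        }
    }

    /** One start character followed by zero or more continue characters. */
    static boolean isIdentifierApprox(String s) {
        if (s.isEmpty() || !isIdStartApprox(s.codePointAt(0))) {
            return false;
        }
        for (int i = Character.charCount(s.codePointAt(0)); i < s.length(); ) {
            int cp = s.codePointAt(i);
            if (!isIdContinueApprox(cp)) {
                return false;
            }
            i += Character.charCount(cp);
        }
        return true;
    }
}
```

Under these formulas a digit or an underscore can continue an identifier but cannot start one.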

INDIC_POSITIONAL_CATEGORY

Added in API level 29
public static final int INDIC_POSITIONAL_CATEGORY

Enumerated property Indic_Positional_Category. New in Unicode 6.0 as provisional property Indic_Matra_Category; renamed and changed to informative in Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicPositionalCategory.txt

Constant Value: 4118 (0x00001016)

INDIC_SYLLABIC_CATEGORY

Added in API level 29
public static final int INDIC_SYLLABIC_CATEGORY

Enumerated property Indic_Syllabic_Category. New in Unicode 6.0 as provisional; informative since Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicSyllabicCategory.txt

Constant Value: 4119 (0x00001017)

INT_START

Added in API level 24
public static final int INT_START

First constant for enumerated/integer Unicode properties.

Constant Value: 4096 (0x00001000)

JOINING_GROUP

Added in API level 24
public static final int JOINING_GROUP

Enumerated property Joining_Group. Returns UCharacter.JoiningGroup values.

Constant Value: 4102 (0x00001006)

JOINING_TYPE

Added in API level 24
public static final int JOINING_TYPE

Enumerated property Joining_Type. Returns UCharacter.JoiningType values.

Constant Value: 4103 (0x00001007)

JOIN_CONTROL

Added in API level 24
public static final int JOIN_CONTROL

Binary property Join_Control.

Format controls for cursive joining and ligation.

Constant Value: 20 (0x00000014)

LEAD_CANONICAL_COMBINING_CLASS

Added in API level 24
public static final int LEAD_CANONICAL_COMBINING_CLASS

Enumerated property Lead_Canonical_Combining_Class. ICU-specific property for the ccc of the first code point of the decomposition, or lccc(c)=ccc(NFD(c)[0]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD. Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.

Constant Value: 4112 (0x00001010)

LINE_BREAK

Added in API level 24
public static final int LINE_BREAK

Enumerated property Line_Break. Returns UCharacter.LineBreak values.

Constant Value: 4104 (0x00001008)

LOGICAL_ORDER_EXCEPTION

Added in API level 24
public static final int LOGICAL_ORDER_EXCEPTION

Binary property Logical_Order_Exception (new).

Characters that do not use logical order and require special handling in most processing.

Constant Value: 21 (0x00000015)

LOWERCASE

Added in API level 24
public static final int LOWERCASE

Binary property Lowercase.

Same as UCharacter.isULowercase(), different from UCharacter.islower().

Ll+Other_Lowercase

Constant Value: 22 (0x00000016)

LOWERCASE_MAPPING

Added in API level 24
public static final int LOWERCASE_MAPPING

String property Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(String).

Constant Value: 16388 (0x00004004)

MASK_START

Added in API level 24
public static final int MASK_START

First constant for bit-mask Unicode properties.

Constant Value: 8192 (0x00002000)

MATH

Added in API level 24
public static final int MATH

Binary property Math.

Sm+Other_Math

Constant Value: 23 (0x00000017)

NAME

Added in API level 24
public static final int NAME

String property Name. Corresponds to UCharacter.getName(int).

Constant Value: 16389 (0x00004005)

NFC_INERT

Added in API level 24
public static final int NFC_INERT

Binary property NFC_Inert. ICU-specific property for characters that are inert under NFC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.

Constant Value: 39 (0x00000027)

NFC_QUICK_CHECK

Added in API level 24
public static final int NFC_QUICK_CHECK

Enumerated property NFC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.

Constant Value: 4110 (0x0000100e)

NFD_INERT

Added in API level 24
public static final int NFD_INERT

Binary property NFD_Inert. ICU-specific property for characters that are inert under NFD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.

There is one such property per normalization form. These properties are computed as follows. An inert character is:

a) unassigned, or ALL of the following:
b) of combining class 0.
c) not decomposed by this normalization form.

AND if NFC or NFKC,

d) can never compose with a previous character.
e) can never compose with a following character.
f) can never change if another character is added.

Example: a-breve might satisfy all but f, but if you add an ogonek it changes to a-ogonek + breve.

See also com.ibm.text.UCD.NFSkippable in the ICU4J repository, and icu/source/common/unormimp.h.

Constant Value: 37 (0x00000025)
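The a-breve example can be reproduced with the JDK's java.text.Normalizer, used here as a stand-in for ICU's normalizer. This is a minimal sketch, and the class and method names are hypothetical. It shows that appending a combining ogonek (U+0328) to a-breve (U+0103) changes the already-normalized character under NFC, which is exactly what criterion f rules out for an inert character.

```java
import java.text.Normalizer;

// Hypothetical sketch: demonstrates why a-breve is not inert under NFC.
final class InertnessSketch {
    private InertnessSketch() {}

    /** Normalizes text to NFC with the JDK normalizer. */
    static String nfc(String text) {
        return Normalizer.normalize(text, Normalizer.Form.NFC);
    }

    /**
     * Returns true if appending {@code addition} alters the NFC form of {@code base},
     * i.e. the combined result no longer starts with NFC(base).
     */
    static boolean changesWhenFollowedBy(String base, String addition) {
        return !nfc(base + addition).startsWith(nfc(base));
    }
}
```

For example, `InertnessSketch.nfc("\u0103\u0328")` yields a-ogonek (U+0105) followed by a combining breve (U+0306), so `changesWhenFollowedBy("\u0103", "\u0328")` is true. An incremental normalizer therefore cannot treat a-breve as a safe boundary.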

NFD_QUICK_CHECK

Added in API level 24
public static final int NFD_QUICK_CHECK

Enumerated property NFD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.

Constant Value: 4108 (0x0000100c)

NFKC_INERT

Added in API level 24
public static final int NFKC_INERT

Binary property NFKC_Inert. ICU-specific property for characters that are inert under NFKC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.

Constant Value: 40 (0x00000028)

NFKC_QUICK_CHECK

Added in API level 24
public static final int NFKC_QUICK_CHECK

Enumerated property NFKC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.

Constant Value: 4111 (0x0000100f)

NFKD_INERT

Added in API level 24
public static final int NFKD_INERT

Binary property NFKD_Inert. ICU-specific property for characters that are inert under NFKD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.

Constant Value: 38 (0x00000026)

NFKD_QUICK_CHECK

Added in API level 24
public static final int NFKD_QUICK_CHECK

Enumerated property NFKD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.

Constant Value: 4109 (0x0000100d)

NONCHARACTER_CODE_POINT

Added in API level 24
public static final int NONCHARACTER_CODE_POINT

Binary property Noncharacter_Code_Point.

Code points that are explicitly defined as illegal for the encoding of characters.

Constant Value: 24 (0x00000018)

NUMERIC_TYPE

Added in API level 24
public static final int NUMERIC_TYPE

Enumerated property Numeric_Type. Returns UCharacter.NumericType values.

Constant Value: 4105 (0x00001009)

NUMERIC_VALUE

Added in API level 24
public static final int NUMERIC_VALUE

Double property Numeric_Value. Corresponds to UCharacter.getUnicodeNumericValue(int).

Constant Value: 12288 (0x00003000)

OTHER_PROPERTY_START

Added in API level 24
public static final int OTHER_PROPERTY_START

First constant for Unicode properties with unusual value types.

Constant Value: 28672 (0x00007000)

PATTERN_SYNTAX

Added in API level 24
public static final int PATTERN_SYNTAX

Binary property Pattern_Syntax (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)

Constant Value: 42 (0x0000002a)

PATTERN_WHITE_SPACE

Added in API level 24
public static final int PATTERN_WHITE_SPACE

Binary property Pattern_White_Space (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)

Constant Value: 43 (0x0000002b)

POSIX_ALNUM

Added in API level 24
public static final int POSIX_ALNUM

Binary property alnum (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

Constant Value: 44 (0x0000002c)

POSIX_BLANK

Added in API level 24
public static final int POSIX_BLANK

Binary property blank (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

Constant Value: 45 (0x0000002d)

POSIX_GRAPH

Added in API level 24
public static final int POSIX_GRAPH

Binary property graph (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

Constant Value: 46 (0x0000002e)

POSIX_PRINT

Added in API level 24
public static final int POSIX_PRINT

Binary property print (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

Constant Value: 47 (0x0000002f)

POSIX_XDIGIT

Added in API level 24
public static final int POSIX_XDIGIT

Binary property xdigit (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

Constant Value: 48 (0x00000030)

PREPENDED_CONCATENATION_MARK

Added in API level 28
public static final int PREPENDED_CONCATENATION_MARK

Binary property Prepended_Concatenation_Mark.

Constant Value: 63 (0x0000003f)

QUOTATION_MARK

Added in API level 24
public static final int QUOTATION_MARK

Binary property Quotation_Mark.

Constant Value: 25 (0x00000019)

RADICAL

Added in API level 24
public static final int RADICAL

Binary property Radical (new).

For programmatic determination of Ideographic Description Sequences.

Constant Value: 26 (0x0000001a)

REGIONAL_INDICATOR

Added in API level 28
public static final int REGIONAL_INDICATOR

Binary property Regional_Indicator.

Constant Value: 62 (0x0000003e)

RGI_EMOJI

Added in API level 34
public static final int RGI_EMOJI

Binary property of strings RGI_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets

Constant Value: 71 (0x00000047)

RGI_EMOJI_FLAG_SEQUENCE

Added in API level 34
public static final int RGI_EMOJI_FLAG_SEQUENCE

Binary property of strings RGI_Emoji_Flag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets

Constant Value: 68 (0x00000044)

RGI_EMOJI_MODIFIER_SEQUENCE

Added in API level 34
public static final int RGI_EMOJI_MODIFIER_SEQUENCE

Binary property of strings RGI_Emoji_Modifier_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets

Constant Value: 67 (0x00000043)

RGI_EMOJI_TAG_SEQUENCE

Added in API level 34
public static final int RGI_EMOJI_TAG_SEQUENCE

Binary property of strings RGI_Emoji_Tag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets

Constant Value: 69 (0x00000045)

RGI_EMOJI_ZWJ_SEQUENCE

Added in API level 34
public static final int RGI_EMOJI_ZWJ_SEQUENCE

Binary property of strings RGI_Emoji_ZWJ_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets

Constant Value: 70 (0x00000046)

SCRIPT

Added in API level 24
public static final int SCRIPT

Enumerated property Script. Same as UScript.getScript(int), returns UScript values.

Constant Value: 4106 (0x0000100a)

SCRIPT_EXTENSIONS

Added in API level 24
public static final int SCRIPT_EXTENSIONS

Miscellaneous property Script_Extensions (new in Unicode 6.0). Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/. Corresponds to UScript.hasScript and UScript.getScriptExtensions.

Constant Value: 28672 (0x00007000)

SEGMENT_STARTER

Added in API level 24
public static final int SEGMENT_STARTER

Binary property Segment_Starter. ICU-specific property for characters that are starters in terms of Unicode normalization and combining character sequences. They have ccc=0 and do not occur in non-initial position of the canonical decomposition of any character (like the combining diaeresis in NFD(a-umlaut) and a Jamo T in an NFD(Hangul LVT)). ICU uses this property for segmenting a string for generating a set of canonically equivalent strings, e.g. for canonical closure while processing collation tailoring rules.

Constant Value: 41 (0x00000029)
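
The two counterexamples in the description can be made concrete with java.text.Normalizer, used here as a stand-in for ICU's normalization (an assumption made for self-containment). The sketch only prints canonical decompositions; it does not compute Segment_Starter itself, and the class and method names are hypothetical.

```java
import java.text.Normalizer;

// Illustration only: shows code points that occur in a non-initial position
// of a canonical decomposition, which disqualifies them as segment starters.
class SegmentStarterSketch {
    /** Returns the code points of the NFD form of s. */
    static int[] nfdCodePoints(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFD).codePoints().toArray();
    }

    static void dump(String label, String s) {
        StringBuilder sb = new StringBuilder(label).append(" ->");
        for (int cp : nfdCodePoints(s)) {
            sb.append(String.format(" U+%04X", cp));
        }
        System.out.println(sb);
    }

    public static void main(String[] args) {
        // a-umlaut (U+00E4): the combining diaeresis U+0308 is in second position.
        dump("U+00E4", "\u00E4");
        // Hangul LVT syllable U+AC01: the trailing Jamo T is in third position.
        dump("U+AC01", "\uAC01");
    }
}
```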

SENTENCE_BREAK

Added in API level 24
public static final int SENTENCE_BREAK

Enumerated property Sentence_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/). Returns UCharacter.SentenceBreak values.

Constant Value: 4115 (0x00001013)

SIMPLE_CASE_FOLDING

Added in API level 24
public static final int SIMPLE_CASE_FOLDING

String property Simple_Case_Folding. Corresponds to UCharacter.foldCase(int, boolean).

Constant Value: 16390 (0x00004006)

SIMPLE_LOWERCASE_MAPPING

Added in API level 24
public static final int SIMPLE_LOWERCASE_MAPPING

String property Simple_Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(int).

Constant Value: 16391 (0x00004007)

SIMPLE_TITLECASE_MAPPING

Added in API level 24
public static final int SIMPLE_TITLECASE_MAPPING

String property Simple_Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(int).

Constant Value: 16392 (0x00004008)

SIMPLE_UPPERCASE_MAPPING

Added in API level 24
public static final int SIMPLE_UPPERCASE_MAPPING

String property Simple_Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(int).

Constant Value: 16393 (0x00004009)

SOFT_DOTTED

Added in API level 24
public static final int SOFT_DOTTED

Binary property Soft_Dotted (new).

Characters with a "soft dot", like i or j.

An accent placed on these characters causes the dot to disappear.

Constant Value: 27 (0x0000001b)

STRING_START

Added in API level 24
public static final int STRING_START

First constant for string Unicode properties.

Constant Value: 16384 (0x00004000)
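
The constant values in this interface are grouped into numeric ranges, and the *_START constants mark where each range begins. The sketch below classifies a selector by range. Only the STRING_START value comes from this entry; the other start values are copied from the same interface as an assumption and should be checked against the constant list. The sketch ignores the corresponding *_LIMIT constants, so it does not detect undefined selectors, and its class and method names are hypothetical.

```java
// Illustration only: range-based classification of UProperty selector values.
// The constants are hard-coded copies so that the example runs without android.icu.
class PropertyRangeSketch {
    static final int BINARY_START = 0x0000;
    static final int INT_START = 0x1000;
    static final int MASK_START = 0x2000;
    static final int DOUBLE_START = 0x3000;
    static final int STRING_START = 0x4000;
    static final int OTHER_PROPERTY_START = 0x7000;

    /** Returns a coarse label for the range that contains the selector. */
    static String kindOf(int property) {
        if (property < BINARY_START) return "invalid";
        if (property < INT_START) return "binary";
        if (property < MASK_START) return "int";
        if (property < DOUBLE_START) return "mask";
        if (property < STRING_START) return "double";
        if (property < OTHER_PROPERTY_START) return "string";
        return "other";
    }

    public static void main(String[] args) {
        System.out.println(kindOf(27));     // SOFT_DOTTED         -> binary
        System.out.println(kindOf(0x100a)); // SCRIPT              -> int
        System.out.println(kindOf(0x4006)); // SIMPLE_CASE_FOLDING -> string
        System.out.println(kindOf(0x7000)); // SCRIPT_EXTENSIONS   -> other
    }
}
```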

S_TERM

Added in API level 24
public static final int S_TERM

Binary property STerm (new in Unicode 4.0.1). Sentence Terminal. Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/).

Constant Value: 35 (0x00000023)

TERMINAL_PUNCTUATION

Added in API level 24
public static final int TERMINAL_PUNCTUATION

Binary property Terminal_Punctuation.

Punctuation characters that generally mark the end of textual units.

Constant Value: 28 (0x0000001c)

TITLECASE_MAPPING

Added in API level 24
public static final int TITLECASE_MAPPING

String property Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(String).

Constant Value: 16394 (0x0000400a)

TRAIL_CANONICAL_COMBINING_CLASS

Added in API level 24
public static final int TRAIL_CANONICAL_COMBINING_CLASS

Enumerated property Trail_Canonical_Combining_Class. ICU-specific property for the ccc of the last code point of the decomposition, or tccc(c)=ccc(NFD(c)[last]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.

Constant Value: 4113 (0x00001011)

UNIFIED_IDEOGRAPH

Added in API level 24
public static final int UNIFIED_IDEOGRAPH

Binary property Unified_Ideograph (new).

For programmatic determination of Ideographic Description Sequences.

Constant Value: 29 (0x0000001d)

UPPERCASE

Added in API level 24
public static final int UPPERCASE

Binary property Uppercase.

Same as UCharacter.isUUppercase(), different from UCharacter.isUpperCase().

Lu+Other_Uppercase

Constant Value: 30 (0x0000001e)
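
The "Lu+Other_Uppercase" definition can be seen on a character that is uppercase by property but not by general category. The sketch below uses java.lang.Character as a stand-in (an assumption; it is not the android.icu API). In the JDK, Character.isUpperCase(int) is documented to include Other_Uppercase, so it approximates the Uppercase property, while a general-category check corresponds to Lu alone. The class and method names are hypothetical.

```java
// Illustration only: contrasts general category Lu with the Uppercase property,
// using JDK methods rather than android.icu.
class UppercaseSketch {
    /** True if the general category is Lu (Uppercase_Letter). */
    static boolean isLu(int codePoint) {
        return Character.getType(codePoint) == Character.UPPERCASE_LETTER;
    }

    /** True if the JDK reports the code point as uppercase (Lu or Other_Uppercase). */
    static boolean hasUppercaseProperty(int codePoint) {
        return Character.isUpperCase(codePoint);
    }

    public static void main(String[] args) {
        int romanOne = 0x2160; // ROMAN NUMERAL ONE, general category Nl
        System.out.println(isLu('A') + " " + hasUppercaseProperty('A'));
        System.out.println(isLu(romanOne) + " " + hasUppercaseProperty(romanOne));
    }
}
```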

UPPERCASE_MAPPING

Added in API level 24
public static final int UPPERCASE_MAPPING

String property Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(String).

Constant Value: 16396 (0x0000400c)
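
The difference between SIMPLE_UPPERCASE_MAPPING and UPPERCASE_MAPPING is that the simple mapping always yields one code point, while the full mapping may yield several. The sketch below uses java.lang methods as stand-ins for the UCharacter ones (an assumption made so the example is self-contained); the class and method names are hypothetical.

```java
import java.util.Locale;

// Illustration only: simple (one code point) versus full (string) uppercase
// mapping, using JDK methods rather than android.icu.
class CaseMappingSketch {
    /** Simple mapping: one code point in, one code point out. */
    static int simpleUpper(int codePoint) {
        return Character.toUpperCase(codePoint);
    }

    /** Full mapping: the result may be longer than the input. */
    static String fullUpper(String s) {
        return s.toUpperCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        int sharpS = 0x00DF; // LATIN SMALL LETTER SHARP S
        // The simple mapping leaves U+00DF unchanged.
        System.out.printf("U+%04X%n", simpleUpper(sharpS)); // prints U+00DF
        // The full mapping expands it to two letters.
        System.out.println(fullUpper("\u00DF"));            // prints SS
    }
}
```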

VARIATION_SELECTOR

Added in API level 24
public static final int VARIATION_SELECTOR

Binary property Variation_Selector (new in Unicode 4.0.1). Indicates all those characters that qualify as Variation Selectors. For details on the behavior of these characters, see StandardizedVariants.html and 15.6 Variation Selectors.

Constant Value: 36 (0x00000024)

VERTICAL_ORIENTATION

Added in API level 29
public static final int VERTICAL_ORIENTATION

Enumerated property Vertical_Orientation. Used for UAX #50 Unicode Vertical Text Layout (https://www.unicode.org/reports/tr50/). New as a UCD property in Unicode 10.0.

Constant Value: 4120 (0x00001018)

WHITE_SPACE

Added in API level 24
public static final int WHITE_SPACE

Binary property White_Space.

Same as UCharacter.isUWhiteSpace(), different from UCharacter.isSpace() and UCharacter.isWhitespace(). Covers the space characters plus TAB, CR and LF, but not ZWSP or ZWNBSP.

Constant Value: 31 (0x0000001f)
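
The distinction between the White_Space property and a Java-style whitespace test shows up on the no-break space. The sketch below assumes the JDK's java.util.regex support for the \p{IsWhite_Space} binary property as a stand-in for UCharacter.isUWhiteSpace(); regex engines on other platforms may not accept that syntax. The class and method names are hypothetical.

```java
import java.util.regex.Pattern;

// Illustration only: compares the Unicode White_Space property (via JDK regex)
// with java.lang.Character.isWhitespace. No android.icu calls are made.
class WhiteSpaceSketch {
    private static final Pattern WHITE_SPACE = Pattern.compile("\\p{IsWhite_Space}");

    /** True if the single code point matches the White_Space binary property. */
    static boolean hasWhiteSpaceProperty(int codePoint) {
        String s = new String(Character.toChars(codePoint));
        return WHITE_SPACE.matcher(s).matches();
    }

    public static void main(String[] args) {
        int[] samples = {0x0009, 0x0020, 0x00A0, 0x200B, 0xFEFF};
        for (int cp : samples) {
            System.out.printf("U+%04X property=%b isWhitespace=%b%n",
                    cp, hasWhiteSpaceProperty(cp), Character.isWhitespace(cp));
        }
    }
}
```

In this sketch U+00A0 NO-BREAK SPACE has the property although Character.isWhitespace rejects it, and neither ZWSP (U+200B) nor ZWNBSP (U+FEFF) has the property.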

WORD_BREAK

Added in API level 24
public static final int WORD_BREAK

Enumerated property Word_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/). Returns UCharacter.WordBreak values.

Constant Value: 4116 (0x00001014)

XID_CONTINUE

Added in API level 24
public static final int XID_CONTINUE

Binary property XID_Continue.

ID_Continue modified to allow closure under normalization forms NFKC and NFKD.

Constant Value: 32 (0x00000020)

XID_START

Added in API level 24
public static final int XID_START

Binary property XID_Start.

ID_Start modified to allow closure under normalization forms NFKC and NFKD.

Constant Value: 33 (0x00000021)
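
"Closure under normalization" means that an identifier should still be an identifier after NFKC or NFKD is applied. The sketch below shows one character that motivates the XID variants: U+037A GREEK YPOGEGRAMMENI is a letter, but its NFKC form begins with a space. It uses java.text.Normalizer as a stand-in for ICU's normalization (an assumption), does not query XID_Start itself, and its class and method names are hypothetical.

```java
import java.text.Normalizer;

// Illustration only: a letter whose NFKC form does not start with an
// identifier character, which is the case the XID properties guard against.
class XidClosureSketch {
    /** Returns the NFKC form of a single code point. */
    static String nfkc(int codePoint) {
        String s = new String(Character.toChars(codePoint));
        return Normalizer.normalize(s, Normalizer.Form.NFKC);
    }

    public static void main(String[] args) {
        int ypogegrammeni = 0x037A;
        System.out.println(Character.isLetter(ypogegrammeni)); // true (category Lm)
        StringBuilder sb = new StringBuilder();
        nfkc(ypogegrammeni).codePoints()
                .forEach(cp -> sb.append(String.format("U+%04X ", cp)));
        // The first code point of the NFKC form is U+0020 SPACE.
        System.out.println(sb.toString().trim());
    }
}
```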