UProperty
interface UProperty
android.icu.lang.UProperty
Selection constants for Unicode properties.
These constants are used in functions like UCharacter.hasBinaryProperty(int) to select one of the Unicode properties.
The properties APIs are intended to reflect Unicode properties as defined in the Unicode Character Database (UCD) and Unicode Technical Reports (UTR).
For details about the properties see UAX #44: Unicode Character Database.
Important: If ICU is built with UCD files from Unicode versions below 3.2, properties marked "new" are unavailable or only partially available. Check UCharacter.getUnicodeVersion() to be sure.
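The version check in the note above can be sketched in plain Kotlin. This is only an illustration: `UnicodeVersion`, `NEW_PROPERTIES_SINCE`, and `supportsNewProperties` are hypothetical names, not part of android.icu. The sketch assumes that comparing major and minor version numbers is enough to decide availability.

```kotlin
// Hypothetical value type for a Unicode version; only major and minor are compared.
data class UnicodeVersion(val major: Int, val minor: Int) : Comparable<UnicodeVersion> {
    override fun compareTo(other: UnicodeVersion): Int =
        compareValuesBy(this, other, { it.major }, { it.minor })
}

// Threshold from the note above: properties marked "new" need Unicode 3.2 or later.
val NEW_PROPERTIES_SINCE = UnicodeVersion(3, 2)

// Returns true when the given Unicode version is at least 3.2.
fun supportsNewProperties(version: UnicodeVersion): Boolean =
    version >= NEW_PROPERTIES_SINCE

fun main() {
    println(supportsNewProperties(UnicodeVersion(3, 1)))  // false
    println(supportsNewProperties(UnicodeVersion(3, 2)))  // true
    println(supportsNewProperties(UnicodeVersion(15, 0))) // true
}
```

On a device, the major and minor numbers would presumably come from the VersionInfo returned by UCharacter.getUnicodeVersion(). Properties annotated with a later version (for example "new in Unicode 4.1") would need a correspondingly higher threshold.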
Summary
Nested classes | |
---|---|
abstract | NameChoice: Selector constants for UCharacter.getPropertyName() and UCharacter.getPropertyValueName(). |
Constants | |
---|---|
static Int | AGE: String property Age. |
static Int | ALPHABETIC: Binary property Alphabetic. |
static Int | ASCII_HEX_DIGIT: Binary property ASCII_Hex_Digit (0-9 A-F a-f). |
static Int | BASIC_EMOJI: Binary property of strings Basic_Emoji. |
static Int | BIDI_CLASS: Enumerated property Bidi_Class. |
static Int | BIDI_CONTROL: Binary property Bidi_Control. |
static Int | BIDI_MIRRORED: Binary property Bidi_Mirrored. |
static Int | BIDI_MIRRORING_GLYPH: String property Bidi_Mirroring_Glyph. |
static Int | BIDI_PAIRED_BRACKET: String property Bidi_Paired_Bracket (new in Unicode 6.3). |
static Int | BIDI_PAIRED_BRACKET_TYPE: Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3). |
static Int | BINARY_START: First constant for binary Unicode properties. |
static Int | BLOCK: Enumerated property Block. |
static Int | CANONICAL_COMBINING_CLASS: Enumerated property Canonical_Combining_Class. |
static Int | CASED: Binary property Cased. |
static Int | CASE_FOLDING: String property Case_Folding. |
static Int | CASE_IGNORABLE: Binary property Case_Ignorable. |
static Int | CASE_SENSITIVE: Binary property Case_Sensitive. |
static Int | CHANGES_WHEN_CASEFOLDED: Binary property Changes_When_Casefolded. |
static Int | CHANGES_WHEN_CASEMAPPED: Binary property Changes_When_Casemapped. |
static Int | CHANGES_WHEN_LOWERCASED: Binary property Changes_When_Lowercased. |
static Int | CHANGES_WHEN_NFKC_CASEFOLDED: Binary property Changes_When_NFKC_Casefolded. |
static Int | CHANGES_WHEN_TITLECASED: Binary property Changes_When_Titlecased. |
static Int | CHANGES_WHEN_UPPERCASED: Binary property Changes_When_Uppercased. |
static Int | DASH: Binary property Dash. |
static Int | DECOMPOSITION_TYPE: Enumerated property Decomposition_Type. |
static Int | DEFAULT_IGNORABLE_CODE_POINT: Binary property Default_Ignorable_Code_Point (new). |
static Int | DEPRECATED: Binary property Deprecated (new). |
static Int | DIACRITIC: Binary property Diacritic. |
static Int | DOUBLE_START: First constant for double Unicode properties. |
static Int | EAST_ASIAN_WIDTH: Enumerated property East_Asian_Width. |
static Int | EMOJI: Binary property Emoji. |
static Int | EMOJI_COMPONENT: Binary property Emoji_Component. |
static Int | EMOJI_KEYCAP_SEQUENCE: Binary property of strings Emoji_Keycap_Sequence. |
static Int | EMOJI_MODIFIER: Binary property Emoji_Modifier. |
static Int | EMOJI_MODIFIER_BASE: Binary property Emoji_Modifier_Base. |
static Int | EMOJI_PRESENTATION: Binary property Emoji_Presentation. |
static Int | EXTENDED_PICTOGRAPHIC: Binary property Extended_Pictographic. |
static Int | EXTENDER: Binary property Extender. |
static Int | FULL_COMPOSITION_EXCLUSION: Binary property Full_Composition_Exclusion. |
static Int | GENERAL_CATEGORY: Enumerated property General_Category. |
static Int | GENERAL_CATEGORY_MASK: Bitmask property General_Category_Mask. |
static Int | GRAPHEME_BASE: Binary property Grapheme_Base (new). |
static Int | GRAPHEME_CLUSTER_BREAK: Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). |
static Int | GRAPHEME_EXTEND: Binary property Grapheme_Extend (new). |
static Int | GRAPHEME_LINK: Binary property Grapheme_Link (new). |
static Int | HANGUL_SYLLABLE_TYPE: Enumerated property Hangul_Syllable_Type (new in Unicode 4.0). |
static Int | HEX_DIGIT: Binary property Hex_Digit. |
static Int | HYPHEN: Binary property Hyphen. |
static Int | IDEOGRAPHIC: Binary property Ideographic. |
static Int | IDS_BINARY_OPERATOR: Binary property IDS_Binary_Operator (new). |
static Int | IDS_TRINARY_OPERATOR: Binary property IDS_Trinary_Operator (new). |
static Int | IDS_UNARY_OPERATOR: Binary property IDS_Unary_Operator. |
static Int | ID_COMPAT_MATH_CONTINUE: Binary property ID_Compat_Math_Continue. |
static Int | ID_COMPAT_MATH_START: Binary property ID_Compat_Math_Start. |
static Int | ID_CONTINUE: Binary property ID_Continue. |
static Int | ID_START: Binary property ID_Start. |
static Int | INDIC_POSITIONAL_CATEGORY: Enumerated property Indic_Positional_Category. |
static Int | INDIC_SYLLABIC_CATEGORY: Enumerated property Indic_Syllabic_Category. |
static Int | INT_START: First constant for enumerated/integer Unicode properties. |
static Int | JOINING_GROUP: Enumerated property Joining_Group. |
static Int | JOINING_TYPE: Enumerated property Joining_Type. |
static Int | JOIN_CONTROL: Binary property Join_Control. |
static Int | LEAD_CANONICAL_COMBINING_CLASS: Enumerated property Lead_Canonical_Combining_Class. |
static Int | LINE_BREAK: Enumerated property Line_Break. |
static Int | LOGICAL_ORDER_EXCEPTION: Binary property Logical_Order_Exception (new). |
static Int | LOWERCASE: Binary property Lowercase. |
static Int | LOWERCASE_MAPPING: String property Lowercase_Mapping. |
static Int | MASK_START: First constant for bit-mask Unicode properties. |
static Int | MATH: Binary property Math. |
static Int | NAME: String property Name. |
static Int | NFC_INERT: Binary property NFC_Inert. |
static Int | NFC_QUICK_CHECK: Enumerated property NFC_Quick_Check. |
static Int | NFD_INERT: Binary property NFD_Inert. |
static Int | NFD_QUICK_CHECK: Enumerated property NFD_Quick_Check. |
static Int | NFKC_INERT: Binary property NFKC_Inert. |
static Int | NFKC_QUICK_CHECK: Enumerated property NFKC_Quick_Check. |
static Int | NFKD_INERT: Binary property NFKD_Inert. |
static Int | NFKD_QUICK_CHECK: Enumerated property NFKD_Quick_Check. |
static Int | NONCHARACTER_CODE_POINT: Binary property Noncharacter_Code_Point. |
static Int | NUMERIC_TYPE: Enumerated property Numeric_Type. |
static Int | NUMERIC_VALUE: Double property Numeric_Value. |
static Int | OTHER_PROPERTY_START: First constant for Unicode properties with unusual value types. |
static Int | PATTERN_SYNTAX: Binary property Pattern_Syntax (new in Unicode 4.1). |
static Int | PATTERN_WHITE_SPACE: Binary property Pattern_White_Space (new in Unicode 4.1). |
static Int | POSIX_ALNUM: Binary property alnum (a C/POSIX character class). |
static Int | POSIX_BLANK: Binary property blank (a C/POSIX character class). |
static Int | POSIX_GRAPH: Binary property graph (a C/POSIX character class). |
static Int | POSIX_PRINT: Binary property print (a C/POSIX character class). |
static Int | POSIX_XDIGIT: Binary property xdigit (a C/POSIX character class). |
static Int | PREPENDED_CONCATENATION_MARK: Binary property Prepended_Concatenation_Mark. |
static Int | QUOTATION_MARK: Binary property Quotation_Mark. |
static Int | RADICAL: Binary property Radical (new). |
static Int | REGIONAL_INDICATOR: Binary property Regional_Indicator. |
static Int | RGI_EMOJI: Binary property of strings RGI_Emoji. |
static Int | RGI_EMOJI_FLAG_SEQUENCE: Binary property of strings RGI_Emoji_Flag_Sequence. |
static Int | RGI_EMOJI_MODIFIER_SEQUENCE: Binary property of strings RGI_Emoji_Modifier_Sequence. |
static Int | RGI_EMOJI_TAG_SEQUENCE: Binary property of strings RGI_Emoji_Tag_Sequence. |
static Int | RGI_EMOJI_ZWJ_SEQUENCE: Binary property of strings RGI_Emoji_ZWJ_Sequence. |
static Int | SCRIPT: Enumerated property Script. |
static Int | SCRIPT_EXTENSIONS: Miscellaneous property Script_Extensions (new in Unicode 6.0). |
static Int | SEGMENT_STARTER: Binary property Segment_Starter. |
static Int | SENTENCE_BREAK: Enumerated property Sentence_Break (new in Unicode 4.1). |
static Int | SIMPLE_CASE_FOLDING: String property Simple_Case_Folding. |
static Int | SIMPLE_LOWERCASE_MAPPING: String property Simple_Lowercase_Mapping. |
static Int | SIMPLE_TITLECASE_MAPPING: String property Simple_Titlecase_Mapping. |
static Int | SIMPLE_UPPERCASE_MAPPING: String property Simple_Uppercase_Mapping. |
static Int | SOFT_DOTTED: Binary property Soft_Dotted (new). |
static Int | STRING_START: First constant for string Unicode properties. |
static Int | S_TERM: Binary property STerm (new in Unicode 4.0.1). |
static Int | TERMINAL_PUNCTUATION: Binary property Terminal_Punctuation. |
static Int | TITLECASE_MAPPING: String property Titlecase_Mapping. |
static Int | TRAIL_CANONICAL_COMBINING_CLASS: Enumerated property Trail_Canonical_Combining_Class. |
static Int | UNIFIED_IDEOGRAPH: Binary property Unified_Ideograph (new). |
static Int | UPPERCASE: Binary property Uppercase. |
static Int | UPPERCASE_MAPPING: String property Uppercase_Mapping. |
static Int | VARIATION_SELECTOR: Binary property Variation_Selector (new in Unicode 4.0.1). |
static Int | VERTICAL_ORIENTATION: Enumerated property Vertical_Orientation. |
static Int | WHITE_SPACE: Binary property White_Space. |
static Int | WORD_BREAK: Enumerated property Word_Break (new in Unicode 4.1). |
static Int | XID_CONTINUE: Binary property XID_Continue. |
static Int | XID_START: Binary property XID_Start. |
Constants
AGE
static val AGE: Int
String property Age. Corresponds to UCharacter.getAge(int).
Value: 16384
ALPHABETIC
static val ALPHABETIC: Int
Binary property Alphabetic.
Property for UCharacter.isUAlphabetic(), different from the property in UCharacter.isalpha().
Lu + Ll + Lt + Lm + Lo + Nl + Other_Alphabetic.
Value: 0
ASCII_HEX_DIGIT
static val ASCII_HEX_DIGIT: Int
Binary property ASCII_Hex_Digit (0-9 A-F a-f).
Value: 1
BASIC_EMOJI
static val BASIC_EMOJI: Int
Binary property of strings Basic_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 65
BIDI_CLASS
static val BIDI_CLASS: Int
Enumerated property Bidi_Class. Same as UCharacter.getDirection(int), returns UCharacterDirection values.
Value: 4096
BIDI_CONTROL
static val BIDI_CONTROL: Int
Binary property Bidi_Control.
Format controls which have specific functions in the Bidi Algorithm.
Value: 2
BIDI_MIRRORED
static val BIDI_MIRRORED: Int
Binary property Bidi_Mirrored.
Characters that may change display in RTL text.
Property for UCharacter.isMirrored().
See Bidi Algorithm; UTR 9.
Value: 3
BIDI_MIRRORING_GLYPH
static val BIDI_MIRRORING_GLYPH: Int
String property Bidi_Mirroring_Glyph. Corresponds to UCharacter.getMirror(int).
Value: 16385
BIDI_PAIRED_BRACKET
static val BIDI_PAIRED_BRACKET: Int
String property Bidi_Paired_Bracket (new in Unicode 6.3). Corresponds to UCharacter.getBidiPairedBracket.
Value: 16397
BIDI_PAIRED_BRACKET_TYPE
static val BIDI_PAIRED_BRACKET_TYPE: Int
Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3). Used in UAX #9: Unicode Bidirectional Algorithm (http://www.unicode.org/reports/tr9/). Returns UCharacter.BidiPairedBracketType values.
Value: 4117
BINARY_START
static val BINARY_START: Int
First constant for binary Unicode properties.
Value: 0
BLOCK
static val BLOCK: Int
Enumerated property Block. Same as UCharacter.UnicodeBlock.of(int), returns UCharacter.UnicodeBlock values.
Value: 4097
CANONICAL_COMBINING_CLASS
static val CANONICAL_COMBINING_CLASS: Int
Enumerated property Canonical_Combining_Class. Same as UCharacter.getCombiningClass(int), returns 8-bit numeric values.
Value: 4098
CASED
static val CASED: Int
Binary property Cased. For Lowercase, Uppercase and Titlecase characters.
Value: 49
CASE_FOLDING
static val CASE_FOLDING: Int
String property Case_Folding. Corresponds to UCharacter.foldCase(String, boolean).
Value: 16386
CASE_IGNORABLE
static val CASE_IGNORABLE: Int
Binary property Case_Ignorable. Used in context-sensitive case mappings.
Value: 50
CASE_SENSITIVE
static val CASE_SENSITIVE: Int
Binary property Case_Sensitive.
Characters that are either the source of a case mapping or in the target of a case mapping. Not the same as the general category Cased_Letter.
Value: 34
CHANGES_WHEN_CASEFOLDED
static val CHANGES_WHEN_CASEFOLDED: Int
Binary property Changes_When_Casefolded.
Value: 54
CHANGES_WHEN_CASEMAPPED
static val CHANGES_WHEN_CASEMAPPED: Int
Binary property Changes_When_Casemapped.
Value: 55
CHANGES_WHEN_LOWERCASED
static val CHANGES_WHEN_LOWERCASED: Int
Binary property Changes_When_Lowercased.
Value: 51
CHANGES_WHEN_NFKC_CASEFOLDED
static val CHANGES_WHEN_NFKC_CASEFOLDED: Int
Binary property Changes_When_NFKC_Casefolded.
Value: 56
CHANGES_WHEN_TITLECASED
static val CHANGES_WHEN_TITLECASED: Int
Binary property Changes_When_Titlecased.
Value: 53
CHANGES_WHEN_UPPERCASED
static val CHANGES_WHEN_UPPERCASED: Int
Binary property Changes_When_Uppercased.
Value: 52
DASH
static val DASH: Int
Binary property Dash.
Variations of dashes.
Value: 4
DECOMPOSITION_TYPE
static val DECOMPOSITION_TYPE: Int
Enumerated property Decomposition_Type. Returns UCharacter.DecompositionType values.
Value: 4099
DEFAULT_IGNORABLE_CODE_POINT
static val DEFAULT_IGNORABLE_CODE_POINT: Int
Binary property Default_Ignorable_Code_Point (new).
Indicates that a code point is ignorable in most processing.
Codepoints (2060..206F, FFF0..FFFB, E0000..E0FFF) + Other_Default_Ignorable_Code_Point + (Cf + Cc + Cs - White_Space)
Value: 5
DEPRECATED
static val DEPRECATED: Int
Binary property Deprecated (new).
The usage of deprecated characters is strongly discouraged.
Value: 6
DIACRITIC
static val DIACRITIC: Int
Binary property Diacritic.
Characters that linguistically modify the meaning of another character to which they apply.
Value: 7
DOUBLE_START
static val DOUBLE_START: Int
First constant for double Unicode properties.
Value: 12288
EAST_ASIAN_WIDTH
static val EAST_ASIAN_WIDTH: Int
Enumerated property East_Asian_Width. See http://www.unicode.org/reports/tr11/. Returns UCharacter.EastAsianWidth values.
Value: 4100
EMOJI
static val EMOJI: Int
Binary property Emoji. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 57
EMOJI_COMPONENT
static val EMOJI_COMPONENT: Int
Binary property Emoji_Component. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 61
EMOJI_KEYCAP_SEQUENCE
static val EMOJI_KEYCAP_SEQUENCE: Int
Binary property of strings Emoji_Keycap_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 66
EMOJI_MODIFIER
static val EMOJI_MODIFIER: Int
Binary property Emoji_Modifier. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 59
EMOJI_MODIFIER_BASE
static val EMOJI_MODIFIER_BASE: Int
Binary property Emoji_Modifier_Base. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 60
EMOJI_PRESENTATION
static val EMOJI_PRESENTATION: Int
Binary property Emoji_Presentation. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 58
EXTENDED_PICTOGRAPHIC
static val EXTENDED_PICTOGRAPHIC: Int
Binary property Extended_Pictographic. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 64
EXTENDER
static val EXTENDER: Int
Binary property Extender.
Characters that extend the value or shape of a preceding alphabetic character, e.g. length and iteration marks.
Value: 8
FULL_COMPOSITION_EXCLUSION
static val FULL_COMPOSITION_EXCLUSION: Int
Binary property Full_Composition_Exclusion.
CompositionExclusions.txt + Singleton Decompositions + Non-Starter Decompositions.
Value: 9
GENERAL_CATEGORY
static val GENERAL_CATEGORY: Int
Enumerated property General_Category. Same as UCharacter.getType(int), returns UCharacterCategory values.
Value: 4101
GENERAL_CATEGORY_MASK
static val GENERAL_CATEGORY_MASK: Int
Bitmask property General_Category_Mask. This is the General_Category property returned as a bit mask. When used in UCharacter.getIntPropertyValue(c), returns bit masks for UCharacterCategory values where exactly one bit is set. When used with UCharacter.getPropertyValueName() and UCharacter.getPropertyValueEnum(), a multi-bit mask is used for sets of categories like "Letters".
Value: 8192
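The difference between a single-bit mask and a multi-bit mask can be sketched in plain Kotlin. This is a minimal illustration under two assumptions that this page does not state: that the mask for one category is `1 shl category`, and that the letter categories carry the numeric values 1 through 5 as in UCharacterCategory. The names `categoryMask`, `LETTER_MASK`, and `isInSet` are hypothetical.

```kotlin
// Assumed category numbers, mirroring UCharacterCategory; not taken from this page.
const val UPPERCASE_LETTER = 1
const val LOWERCASE_LETTER = 2
const val TITLECASE_LETTER = 3
const val MODIFIER_LETTER = 4
const val OTHER_LETTER = 5
const val DECIMAL_DIGIT_NUMBER = 9

// Assumption: the mask for one category has only bit number `category` set.
fun categoryMask(category: Int): Int = 1 shl category

// Multi-bit mask for the set "Letters" (Lu + Ll + Lt + Lm + Lo).
val LETTER_MASK: Int =
    listOf(UPPERCASE_LETTER, LOWERCASE_LETTER, TITLECASE_LETTER, MODIFIER_LETTER, OTHER_LETTER)
        .fold(0) { acc, category -> acc or categoryMask(category) }

// True when the single-bit mask of a character falls inside the multi-bit set mask.
fun isInSet(singleBitMask: Int, setMask: Int): Boolean = (singleBitMask and setMask) != 0

fun main() {
    val lower = categoryMask(LOWERCASE_LETTER)
    println(lower.countOneBits())                                      // 1
    println(isInSet(lower, LETTER_MASK))                               // true
    println(isInSet(categoryMask(DECIMAL_DIGIT_NUMBER), LETTER_MASK))  // false
}
```

On a device, the single-bit mask would come from UCharacter.getIntPropertyValue(codePoint, UProperty.GENERAL_CATEGORY_MASK) rather than from `categoryMask`.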
GRAPHEME_BASE
static val GRAPHEME_BASE: Int
Binary property Grapheme_Base (new).
For programmatic determination of grapheme cluster boundaries. [0..10FFFF]-Cc-Cf-Cs-Co-Cn-Zl-Zp-Grapheme_Link-Grapheme_Extend-CGJ
Value: 10
GRAPHEME_CLUSTER_BREAK
static val GRAPHEME_CLUSTER_BREAK: Int
Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/). Returns UCharacter.GraphemeClusterBreak values.
Value: 4114
GRAPHEME_EXTEND
static val GRAPHEME_EXTEND: Int
Binary property Grapheme_Extend (new).
For programmatic determination of grapheme cluster boundaries.
Me+Mn+Mc+Other_Grapheme_Extend-Grapheme_Link-CGJ
Value: 11
GRAPHEME_LINK
static val GRAPHEME_LINK: Int
Binary property Grapheme_Link (new).
For programmatic determination of grapheme cluster boundaries.
Value: 12
HANGUL_SYLLABLE_TYPE
static val HANGUL_SYLLABLE_TYPE: Int
Enumerated property Hangul_Syllable_Type (new in Unicode 4.0). Returns UCharacter.HangulSyllableType values.
Value: 4107
HEX_DIGIT
static val HEX_DIGIT: Int
Binary property Hex_Digit.
Characters commonly used for hexadecimal numbers.
Value: 13
HYPHEN
static val HYPHEN: Int
Binary property Hyphen.
Dashes used to mark connections between pieces of words, plus the Katakana middle dot.
Value: 14
IDEOGRAPHIC
static val IDEOGRAPHIC: Int
Binary property Ideographic.
CJKV ideographs.
Value: 17
IDS_BINARY_OPERATOR
static val IDS_BINARY_OPERATOR: Int
Binary property IDS_Binary_Operator (new).
For programmatic determination of Ideographic Description Sequences.
Value: 18
IDS_TRINARY_OPERATOR
static val IDS_TRINARY_OPERATOR: Int
Binary property IDS_Trinary_Operator (new).
For programmatic determination of Ideographic Description Sequences.
Value: 19
IDS_UNARY_OPERATOR
static val IDS_UNARY_OPERATOR: Int
Binary property IDS_Unary_Operator. For programmatic determination of Ideographic Description Sequences.
Value: 72
ID_COMPAT_MATH_CONTINUE
static val ID_COMPAT_MATH_CONTINUE: Int
Binary property ID_Compat_Math_Continue.
Used in mathematical identifier profile in UAX #31.
Value: 74
ID_COMPAT_MATH_START
static val ID_COMPAT_MATH_START: Int
Binary property ID_Compat_Math_Start.
Used in mathematical identifier profile in UAX #31.
Value: 73
ID_CONTINUE
static val ID_CONTINUE: Int
Binary property ID_Continue.
Characters that can continue an identifier.
ID_Start+Mn+Mc+Nd+Pc
Value: 15
ID_START
static val ID_START: Int
Binary property ID_Start.
Characters that can start an identifier.
Lu+Ll+Lt+Lm+Lo+Nl
Value: 16
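ID_START and ID_CONTINUE are typically used together: the first code point of an identifier must satisfy the start property and every later one the continue property. The sketch below shows that pattern in plain Kotlin. `isValidIdentifier` is a hypothetical helper, and the predicates are passed in so the code runs without android.icu. In `main`, the java.lang.Character methods are stand-ins whose results may differ from the ICU properties.

```kotlin
// Hypothetical helper: the first code point must satisfy `isStart`,
// every later code point must satisfy `isContinue`.
fun isValidIdentifier(
    text: String,
    isStart: (Int) -> Boolean,
    isContinue: (Int) -> Boolean
): Boolean {
    if (text.isEmpty()) return false
    var index = 0
    var first = true
    while (index < text.length) {
        // Iterate by code point so supplementary characters are handled as one unit.
        val codePoint = text.codePointAt(index)
        val accepted = if (first) isStart(codePoint) else isContinue(codePoint)
        if (!accepted) return false
        first = false
        index += Character.charCount(codePoint)
    }
    return true
}

fun main() {
    // Stand-in predicates so the sketch runs off-device (an assumption, not the ICU data).
    val start: (Int) -> Boolean = { Character.isUnicodeIdentifierStart(it) }
    val cont: (Int) -> Boolean = { Character.isUnicodeIdentifierPart(it) }
    println(isValidIdentifier("value_1", start, cont)) // true
    println(isValidIdentifier("1value", start, cont))  // false
}
```

On a device, the predicates would instead be something like `{ cp -> UCharacter.hasBinaryProperty(cp, UProperty.ID_START) }` and the equivalent for UProperty.ID_CONTINUE.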
INDIC_POSITIONAL_CATEGORY
static val INDIC_POSITIONAL_CATEGORY: Int
Enumerated property Indic_Positional_Category. New in Unicode 6.0 as provisional property Indic_Matra_Category; renamed and changed to informative in Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicPositionalCategory.txt
Value: 4118
INDIC_SYLLABIC_CATEGORY
static val INDIC_SYLLABIC_CATEGORY: Int
Enumerated property Indic_Syllabic_Category. New in Unicode 6.0 as provisional; informative since Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicSyllabicCategory.txt
Value: 4119
INT_START
static val INT_START: Int
First constant for enumerated/integer Unicode properties.
Value: 4096
JOINING_GROUP
static val JOINING_GROUP: Int
Enumerated property Joining_Group. Returns UCharacter.JoiningGroup values.
Value: 4102
JOINING_TYPE
static val JOINING_TYPE: Int
Enumerated property Joining_Type. Returns UCharacter.JoiningType values.
Value: 4103
JOIN_CONTROL
static val JOIN_CONTROL: Int
Binary property Join_Control.
Format controls for cursive joining and ligation.
Value: 20
LEAD_CANONICAL_COMBINING_CLASS
static val LEAD_CANONICAL_COMBINING_CLASS: Int
Enumerated property Lead_Canonical_Combining_Class. ICU-specific property for the ccc of the first code point of the decomposition, or lccc(c)=ccc(NFD(c)[0]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.
Value: 4112
LINE_BREAK
static val LINE_BREAK: Int
Enumerated property Line_Break. Returns UCharacter.LineBreak values.
Value: 4104
LOGICAL_ORDER_EXCEPTION
static val LOGICAL_ORDER_EXCEPTION: Int
Binary property Logical_Order_Exception (new).
Characters that do not use logical order and require special handling in most processing.
Value: 21
LOWERCASE
static val LOWERCASE: Int
Binary property Lowercase.
Same as UCharacter.isULowercase(), different from UCharacter.islower().
Ll+Other_Lowercase
Value: 22
LOWERCASE_MAPPING
static val LOWERCASE_MAPPING: Int
String property Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(String).
Value: 16388
MASK_START
static val MASK_START: Int
First constant for bit-mask Unicode properties.
Value: 8192
MATH
static val MATH: Int
Binary property Math.
Sm+Other_Math
Value: 23
NAME
static val NAME: Int
String property Name. Corresponds to UCharacter.getName(int).
Value: 16389
NFC_INERT
static val NFC_INERT: Int
Binary property NFC_Inert. ICU-specific property for characters that are inert under NFC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
Value: 39
See also NFD_INERT for how the inert properties are computed.
NFC_QUICK_CHECK
static val NFC_QUICK_CHECK: Int
Enumerated property NFC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
Value: 4110
NFD_INERT
static val NFD_INERT: Int
Binary property NFD_Inert. ICU-specific property for characters that are inert under NFD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
There is one such property per normalization form. These properties are computed as follows. An inert character is:
a) unassigned, or ALL of the following:
b) of combining class 0.
c) not decomposed by this normalization form.
AND if NFC or NFKC,
d) can never compose with a previous character.
e) can never compose with a following character.
f) can never change if another character is added.
Example: a-breve might satisfy all but f, but if you add an ogonek it changes to a-ogonek + breve.
See also com.ibm.text.UCD.NFSkippable in the ICU4J repository, and icu/source/common/unormimp.h.
Value: 37
NFD_QUICK_CHECK
static val NFD_QUICK_CHECK: Int
Enumerated property NFD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
Value: 4108
NFKC_INERT
static val NFKC_INERT: Int
Binary property NFKC_Inert. ICU-specific property for characters that are inert under NFKC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
Value: 40
See also NFD_INERT for how the inert properties are computed.
NFKC_QUICK_CHECK
static val NFKC_QUICK_CHECK: Int
Enumerated property NFKC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
Value: 4111
NFKD_INERT
static val NFKD_INERT: Int
Binary property NFKD_Inert. ICU-specific property for characters that are inert under NFKD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
Value: 38
See also NFD_INERT for how the inert properties are computed.
NFKD_QUICK_CHECK
static val NFKD_QUICK_CHECK: Int
Enumerated property NFKD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
Value: 4109
NONCHARACTER_CODE_POINT
static val NONCHARACTER_CODE_POINT: Int
Binary property Noncharacter_Code_Point.
Code points that are explicitly defined as illegal for the encoding of characters.
Value: 24
NUMERIC_TYPE
static val NUMERIC_TYPE: Int
Enumerated property Numeric_Type. Returns UCharacter.NumericType values.
Value: 4105
NUMERIC_VALUE
static val NUMERIC_VALUE: Int
Double property Numeric_Value. Corresponds to UCharacter.getUnicodeNumericValue(int).
Value: 12288
OTHER_PROPERTY_START
static val OTHER_PROPERTY_START: Int
First constant for Unicode properties with unusual value types.
Value: 28672
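The *_START constants partition the selector values by property type. As a minimal sketch in plain Kotlin, with the values copied from this page rather than read from android.icu.lang.UProperty, a selector can be classified by the range it falls in. `PropertyKind` and `kindOf` are hypothetical names. The sketch assumes each range runs up to the next *_START value, which this page does not state explicitly.

```kotlin
// Values copied from the constant listings on this page.
const val BINARY_START = 0
const val INT_START = 4096
const val MASK_START = 8192
const val DOUBLE_START = 12288
const val STRING_START = 16384
const val OTHER_PROPERTY_START = 28672

// Hypothetical helper type, not part of android.icu.
enum class PropertyKind { BINARY, INT, MASK, DOUBLE, STRING, OTHER }

// Classifies a selector by the *_START range it falls in.
// Assumption: each range extends up to the next *_START value.
fun kindOf(selector: Int): PropertyKind {
    require(selector >= BINARY_START) { "negative selector: $selector" }
    return when {
        selector < INT_START -> PropertyKind.BINARY
        selector < MASK_START -> PropertyKind.INT
        selector < DOUBLE_START -> PropertyKind.MASK
        selector < STRING_START -> PropertyKind.DOUBLE
        selector < OTHER_PROPERTY_START -> PropertyKind.STRING
        else -> PropertyKind.OTHER
    }
}

fun main() {
    println(kindOf(0))      // ALPHABETIC          -> BINARY
    println(kindOf(4106))   // SCRIPT              -> INT
    println(kindOf(8192))   // GENERAL_CATEGORY_MASK -> MASK
    println(kindOf(12288))  // NUMERIC_VALUE       -> DOUBLE
    println(kindOf(16389))  // NAME                -> STRING
    println(kindOf(28672))  // SCRIPT_EXTENSIONS   -> OTHER
}
```

The classification says only which range a value falls in. It does not check that the value is a defined property selector.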
PATTERN_SYNTAX
static val PATTERN_SYNTAX: Int
Binary property Pattern_Syntax (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/).
Value: 42
PATTERN_WHITE_SPACE
static val PATTERN_WHITE_SPACE: Int
Binary property Pattern_White_Space (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/).
Value: 43
POSIX_ALNUM
static val POSIX_ALNUM: Int
Binary property alnum (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 44
POSIX_BLANK
static val POSIX_BLANK: Int
Binary property blank (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 45
POSIX_GRAPH
static val POSIX_GRAPH: Int
Binary property graph (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 46
POSIX_PRINT
static val POSIX_PRINT: Int
Binary property print (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 47
POSIX_XDIGIT
static val POSIX_XDIGIT: Int
Binary property xdigit (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 48
PREPENDED_CONCATENATION_MARK
static val PREPENDED_CONCATENATION_MARK: Int
Binary property Prepended_Concatenation_Mark.
Value: 63
QUOTATION_MARK
static val QUOTATION_MARK: Int
Binary property Quotation_Mark.
Value: 25
RADICAL
static val RADICAL: Int
Binary property Radical (new).
For programmatic determination of Ideographic Description Sequences.
Value: 26
REGIONAL_INDICATOR
static val REGIONAL_INDICATOR: Int
Binary property Regional_Indicator.
Value: 62
RGI_EMOJI
static val RGI_EMOJI: Int
Binary property of strings RGI_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 71
RGI_EMOJI_FLAG_SEQUENCE
static val RGI_EMOJI_FLAG_SEQUENCE: Int
Binary property of strings RGI_Emoji_Flag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 68
RGI_EMOJI_MODIFIER_SEQUENCE
static val RGI_EMOJI_MODIFIER_SEQUENCE: Int
Binary property of strings RGI_Emoji_Modifier_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 67
RGI_EMOJI_TAG_SEQUENCE
static val RGI_EMOJI_TAG_SEQUENCE: Int
Binary property of strings RGI_Emoji_Tag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 69
RGI_EMOJI_ZWJ_SEQUENCE
static val RGI_EMOJI_ZWJ_SEQUENCE: Int
Binary property of strings RGI_Emoji_ZWJ_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 70
SCRIPT
static val SCRIPT: Int
Enumerated property Script. Same as UScript.getScript(int), returns UScript values.
Value: 4106
SCRIPT_EXTENSIONS
static val SCRIPT_EXTENSIONS: Int
Miscellaneous property Script_Extensions (new in Unicode 6.0). Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/. Corresponds to UScript.hasScript and UScript.getScriptExtensions.
Value: 28672
SEGMENT_STARTER
static val SEGMENT_STARTER: Int
Binary property Segment_Starter. ICU-specific property for characters that are starters in terms of Unicode normalization and combining character sequences. They have ccc=0 and do not occur in non-initial position of the canonical decomposition of any character (as, for example, the combining diaeresis does in NFD(a-umlaut) and a Jamo T does in NFD(Hangul LVT)). ICU uses this property for segmenting a string for generating a set of canonically equivalent strings, e.g. for canonical closure while processing collation tailoring rules.
Value: 41
SENTENCE_BREAK
static val SENTENCE_BREAK: Int
Enumerated property Sentence_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/). Returns UCharacter.SentenceBreak values.
Value: 4115
SIMPLE_CASE_FOLDING
static val SIMPLE_CASE_FOLDING: Int
String property Simple_Case_Folding. Corresponds to UCharacter.foldCase(int, boolean).
Value: 16390
SIMPLE_LOWERCASE_MAPPING
static val SIMPLE_LOWERCASE_MAPPING: Int
String property Simple_Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(int).
Value: 16391
SIMPLE_TITLECASE_MAPPING
static val SIMPLE_TITLECASE_MAPPING: Int
String property Simple_Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(int).
Value: 16392
SIMPLE_UPPERCASE_MAPPING
static val SIMPLE_UPPERCASE_MAPPING: Int
String property Simple_Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(int).
Value: 16393
SOFT_DOTTED
static val SOFT_DOTTED: Int
Binary property Soft_Dotted (new).
Characters with a "soft dot", like i or j.
An accent placed on these characters causes the dot to disappear.
Value: 27
STRING_START
static val STRING_START: Int
First constant for string Unicode properties.
Value: 16384
S_TERM
static val S_TERM: Int
Binary property STerm (new in Unicode 4.0.1). Sentence Terminal. Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/).
Value: 35
TERMINAL_PUNCTUATION
static val TERMINAL_PUNCTUATION: Int
Binary property Terminal_Punctuation.
Punctuation characters that generally mark the end of textual units.
Value: 28
TITLECASE_MAPPING
static val TITLECASE_MAPPING: Int
String property Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(String).
Value: 16394
TRAIL_CANONICAL_COMBINING_CLASS
static val TRAIL_CANONICAL_COMBINING_CLASS: Int
Enumerated property Trail_Canonical_Combining_Class. ICU-specific property for the ccc of the last code point of the decomposition, or tccc(c)=ccc(NFD(c)[last]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.
Value: 4113
UNIFIED_IDEOGRAPH
static val UNIFIED_IDEOGRAPH: Int
Binary property Unified_Ideograph (new).
For programmatic determination of Ideographic Description Sequences.
Value: 29
UPPERCASE
static val UPPERCASE: Int
Binary property Uppercase.
Same as UCharacter.isUUppercase(), different from UCharacter.isUpperCase().
Lu+Other_Uppercase
Value: 30
UPPERCASE_MAPPING
static val UPPERCASE_MAPPING: Int
String property Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(String).
Value: 16396
VARIATION_SELECTOR
static val VARIATION_SELECTOR: Int
Binary property Variation_Selector (new in Unicode 4.0.1). Indicates all those characters that qualify as Variation Selectors. For details on the behavior of these characters, see StandardizedVariants.html and 15.6 Variation Selectors.
Value: 36
VERTICAL_ORIENTATION
static val VERTICAL_ORIENTATION: Int
Enumerated property Vertical_Orientation. Used for UAX #50 Unicode Vertical Text Layout (https://www.unicode.org/reports/tr50/). New as a UCD property in Unicode 10.0.
Value: 4120
WHITE_SPACE
static val WHITE_SPACE: Int
Binary property White_Space.
Same as UCharacter.isUWhiteSpace(), different from UCharacter.isSpace() and UCharacter.isWhitespace().
Space characters+TAB+CR+LF-ZWSP-ZWNBSP
Value: 31
WORD_BREAK
static val WORD_BREAK: Int
Enumerated property Word_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/). Returns UCharacter.WordBreak values.
Value: 4116
XID_CONTINUE
static val XID_CONTINUE: Int
Binary property XID_Continue.
ID_Continue modified to allow closure under normalization forms NFKC and NFKD.
Value: 32
XID_START
static val XID_START: Int
Binary property XID_Start.
ID_Start modified to allow closure under normalization forms NFKC and NFKD.
Value: 33