Unicode
Table of Contents
Classes
- Utf8String
- A class for manipulating UTF-8 strings.
Functions
- utf8_casefold_simple_maps() : array<string|int, mixed>
- Helper function for utf8_casefold.
- utf8_casefold_maps() : array<string|int, mixed>
- Helper function for utf8_casefold.
- utf8_strtolower_simple_maps() : array<string|int, mixed>
- Helper function for utf8_strtolower.
- utf8_strtolower_maps() : array<string|int, mixed>
- Helper function for utf8_strtolower.
- utf8_titlecase_simple_maps() : array<string|int, mixed>
- Helper function for utf8_convert_case.
- utf8_titlecase_maps() : array<string|int, mixed>
- Helper function for utf8_convert_case.
- utf8_strtoupper_simple_maps() : array<string|int, mixed>
- Helper function for utf8_strtoupper.
- utf8_strtoupper_maps() : array<string|int, mixed>
- Helper function for utf8_strtoupper.
- utf8_combining_classes() : array<string|int, mixed>
- Helper function for utf8_normalize_d.
- utf8_compose_maps() : array<string|int, mixed>
- Helper function for utf8_compose.
- utf8_normalize_d_maps() : array<string|int, mixed>
- Helper function for utf8_normalize_d.
- utf8_normalize_kd_maps() : array<string|int, mixed>
- Helper function for utf8_normalize_kd.
- utf8_default_ignorables() : array<string|int, mixed>
- Helper function for utf8_normalize_kc_casefold.
- idna_maps() : array<string|int, mixed>
- Helper function for idn_to_* polyfills.
- idna_maps_deviation() : array<string|int, mixed>
- Helper function for idn_to_* polyfills.
- idna_maps_not_std3() : array<string|int, mixed>
- Helper function for idn_to_* polyfills.
- idna_regex() : array<string|int, mixed>
- Helper function for idn_to_* polyfills.
- utf8_regex_quick_check() : array<string|int, mixed>
- Helper function for utf8_is_normalized.
- utf8_regex_properties() : array<string|int, mixed>
- Helper function for utf8_sanitize_invisibles and utf8_convert_case.
- utf8_regex_variation_selectors() : array<string|int, mixed>
- Helper function for utf8_sanitize_invisibles.
- utf8_regex_joining_type() : array<string|int, mixed>
- Helper function for utf8_sanitize_invisibles.
- utf8_regex_indic() : array<string|int, mixed>
- Helper function for utf8_sanitize_invisibles.
Functions
utf8_casefold_simple_maps()
Helper function for utf8_casefold.
utf8_casefold_simple_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Casefolding maps.
utf8_casefold_maps()
Helper function for utf8_casefold.
utf8_casefold_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Casefolding maps.
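As a rough illustration of how a casefolding map might be applied, the sketch below folds a string with strtr(). The function names and the three-entry map are hypothetical stand-ins; the shape of the real generated data is not documented here, so a flat "source string => folded string" array is an assumption.

```php
<?php
// Hypothetical stand-in for the generated data. Assumes a flat map of
// UTF-8 source strings to their casefolded replacements.
function example_casefold_maps(): array
{
    return [
        'A' => 'a',
        "\xC3\x84" => "\xC3\xA4", // U+00C4 => U+00E4
        "\xC3\x9F" => 'ss',       // U+00DF folds to two characters in full casefolding
    ];
}

// Fold a string by replacing every mapped character in a single pass.
// strtr() with an array never rescans text it has already replaced.
function example_casefold(string $string): string
{
    return strtr($string, example_casefold_maps());
}
```

Two strings can then be compared caselessly by comparing their folded forms, for example example_casefold($a) === example_casefold($b).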
utf8_strtolower_simple_maps()
Helper function for utf8_strtolower.
utf8_strtolower_simple_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Uppercase to lowercase maps.
utf8_strtolower_maps()
Helper function for utf8_strtolower.
utf8_strtolower_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Uppercase to lowercase maps.
utf8_titlecase_simple_maps()
Helper function for utf8_convert_case.
utf8_titlecase_simple_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Simple title case maps.
utf8_titlecase_maps()
Helper function for utf8_convert_case.
utf8_titlecase_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Full title case maps.
utf8_strtoupper_simple_maps()
Helper function for utf8_strtoupper.
utf8_strtoupper_simple_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Lowercase to uppercase maps.
utf8_strtoupper_maps()
Helper function for utf8_strtoupper.
utf8_strtoupper_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Lowercase to uppercase maps.
utf8_combining_classes()
Helper function for utf8_normalize_d.
utf8_combining_classes() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Combining Class data for Unicode normalization.
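Combining class data is typically used for the canonical ordering step of normalization, where adjacent combining marks are sorted by class. The sketch below shows that step on an array of characters. The function names and the two-entry table are hypothetical; a "character => class number" layout for the real data is an assumption.

```php
<?php
// Hypothetical excerpt. Assumes a map of UTF-8 characters to their
// Canonical_Combining_Class value; unlisted characters count as class 0.
function example_combining_classes(): array
{
    return [
        "\xCC\x81" => 230, // U+0301 COMBINING ACUTE ACCENT
        "\xCC\xA3" => 220, // U+0323 COMBINING DOT BELOW
    ];
}

// Canonical ordering: swap adjacent characters while the second has a
// non-zero class that is lower than the class of the first.
function example_canonical_order(array $chars): array
{
    $classes = example_combining_classes();
    $count = count($chars);

    do {
        $swapped = false;

        for ($i = 1; $i < $count; $i++) {
            $prev = $classes[$chars[$i - 1]] ?? 0;
            $curr = $classes[$chars[$i]] ?? 0;

            if ($curr !== 0 && $prev > $curr) {
                [$chars[$i - 1], $chars[$i]] = [$chars[$i], $chars[$i - 1]];
                $swapped = true;
            }
        }
    } while ($swapped);

    return $chars;
}
```

Characters of class 0 act as barriers, so marks never move across a base character.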
utf8_compose_maps()
Helper function for utf8_compose.
utf8_compose_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Composition maps for Unicode normalization.
utf8_normalize_d_maps()
Helper function for utf8_normalize_d.
utf8_normalize_d_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Canonical Decomposition maps for Unicode normalization.
utf8_normalize_kd_maps()
Helper function for utf8_normalize_kd.
utf8_normalize_kd_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Compatibility Decomposition maps for Unicode normalization.
utf8_default_ignorables()
Helper function for utf8_normalize_kc_casefold.
utf8_default_ignorables() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Characters with the 'Default_Ignorable_Code_Point' property.
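One plausible use of such a list is stripping ignorable characters before comparing or indexing text. The sketch below does this with str_replace(). The function names and the two sample characters are hypothetical, and a flat list of UTF-8 strings is an assumption about the data's shape.

```php
<?php
// Hypothetical excerpt. Assumes a flat list of UTF-8 characters.
function example_default_ignorables(): array
{
    return [
        "\xC2\xAD",     // U+00AD SOFT HYPHEN
        "\xE2\x80\x8B", // U+200B ZERO WIDTH SPACE
    ];
}

// Remove every listed character from the string.
function example_strip_ignorables(string $string): string
{
    return str_replace(example_default_ignorables(), '', $string);
}
```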
idna_maps()
Helper function for idn_to_* polyfills.
idna_maps() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Character maps for IDNA processing.
idna_maps_deviation()
Helper function for idn_to_* polyfills.
idna_maps_deviation() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: "Deviation" character maps for IDNA processing.
idna_maps_not_std3()
Helper function for idn_to_* polyfills.
idna_maps_not_std3() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Non-STD3 character maps for IDNA processing.
idna_regex()
Helper function for idn_to_* polyfills.
idna_regex() : array<string|int, mixed>
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Regular expressions useful for IDNA processing.
utf8_regex_quick_check()
Helper function for utf8_is_normalized.
utf8_regex_quick_check() : array<string|int, mixed>
Character class lists compiled from: https://unicode.org/Public/UNIDATA/extracted/DerivedNormalizationProps.txt
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Character classes for disallowed characters in normalization forms.
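A quick check of this kind usually works by testing whether a string contains any character that is disallowed in the target normalization form. The sketch below shows the idea with preg_match(). The function names, the 'NFC' array key, and the four-code-point excerpt are hypothetical; only the general technique is taken from the description above.

```php
<?php
// Hypothetical excerpt. Assumes one PCRE character class body per
// normalization form. These four code points have NFC_Quick_Check=No.
function example_regex_quick_check(): array
{
    return [
        'NFC' => '\x{0340}\x{0341}\x{0343}\x{0344}',
    ];
}

// Returns true when no disallowed character is found. A string that
// passes is not proven normalized; it has only cleared this check.
// Invalid UTF-8 makes preg_match() return false, which also fails here.
function example_passes_quick_check(string $string, string $form = 'NFC'): bool
{
    $classes = example_regex_quick_check();

    return preg_match('/[' . $classes[$form] . ']/u', $string) === 0;
}
```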
utf8_regex_properties()
Helper function for utf8_sanitize_invisibles and utf8_convert_case.
utf8_regex_properties() : array<string|int, mixed>
Character class lists compiled from:
- https://unicode.org/Public/UNIDATA/DerivedCoreProperties.txt
- https://unicode.org/Public/UNIDATA/PropList.txt
- https://unicode.org/Public/UNIDATA/emoji/emoji-data.txt
- https://unicode.org/Public/UNIDATA/extracted/DerivedGeneralCategory.txt
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Character classes for various Unicode properties.
utf8_regex_variation_selectors()
Helper function for utf8_sanitize_invisibles.
utf8_regex_variation_selectors() : array<string|int, mixed>
Character class lists compiled from:
- https://unicode.org/Public/UNIDATA/StandardizedVariants.txt
- https://unicode.org/Public/UNIDATA/emoji/emoji-variation-sequences.txt
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Character classes for filtering variation selectors.
utf8_regex_joining_type()
Helper function for utf8_sanitize_invisibles.
utf8_regex_joining_type() : array<string|int, mixed>
Character class lists compiled from: https://unicode.org/Public/UNIDATA/extracted/DerivedJoiningType.txt
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Character classes for joining characters in certain scripts.
utf8_regex_indic()
Helper function for utf8_sanitize_invisibles.
utf8_regex_indic() : array<string|int, mixed>
Character class lists compiled from:
- https://unicode.org/Public/UNIDATA/extracted/DerivedCombiningClass.txt
- https://unicode.org/Public/UNIDATA/IndicSyllabicCategory.txt
Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.
Return values
array<string|int, mixed>: Character classes for Indic scripts that use viramas.