Documentation

Unicode

Table of Contents

Classes

Utf8String
A class for manipulating UTF-8 strings.

Functions

utf8_casefold_simple_maps()  : array<string|int, mixed>
Helper function for utf8_casefold.
utf8_casefold_maps()  : array<string|int, mixed>
Helper function for utf8_casefold.
utf8_strtolower_simple_maps()  : array<string|int, mixed>
Helper function for utf8_strtolower.
utf8_strtolower_maps()  : array<string|int, mixed>
Helper function for utf8_strtolower.
utf8_titlecase_simple_maps()  : array<string|int, mixed>
Helper function for utf8_convert_case.
utf8_titlecase_maps()  : array<string|int, mixed>
Helper function for utf8_convert_case.
utf8_strtoupper_simple_maps()  : array<string|int, mixed>
Helper function for utf8_strtoupper.
utf8_strtoupper_maps()  : array<string|int, mixed>
Helper function for utf8_strtoupper.
utf8_combining_classes()  : array<string|int, mixed>
Helper function for utf8_normalize_d.
utf8_compose_maps()  : array<string|int, mixed>
Helper function for utf8_compose.
utf8_normalize_d_maps()  : array<string|int, mixed>
Helper function for utf8_normalize_d.
utf8_normalize_kd_maps()  : array<string|int, mixed>
Helper function for utf8_normalize_kd.
utf8_default_ignorables()  : array<string|int, mixed>
Helper function for utf8_normalize_kc_casefold.
idna_maps()  : array<string|int, mixed>
Helper function for idn_to_* polyfills.
idna_maps_deviation()  : array<string|int, mixed>
Helper function for idn_to_* polyfills.
idna_maps_not_std3()  : array<string|int, mixed>
Helper function for idn_to_* polyfills.
idna_regex()  : array<string|int, mixed>
Helper function for idn_to_* polyfills.
utf8_regex_quick_check()  : array<string|int, mixed>
Helper function for utf8_is_normalized.
utf8_regex_properties()  : array<string|int, mixed>
Helper function for utf8_sanitize_invisibles and utf8_convert_case.
utf8_regex_variation_selectors()  : array<string|int, mixed>
Helper function for utf8_sanitize_invisibles.
utf8_regex_joining_type()  : array<string|int, mixed>
Helper function for utf8_sanitize_invisibles.
utf8_regex_indic()  : array<string|int, mixed>
Helper function for utf8_sanitize_invisibles.

Functions

utf8_casefold_simple_maps()

Helper function for utf8_casefold.

utf8_casefold_simple_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Casefolding maps.

utf8_casefold_maps()

Helper function for utf8_casefold.

utf8_casefold_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Casefolding maps.

utf8_strtolower_simple_maps()

Helper function for utf8_strtolower.

utf8_strtolower_simple_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Uppercase to lowercase maps.

utf8_strtolower_maps()

Helper function for utf8_strtolower.

utf8_strtolower_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Uppercase to lowercase maps.

utf8_titlecase_simple_maps()

Helper function for utf8_convert_case.

utf8_titlecase_simple_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Simple title case maps.

utf8_titlecase_maps()

Helper function for utf8_convert_case.

utf8_titlecase_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Full title case maps.

utf8_strtoupper_simple_maps()

Helper function for utf8_strtoupper.

utf8_strtoupper_simple_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Lowercase to uppercase maps.

utf8_strtoupper_maps()

Helper function for utf8_strtoupper.

utf8_strtoupper_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Lowercase to uppercase maps.

utf8_combining_classes()

Helper function for utf8_normalize_d.

utf8_combining_classes() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Combining Class data for Unicode normalization.

utf8_compose_maps()

Helper function for utf8_compose.

utf8_compose_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Composition maps for Unicode normalization.

utf8_normalize_d_maps()

Helper function for utf8_normalize_d.

utf8_normalize_d_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Canonical Decomposition maps for Unicode normalization.

utf8_normalize_kd_maps()

Helper function for utf8_normalize_kd.

utf8_normalize_kd_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Compatibility Decomposition maps for Unicode normalization.

utf8_default_ignorables()

Helper function for utf8_normalize_kc_casefold.

utf8_default_ignorables() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Characters with the 'Default_Ignorable_Code_Point' property.

idna_maps()

Helper function for idn_to_* polyfills.

idna_maps() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character maps for IDNA processing.

idna_maps_deviation()

Helper function for idn_to_* polyfills.

idna_maps_deviation() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

"Deviation" character maps for IDNA processing.

idna_maps_not_std3()

Helper function for idn_to_* polyfills.

idna_maps_not_std3() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Non-STD3 character maps for IDNA processing.

idna_regex()

Helper function for idn_to_* polyfills.

idna_regex() : array<string|int, mixed>

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Regular expressions useful for IDNA processing.

utf8_regex_quick_check()

Helper function for utf8_is_normalized.

utf8_regex_quick_check() : array<string|int, mixed>

Character class lists compiled from: https://unicode.org/Public/UNIDATA/extracted/DerivedNormalizationProps.txt

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character classes for disallowed characters in normalization forms.

utf8_regex_properties()

Helper function for utf8_sanitize_invisibles and utf8_convert_case.

utf8_regex_properties() : array<string|int, mixed>

Character class lists compiled from: https://unicode.org/Public/UNIDATA/DerivedCoreProperties.txt https://unicode.org/Public/UNIDATA/PropList.txt https://unicode.org/Public/UNIDATA/emoji/emoji-data.txt https://unicode.org/Public/UNIDATA/extracted/DerivedGeneralCategory.txt

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character classes for various Unicode properties.

utf8_regex_variation_selectors()

Helper function for utf8_sanitize_invisibles.

utf8_regex_variation_selectors() : array<string|int, mixed>

Character class lists compiled from: https://unicode.org/Public/UNIDATA/StandardizedVariants.txt https://unicode.org/Public/UNIDATA/emoji/emoji-variation-sequences.txt

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character classes for filtering variation selectors.

utf8_regex_joining_type()

Helper function for utf8_sanitize_invisibles.

utf8_regex_joining_type() : array<string|int, mixed>

Character class lists compiled from: https://unicode.org/Public/UNIDATA/extracted/DerivedJoiningType.txt

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character classes for joining characters in certain scripts.

utf8_regex_indic()

Helper function for utf8_sanitize_invisibles.

utf8_regex_indic() : array<string|int, mixed>

Character class lists compiled from: https://unicode.org/Public/UNIDATA/extracted/DerivedCombiningClass.txt https://unicode.org/Public/UNIDATA/IndicSyllabicCategory.txt

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character classes for Indic scripts that use viramas.


        
On this page

Search results