Today's software market is a global one in which it is desirable to develop and maintain one application (single source/single binary) that supports a wide variety of languages. The International Components for Unicode (ICU) libraries provide robust and full-featured Unicode services on a wide variety of platforms to help this design goal. The ICU libraries provide support for:
- The latest version of the Unicode standard
- Character set conversions with support for over 220 codepages
- Locale data for more than 300 locales
- Language sensitive text collation (sorting) and searching based on the Unicode Collation Algorithm (=ISO 14651)
- Regular expression matching and Unicode sets
- Transformations for normalization, upper/lowercase, script transliterations (50+ pairs)
- Resource bundles for storing and accessing localized information
- Date/Number/Message formatting and parsing of culture specific input/output formats
- Calendar specific date and time manipulation
- Complex text layout for Arabic, Hebrew, Indic and Thai
- Text boundary analysis for finding characters, word and sentence boundaries
ICU has a sister project ICU4J that extends the internationalization capabilities of Java to a level similar to ICU. The ICU C/C++ project is also called ICU4C when a distinction is necessary.
Licensed under ICU License - ICU 1.8.1 and later.
Add new comment