Here are 38 public repositories matching this topic...
Unicode (UTF-8) capable std::string
Visualize different Unicode encodings in the terminal
Updated Dec 29, 2021 (Ruby)
UTF-8/16/32 C++11 header-only library for Windows/Linux/macOS
My work on high-speed conversion of UTF-8 to UTF-32/UTF-16
Performant and memory-efficient storage for immutable strings in C++17. Supports all standard char types: char, wchar_t, char16_t, char32_t and C++20's char8_t.
Materials from my talks at CppCon 2018
A simple library to decode a string that contains escaped UTF-16/32 characters into unescaped UTF-16.
Legacy Unicode string types
Updated Sep 15, 2020 (Julia)
The C++ unicode_traits class template makes using Unicode easier
An implementation targeting high performance for frequent read/write operations on multi-byte strings.
UTF-16 readable C preprocessor (A fork of mcpp 2.7.2)
C/C++ UTF-8, UTF-16, UTF-32 conversion library
C++ library to manage strings with different encodings
Unicode information. Blocks, scripts, classes and ranges for Haxe. [Experimental]
Updated Feb 25, 2020 (Haxe)
Helper methods to cross-compile Unicode regular expressions in Haxe.
Updated Jun 25, 2020 (Haxe)
A regular expression engine for UTF-32 strings, based on a deterministic finite automaton (DFA), implemented in C++.
A very lightweight, versatile and portable C library for handling Unicode strings. The library's source code conforms to the ANSI C89/C90 standard.
Maximum Unicode code point.
Updated May 1, 2022 (Makefile)
Maximum Unicode code point in the Basic Multilingual Plane (BMP).
Updated May 1, 2022 (Makefile)
A purely static, zero-dependency helper for handling Unicode BOMs
Standard string Unicode constants.
Updated May 1, 2022 (Makefile)
A low-level string character analysis library for PHP.
🔤 UTF8-16-32 analysis and manipulation library
An istream extension converting Unicode encodings (e.g. UTF-16 little-endian) on the fly to UTF-8.
Repository for Unicode related things
Unicode charset detector for text files, with or without a BOM.
Converts UTF-16 (BE/LE), UTF-32 (BE/LE) and ISO-8859-N to UTF-8. Removes the BOM and surrogate pairs from UTF-8, converting a code point between U+D800 and U+DBFF followed by a code point between U+DC00 and U+DFFF into one valid code point above U+FFFF.
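The surrogate-pair rule in the description above can be sketched in a few lines of C++17. This is only an illustration of the standard UTF-16 arithmetic, not code from the listed repository; the function name `combine_surrogates` is hypothetical, and the sketch assumes the two surrogate values have already been decoded from the input bytes.

```cpp
#include <optional>

// Hypothetical helper (not from any listed repository): combine a high
// surrogate (U+D800..U+DBFF) and a low surrogate (U+DC00..U+DFFF) into
// one code point above U+FFFF. Returns std::nullopt if either value is
// outside its expected range.
std::optional<char32_t> combine_surrogates(char16_t high, char16_t low) {
    if (high < 0xD800 || high > 0xDBFF) return std::nullopt;
    if (low < 0xDC00 || low > 0xDFFF) return std::nullopt;
    // Each surrogate carries 10 payload bits; the pair encodes
    // (code point - 0x10000) as a 20-bit value.
    const char32_t hi_bits = static_cast<char32_t>(high) - 0xD800;
    const char32_t lo_bits = static_cast<char32_t>(low) - 0xDC00;
    return static_cast<char32_t>(0x10000 + (hi_bits << 10) + lo_bits);
}
```

For example, the pair 0xD83D, 0xDE00 combines to U+1F600, while a reversed or unpaired surrogate yields no value, leaving the caller to decide how to report the error.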
A C library for encoding, decoding and manipulating Unicode texts.
Currently uses a very naive algorithm.
Use something like Boyer–Moore instead.
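As a rough sketch of what "something like Boyer–Moore" could look like for UTF-32 text, the following uses the Horspool simplification (bad-character shifts only) rather than full Boyer–Moore. It is an assumption about the intended use case, since the note does not say which code it refers to; the function name `horspool_find` is hypothetical. Because UTF-32 code units span a large range, the shift table is a hash map rather than a fixed 256-entry array.

```cpp
#include <cstddef>
#include <string>
#include <unordered_map>

// Hypothetical sketch: Boyer-Moore-Horspool substring search over UTF-32.
// Returns the index of the first match, or std::u32string::npos.
std::size_t horspool_find(const std::u32string& text,
                          const std::u32string& pattern) {
    const std::size_t n = text.size();
    const std::size_t m = pattern.size();
    if (m == 0) return 0;
    if (m > n) return std::u32string::npos;

    // For every pattern character except the last, record its distance
    // from the end of the pattern (the rightmost occurrence wins).
    std::unordered_map<char32_t, std::size_t> shift;
    for (std::size_t i = 0; i + 1 < m; ++i) shift[pattern[i]] = m - 1 - i;

    std::size_t pos = 0;
    while (pos + m <= n) {
        // Compare the current window from right to left.
        std::size_t j = m;
        while (j > 0 && text[pos + j - 1] == pattern[j - 1]) --j;
        if (j == 0) return pos;
        // Shift by the table entry for the window's last character,
        // or by the whole pattern length if it is not in the table.
        const auto it = shift.find(text[pos + m - 1]);
        pos += (it == shift.end()) ? m : it->second;
    }
    return std::u32string::npos;
}
```

On typical text this skips several positions per mismatch instead of advancing one at a time, though its worst case is still proportional to the product of the text and pattern lengths.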