Elucent's Magical Unicode Library

Not actually that magical... just a fairly lightweight Unicode utility. Still kinda fun though!

This is neither the fastest nor the most complete Unicode library out there, but it covers a few core features: encoding, decoding, and character types. These are implemented at the bottom of utf8.c. The library is intended to be linked statically, does not depend on libc, and compiles to a binary under 32 kB (at least on my current laptop).
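
For readers who want a picture of what encoding and decoding involve, here is a minimal sketch of UTF-8 conversion for a single code point. It is not the code from utf8.c: the function names `sketch_utf8_encode` and `sketch_utf8_decode` are hypothetical, and the choice to reject surrogates and overlong forms is an assumption of this sketch. It only uses the freestanding headers `<stddef.h>` and `<stdint.h>`, so it stays libc-free.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical names for illustration; not the API declared in utf8.h. */

/* Encode one code point into out. Returns the number of bytes written,
 * or 0 if cp is a surrogate or lies above U+10FFFF. */
static size_t sketch_utf8_encode(uint32_t cp, unsigned char out[4]) {
    if (cp < 0x80) {
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800) {
        out[0] = (unsigned char)(0xC0 | (cp >> 6));
        out[1] = (unsigned char)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {
        if (cp >= 0xD800 && cp <= 0xDFFF) return 0; /* surrogate */
        out[0] = (unsigned char)(0xE0 | (cp >> 12));
        out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    if (cp <= 0x10FFFF) {
        out[0] = (unsigned char)(0xF0 | (cp >> 18));
        out[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
        out[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[3] = (unsigned char)(0x80 | (cp & 0x3F));
        return 4;
    }
    return 0;
}

/* Decode one code point from in[0..len). Returns the number of bytes
 * consumed, or 0 if the sequence is truncated or malformed. */
static size_t sketch_utf8_decode(const unsigned char *in, size_t len,
                                 uint32_t *cp) {
    size_t n, i;
    uint32_t v, min;

    if (len == 0) return 0;
    if (in[0] < 0x80) {
        *cp = in[0];
        return 1;
    }
    if ((in[0] & 0xE0) == 0xC0)      { n = 2; v = in[0] & 0x1F; min = 0x80; }
    else if ((in[0] & 0xF0) == 0xE0) { n = 3; v = in[0] & 0x0F; min = 0x800; }
    else if ((in[0] & 0xF8) == 0xF0) { n = 4; v = in[0] & 0x07; min = 0x10000; }
    else return 0; /* stray continuation byte or invalid lead byte */

    if (len < n) return 0;
    for (i = 1; i < n; i++) {
        if ((in[i] & 0xC0) != 0x80) return 0; /* not a continuation byte */
        v = (v << 6) | (uint32_t)(in[i] & 0x3F);
    }
    /* Reject overlong forms, surrogates, and out-of-range values. */
    if (v < min || v > 0x10FFFF || (v >= 0xD800 && v <= 0xDFFF)) return 0;
    *cp = v;
    return n;
}
```

Returning a byte count (with 0 meaning failure) lets a caller walk a buffer by adding the return value to its cursor. The real library may well report errors differently.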

Most of the source and binary size of this library comes from the code-point range tables in utf8.c. These arrays can be regenerated automatically from a Unicode Character Database distribution using the included genranges.py script.

By default, the library also generates a character type lookup table at runtime, which occupies 2 MiB of memory. This can be turned off in the Makefile, in which case character type queries fall back to a binary search over the included code-point ranges.
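
The trade-off between the two lookup strategies can be sketched roughly as follows. Everything here is hypothetical: the `sketch_range` struct, the three-entry sample table, the type constants, and the function names are made up for illustration and do not reflect the actual layout in utf8.c. The sketch also assumes the table holds one byte per entry over a 21-bit index space (2^21 bytes = 2 MiB), which matches the stated size but is only a guess at how the real table is laid out.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical type codes and range layout, for illustration only. */
enum { SK_NONE = 0, SK_DIGIT = 1, SK_UPPER = 2, SK_LOWER = 3 };

typedef struct {
    uint32_t lo, hi; /* inclusive code-point range */
    uint8_t type;
} sketch_range;

/* Tiny sample table; must be sorted by lo and non-overlapping.
 * The real tables are generated from the Unicode Character Database. */
static const sketch_range sketch_ranges[] = {
    {0x30, 0x39, SK_DIGIT},
    {0x41, 0x5A, SK_UPPER},
    {0x61, 0x7A, SK_LOWER},
};
#define SKETCH_NRANGES (sizeof sketch_ranges / sizeof sketch_ranges[0])

/* Strategy 1: binary search over the ranges. No extra memory,
 * O(log n) comparisons per query. */
static uint8_t sketch_type_search(uint32_t cp) {
    size_t lo = 0, hi = SKETCH_NRANGES;
    while (lo < hi) {
        size_t mid = lo + (hi - lo) / 2;
        if (cp < sketch_ranges[mid].lo) hi = mid;
        else if (cp > sketch_ranges[mid].hi) lo = mid + 1;
        else return sketch_ranges[mid].type;
    }
    return SK_NONE;
}

/* Strategy 2: expand the ranges once into a flat table so that each
 * query is a single array index. Assumed size: 2^21 one-byte entries. */
#define SKETCH_TABLE_SIZE (1u << 21)
static uint8_t sketch_table[SKETCH_TABLE_SIZE];

static void sketch_table_init(void) {
    size_t i;
    uint32_t cp;
    for (i = 0; i < SKETCH_NRANGES; i++)
        for (cp = sketch_ranges[i].lo; cp <= sketch_ranges[i].hi; cp++)
            sketch_table[cp] = sketch_ranges[i].type;
}

static uint8_t sketch_type_table(uint32_t cp) {
    return cp < SKETCH_TABLE_SIZE ? sketch_table[cp] : SK_NONE;
}
```

After calling `sketch_table_init()` once, both functions should agree on every code point. The table version trades 2 MiB of memory for constant-time queries, while the search version keeps the footprint down to the range arrays themselves.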
