Showing 1238 open source projects for "utf-16"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    JSON implementation for Ruby

    JSON implementation for Ruby

    JSON implementation for Ruby

    ...You can think of it as a low-fat alternative to XML if you want to store data on disk or transmit it over a network rather than use a verbose markup language. Both variants of the JSON generator generate UTF-8 character sequences by default. If an:ascii_only option with a true value is given, they escape all non-ASCII and control characters with \uXXXX escape sequences, and support UTF-16 surrogate pairs in order to be able to generate the whole range of Unicode code points. All strings, that are to be encoded as JSON strings, should be UTF-8 byte sequences on the Ruby side. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Tesseract OCR

    Tesseract OCR

    Open Source OCR Engine

    ...Tesseract can recognize over 100 languages out-of-the-box, and can be trained to recognize other languages. It supports various output formats, including plain text, HTML, PDF and more. It also has unicode (UTF-8) support.
    Downloads: 2,564 This Week
    Last Update:
    See Project
  • 3
    Runeset

    Runeset

    Fast UTF-8 codepoint sets for Zig

    This library offers a compact data structure for "generalized"1 UTF-8 encoded codepoints. The design is based on an implicit data structure2, which uses @popCount and bit masking to check membership quickly, with minimal branching, and without having to decode the UTF-8 into another format (for instance, a codepoint). This design is original, in the sense that I invented it. There may be prior art, it's remarkably difficult to search for "UTF-8 character sets" and find papers on set data structures, so I can't say with high confidence that it's truly novel; in a sense, it's an obvious extension of the widespread practice of using a pair of u64 bitmasks to detect a set of ASCII values. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Windows Terminal

    Windows Terminal

    The New Windows Terminal

    The Windows Terminal is a new, modern, fast, efficient, powerful, and productive terminal application for users of command-line tools and shells like Command Prompt, PowerShell, and WSL.
    Downloads: 258 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    ArduinoJson

    ArduinoJson

    JSON library for Arduino and embedded C++. Simple and efficient

    ArduinoJson is a C++ JSON library for Arduino and IoT (Internet Of Things). ArduinoJson has a simple and intuitive syntax to handle objects and arrays. ArduinoJson supports both JSON serialization and deserialization. ArduinoJson uses a fixed memory allocation, allowing to work on devices with very little RAM. ArduinoJson can filter large inputs to keep only fields that are relevant to your application, thereby saving a lot of memory. ArduinoJson can parse directly from an input Stream or...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    CssToInlineStyles class

    CssToInlineStyles class

    Enables you to convert HTML-pages/files into HTML-pages/files

    CssToInlineStyles is a class that enables you to convert HTML-pages/files into HTML-pages/files with inline styles. This is very usefull when you're sending emails. UTF-8 charset is not always detected correctly. Make sure you set the charset to UTF-8 using the correct meta-tag in the head. CssToInlineStyles class is used on each site based on Fork CMS. CssToInlineStyles class is used by Laravel Framework.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    SJTUThesis

    SJTUThesis

    Shanghai Jiao Tong University LaTeX Thesis Template

    A full example template demonstrating the use of the SJTUThesis LaTeX document class for Shanghai Jiao Tong University degree theses and coursework. It showcases formatting for formulas, tables, algorithms, references and supports UTF‑8 encoding.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Takes

    Takes

    True object-oriented Java web framework without NULLs

    Takes is a true object-oriented and immutable Java8 web development framework. Pay attention that UTF-8 encoding is set on the command line. The entire framework relies on your default Java encoding, which is not necessarily UTF-8 by default. To be sure, always set it on the command line with file.encoding Java argument. We decided not to hard-code "UTF-8" in our code mostly because this would be against the entire idea of Java localization, according to which a user always should have a choice of encoding and language selection. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 10
    Ark Pixel Font

    Ark Pixel Font

    Ark pixel font - Open source Pan-CJK pixel font

    ...Among them, 12 pixels are the main development target. 10, 16 pixels are experimental. Efforts are currently underway to achieve the availability of 12 pixels under the GB2312 character set. We have a temporary interim solution available for production until the full font is available, check out the Stitcher Pixel Font project.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    Vince's CSV Parser

    Vince's CSV Parser

    A modern C++ library for reading, writing, and analyzing CSV

    There's plenty of other CSV parsers in the wild, but I had a hard time finding what I wanted. Inspired by Python's csv module, I wanted a library with simple, intuitive syntax. Furthermore, I wanted support for special use cases such as calculating statistics on very large files. With the deluge of large datasets available, a performant CSV parser is a necessity. By using overlapped threads, memory-mapped IO, and efficient data structures, this parser can quickly tackle large CSV files....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Soufflé

    Soufflé

    Datalog variant for tool designers crafting analyses in Horn clauses

    ...Futamura projections/partial evaluation for effective translation to parallel C++; optimized staged compilation; specialized data-structures for logical relations. Efficient translation to parallel C++ of Datalog programs (CAV'16, CC'16) Efficient interpretation using de-specialization techniques (PLDI'21) Specialized data structure for relations (PACT'19, PPoPP'19, PMAM'19) with optimal index selection (VLDB'18) Extended semantics of Datalog, e.g., permitting unbounded recursions with numbers and terms. Simple component model for Datalog specifications. Recursively defined record types/ADTs (aka. constructors) for tuples. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    simdjson

    simdjson

    Parsing gigabytes of JSON per second

    JSON is everywhere on the Internet. Servers spend a *lot* of time parsing it. We need a fresh approach. The simdjson library uses commonly available SIMD instructions and microparallel algorithms to parse JSON 4x faster than RapidJSON and 25x faster than JSON for Modern C++. The simdjson library uses three-quarters less instructions than state-of-the-art parser RapidJSON. To our knowledge, simdjson is the first fully-validating JSON parser to run at gigabytes per second (GB/s) on commodity...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    cryptii

    cryptii

    Web app and framework offering modular conversion and encoding

    ...When the content gets edited inside a Viewer or when Brick settings get changed, the result propagates through the Pipe's Bricks in order and in both directions. Chain objects encapsulate UTF-8 text or binary-based content exchanged between Bricks.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Smaller C

    Smaller C

    Simple C compiler

    Smaller C is a simple and small single-pass C compiler, currently supporting most of the C language common between C89/ANSI C and C99 (minus some C89 and plus some C99 features). Currently, it generates 16-bit and 32-bit 80386+ assembly code for NASM that can then be assembled and linked into DOS, Windows, Linux and Mac OS X programs. (You may use YASM or FASM instead of NASM) Code generation for MIPS CPUs is also supported (primarily for RetroBSD). The compiler is capable of compiling itself. The core compiler comes with a preprocessor (ucpp), a linker, and a compiler driver (the driver invokes the preprocessor, the core compiler, the assembler, and the linker and supports options similar to those of gcc).
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    Jansson

    Jansson

    C library for encoding, decoding and manipulating JSON data

    Jansson is a C library for encoding, decoding and manipulating JSON data.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    NovaSR

    NovaSR

    A lightning fast audio upsampler

    NovaSR is an extremely lightweight and high-performance audio upsampling model that transforms low-quality 16 kHz audio into clearer, high-fidelity 48 kHz audio with remarkable speed and efficiency. At only about 50 KB in size, the model is orders of magnitude smaller than typical audio super-resolution networks, yet it achieves high quality and realtime performance thanks to its compact architecture and efficient convolutional design. NovaSR is especially valuable for post-processing tasks in speech enhancement, TTS pipelines, and dataset restoration where low sampling rates degrade perceived audio clarity; the minimal model size also makes it suitable for edge and embedded use cases where memory is at a premium. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    Pyxel

    Pyxel

    A retro game engine for Python

    A retro game engine for Python. Thanks to its simple specifications inspired by retro gaming consoles, such as only 16 colors can be displayed and only 4 sounds can be played back at the same time, you can feel free to enjoy making pixel art style games. The motivation for the development of Pyxel is the feedback from users. Please give Pyxel a star on GitHub! Pyxel's specifications and APIs are inspired by PICO-8 and TIC-80. Pyxel is open source and free to use.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    PHPMailer

    PHPMailer

    The classic email sending library for PHP

    PHPMailer – A full-featured email creation and transfer class for PHP. Support for UTF-8 content and 8bit, base64, binary, and quoted-printable encodings. SMTP authentication with LOGIN, PLAIN, CRAM-MD5, and XOAUTH2 mechanisms over SMTPS and SMTP+STARTTLS transports. Validates email addresses automatically. Many PHP developers need to send email from their code. The only PHP function that supports this directly is mail().
    Downloads: 197 This Week
    Last Update:
    See Project
  • 20
    Symfony Cache

    Symfony Cache

    Provides extended PSR-6, PSR-16 (and tags) implementations

    Symfony Cache is a powerful caching library that provides a consistent API for caching data in PHP applications. It supports a variety of cache backends, including Redis, Memcached, and filesystem caching. The component is highly efficient, with features like tag-based invalidation and hierarchical cache structures, making it suitable for both small and large-scale applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    minbpe

    minbpe

    Minimal, clean code for the Byte Pair Encoding (BPE) algorithm

    minbpe is a minimal, clean implementation of byte-level Byte Pair Encoding (BPE), the tokenization approach widely used in modern language models. It operates on UTF-8 encoded bytes rather than Unicode characters, which makes it robust to arbitrary text inputs and avoids needing a language-specific character vocabulary. The repository is structured as a teaching-oriented implementation that shows how to train a tokenizer by learning merge rules, then apply those merges to encode text into token IDs and decode tokens back into text. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    LingBot-World

    LingBot-World

    Advancing Open-source World Models

    ...LingBot-World supports long-term temporal consistency, maintaining coherent scenes and interactions over minute-level horizons. With real-time interactivity and sub-second latency at 16 FPS, it is well-suited for interactive applications and rapid experimentation. The project is fully open-access, releasing both code and models to help bridge the gap between closed and open world-model systems. LingBot-World empowers researchers and developers in areas such as content creation, gaming, robotics, and embodied AI learning.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    ripgrep

    ripgrep

    Regex pattern directory search tool that respects your .gitignore

    ripgrep is a line-oriented search tool that actively searches the directory you're currently in for a regex pattern. By default, ripgrep will ignore your .gitignore and skip hidden files or directories and binary files automatically. ripgrep has first class support on Windows, macOS and Linux, with binary downloads available for every release. ripgrep is similar to other popular search tools like The Silver Searcher, ack and grep. ripgrep supports arbitrary input preprocessing filters which...
    Downloads: 53 This Week
    Last Update:
    See Project
  • 24
    Better BibTeX for Zotero

    Better BibTeX for Zotero

    Make Zotero effective for us LaTeX holdouts

    Better BibTeX (BBT) is a plugin for Zotero and Juris-M that makes it easier to manage bibliographic data, especially for people authoring documents using text-based toolchains (e.g. based on LaTeX / Markdown). Zotero does all its work in UTF-8 Unicode, which is absolutely the right thing to do. Unfortunately, for those shackled to BibTeX and who cannot (yet) move to BibLaTeX, unicode is a major PITA. Also, Zotero supports some simple HTML markup in your items that Bib(La)TeX won’t understand.
    Downloads: 50 This Week
    Last Update:
    See Project
  • 25
    goquery

    goquery

    A little like that j-thing, only in Go

    ...Since the net/html parser returns nodes, and not a full-featured DOM tree, jQuery's stateful manipulation functions (like height(), css(), and detach()) have been left off. Also, because the net/HTML parser requires UTF-8 encoding, so does goquery: it is the caller's responsibility to ensure that the source document provides UTF-8 encoded HTML. See the wiki for various options to do this. Syntax-wise, it is as close as possible to jQuery, with the same function names when possible, and that warm and fuzzy chainable interface. jQuery being the ultra-popular library that it is, I felt that writing a similar HTML-manipulating library was better to follow its API than to start anew (in the same spirit as Go's fmt package), even though some of its methods are less than intuitive (looking at you, index()...).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB