isaacus / kanon-tokenizer

huggingface.co
Total runs: 0
24-hour runs: 0
7-day runs: 0
30-day runs: 0
Model's Last Updated: October 15 2025

Introduction of kanon-tokenizer

Model Details of kanon-tokenizer

The Kanon tokenizer is the world's most space-efficient legal document tokenizer of its size.

With a vocabulary of only 65,536 tokens, every token ID fits in an unsigned 16-bit integer, so documents encoded with the tokenizer can be stored at two bytes per token, dramatically reducing memory requirements compared with larger vocabularies.
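The storage arithmetic can be illustrated with a minimal sketch using only the Python standard library. The token IDs below are made up for illustration and the helper names (pack_token_ids, unpack_token_ids) are hypothetical; neither is part of the Kanon tokenizer itself. The only fact taken from the description above is the vocabulary size of 65,536, which makes the largest possible token ID equal to the unsigned 16-bit maximum.

```python
import struct

VOCAB_SIZE = 65_536        # vocabulary size stated in the model description
UINT16_MAX = 2**16 - 1     # largest value an unsigned 16-bit integer can hold


def pack_token_ids(token_ids):
    """Pack token IDs as little-endian unsigned 16-bit integers (2 bytes each)."""
    if any(not 0 <= t <= UINT16_MAX for t in token_ids):
        raise ValueError("token ID does not fit in an unsigned 16-bit integer")
    return struct.pack(f"<{len(token_ids)}H", *token_ids)


def unpack_token_ids(buf):
    """Recover the list of token IDs from a buffer written by pack_token_ids."""
    return list(struct.unpack(f"<{len(buf) // 2}H", buf))


# Hypothetical token IDs; real IDs would come from running the tokenizer.
token_ids = [0, 17, 4096, VOCAB_SIZE - 1]

packed16 = pack_token_ids(token_ids)
# Same IDs stored as unsigned 32-bit integers, as a larger vocabulary would need.
packed32 = struct.pack(f"<{len(token_ids)}I", *token_ids)

print(len(packed16), len(packed32))
```

Under these assumptions the 16-bit buffer is half the size of the 32-bit one, and unpacking it returns the original IDs unchanged.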

The Kanon tokenizer is already used in production by all of Isaacus' currently available Kanon models.

The Kanon tokenizer was trained on Isaacus' Blackstone Corpus, one of the world’s largest private repositories of contracts, decisions, legislation and other legal and government documents, covering a wide range of jurisdictions, including the U.S., U.K., Canada, Australia, New Zealand, Ireland, the entire European Union, the United Nations and the International Court of Justice, to name a few.

The Kanon tokenizer is licensed freely, including for commercial use, under the Apache 2.0 license. We actively encourage legal AI practitioners, including our own competitors, to take advantage of the Kanon tokenizer when training their legal AI models, both to promote better interoperability between models and to improve their space efficiency.

More Information About kanon-tokenizer huggingface.co Model

The full text of the kanon-tokenizer license (Apache 2.0) is available here:

https://choosealicense.com/licenses/apache-2.0

kanon-tokenizer huggingface.co Url

https://huggingface.co/isaacus/kanon-tokenizer

Provider of kanon-tokenizer huggingface.co

isaacus
ORGANIZATIONS
