Amazing work on bpe!
Wanted to see if there were any known Python bindings a la the snippets from tiktoken below?
import tiktoken
enc = tiktoken.get_encoding("o200k_base")
assert enc.decode(enc.encode("hello world")) == "hello world"
# To get the tokeniser corresponding to a specific model in the OpenAI API:
enc = tiktoken.encoding_for_model("gpt-4o")
Amazing work on
bpe!Wanted to see if there were any known Python bindings a la the snippets from tiktoken below?