mirror of https://github.com/osmarks/nanogpt-experiments.git synced 2024-11-10 20:09:58 +00:00
nanogpt-experiments/data
2023-01-11 05:27:19 +00:00
openwebtext candidate changes to apis, have to think through more 2023-01-01 01:29:48 +00:00
shakespeare candidate changes to apis, have to think through more 2023-01-01 01:29:48 +00:00
shakespeare_char add support for character-level language models, a new character-level shakespeare dataset, a new config file that shows how to train a character-level baby GPT on it, and adjust the sample function to figure out if it should decode with characters or GPT2 bpe tokens. The current implementation is a bit hacky and basically assumes just these two possibilities. In the future we may want to support more general encoders or decoders. 2023-01-11 05:27:19 +00:00
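As a rough illustration of the decoder selection described in the shakespeare_char commit message, the sketch below uses a character-level decoder when a character vocabulary is available and otherwise falls back to a caller-supplied BPE decoder. The names `build_char_vocab` and `pick_decoder`, the `meta` dictionary, and its `itos` key are assumptions made for this example, not confirmed details of the repository. The BPE decoder is passed in as a plain callable so the sketch needs no tokenizer library.

```python
from typing import Callable, Dict, List, Optional, Tuple

Decoder = Callable[[List[int]], str]


def build_char_vocab(text: str) -> Tuple[Dict[str, int], Dict[int, str]]:
    """Build character<->id tables from the unique characters in `text`."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}
    itos = {i: ch for ch, i in stoi.items()}
    return stoi, itos


def pick_decoder(meta: Optional[dict], bpe_decode: Decoder) -> Decoder:
    """Return a character-level decoder if `meta` carries an id->char table.

    `meta` is a hypothetical per-dataset metadata dict. If it is missing or
    has no "itos" entry, assume the data was tokenized with BPE and return
    the supplied `bpe_decode` callable unchanged.
    """
    if meta is not None and "itos" in meta:
        itos = meta["itos"]
        return lambda ids: "".join(itos[i] for i in ids)
    return bpe_decode


if __name__ == "__main__":
    stoi, itos = build_char_vocab("to be or not to be")
    decode = pick_decoder({"itos": itos}, bpe_decode=lambda ids: "<bpe>")
    ids = [stoi[ch] for ch in "to be"]
    print(decode(ids))
```

Like the approach the commit message describes, this handles only two cases. A more general design might store an encoder name in the metadata and look it up in a registry.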