1
0
mirror of https://github.com/osmarks/nanogpt-experiments.git synced 2024-12-18 14:10:28 +00:00
nanogpt-experiments/data/shakespeare
2023-01-20 10:39:45 -08:00
..
prepare.py use relative paths so that running the data prep scripts always create files in local folder, no matter where run from 2023-01-20 10:39:45 -08:00
readme.md candidate changes to apis, have to think through more 2023-01-01 01:29:48 +00:00

tiny shakespeare

Tiny shakespeare, of the good old char-rnn fame :)

After running prepare.py:

  • train.bin has 301,966 tokens
  • val.bin has 36,059 tokens