1
0
mirror of https://github.com/osmarks/nanogpt-experiments.git synced 2024-12-18 14:10:28 +00:00

Fixing eval path in README

This commit is contained in:
Gautam Kumar 2024-03-23 23:51:02 -07:00
parent 325be85d9b
commit 1ab9ec1b83

View File

@ -131,10 +131,10 @@ Finally, to train on a single GPU simply run the `$ python train.py` script. Hav
OpenAI GPT-2 checkpoints allow us to get some baselines in place for openwebtext. We can get the numbers as follows:
```
$ python train.py eval_gpt2
$ python train.py eval_gpt2_medium
$ python train.py eval_gpt2_large
$ python train.py eval_gpt2_xl
$ python train.py config/eval_gpt2.py
$ python train.py config/eval_gpt2_medium.py
$ python train.py config/eval_gpt2_large.py
$ python train.py config/eval_gpt2_xl.py
```
and observe the following losses on train and val: