mirror of https://github.com/osmarks/nanogpt-experiments.git
synced 2024-12-18 14:10:28 +00:00
commit 9755682b98
@@ -131,10 +131,10 @@ Finally, to train on a single GPU simply run the `python train.py` script. Have
 OpenAI GPT-2 checkpoints allow us to get some baselines in place for openwebtext. We can get the numbers as follows:
 
 ```sh
-python train.py eval_gpt2
-python train.py eval_gpt2_medium
-python train.py eval_gpt2_large
-python train.py eval_gpt2_xl
+$ python train.py config/eval_gpt2.py
+$ python train.py config/eval_gpt2_medium.py
+$ python train.py config/eval_gpt2_large.py
+$ python train.py config/eval_gpt2_xl.py
 ```
 
 and observe the following losses on train and val: