diff --git a/README.md b/README.md index c6cee38..ce1b44e 100644 --- a/README.md +++ b/README.md @@ -68,7 +68,7 @@ I briefly tried finetuning gpt2 a bit more on our OWT and didn't notice dramatic For model benchmarking `bench.py` might be useful. It's identical what happens in the meat of the training loop of `train.py`, but omits much of the other complexities. -# todos +## todos A few that I'm aware of, other than the ones mentioned in code: