1
0
mirror of https://github.com/osmarks/nanogpt-experiments.git synced 2024-12-18 06:00:29 +00:00

Commit Graph

  • f2fc4be69b mention 4gpu loss as well in readme Andrej Karpathy 2022-12-29 17:26:42 +0000
  • fa57d464d7 pull out dtype up top Andrej Karpathy 2022-12-29 05:32:55 +0000
  • e7bac659f5 oops missed one # have to fix Andrej Karpathy 2022-12-29 05:24:14 +0000
  • 97e2ab1b8d enhance readme, add some todos Andrej Karpathy 2022-12-29 05:23:36 +0000
  • cc11744131
    Add MIT LICENSE file Andrej 2022-12-28 21:11:26 -0800
  • dea1507252 add support for DDP training. the scaling timings right now do not look good by default, have to dig more into Andrej Karpathy 2022-12-29 05:06:07 +0000
  • ee6459f1d0 readme tweaks Andrej Karpathy 2022-12-29 02:00:25 +0000
  • 3000cf5dda add pytorch profiler support. not sure how to support both profiler and simple benchmarking, a bit gnarly atm hmm Andrej Karpathy 2022-12-29 01:49:53 +0000
  • b760ef1358 add data loading into benchmarking as well, just for completeness Andrej Karpathy 2022-12-29 00:05:32 +0000
  • 70b5d93aee add benchmarking script v0 Andrej Karpathy 2022-12-28 23:55:43 +0000
  • 5d2b4807bf adding a lightweight configurator that may be a terrible mistake lol. also adding configs to evaluate the baseline GPT2 versions released by OpenAI on OWT. we have some ways to go to match those numbers atm Andrej Karpathy 2022-12-28 23:31:23 +0000
  • c9fe00c0e9 small readme clarification and training script defaults changes Andrej Karpathy 2022-12-28 01:45:55 +0000
  • fe8042867c first very bad commit Andrej Karpathy 2022-12-28 00:58:19 +0000