Edit ‘the_seventy_maxims_of_maximally_effective_machine_learning_engineers’

osmarks
2026-02-28 15:43:22 +00:00
committed by wikimind
parent d7fba4347d
commit 3de911aa82
@@ -38,7 +38,7 @@ Based on [[https://schlockmercenary.fandom.com/wiki/The_Seventy_Maxims_of_Maxima
*. When the loss plateaus, the wise call for more data.
*. There is no “overkill.” There is only “more tokens” and “CUDA out of memory.”
*. What’s trivial in Jupyter can still crash in production.
-*. There’s a difference between spare GPUs and GPUs you’ve accidentally mined Ethereum on.
+*. There’s a difference between spare GPUs and idle GPUs.
*. Not all NaN is a bug; sometimes it’s a feature.
*. “Do you have a checkpoint?” means “I can’t fix this training run.”
*. “We propose a novel method” means “This has no sound mathematical basis.”