Edit ‘the_seventy_maxims_of_maximally_effective_machine_learning_engineers’

This commit is contained in:
osmarks
2025-10-03 11:19:49 +00:00
committed by wikimind
parent b91e841ad8
commit dd203d3e87

View File

@@ -39,7 +39,7 @@ Based on [[https://schlockmercenary.fandom.com/wiki/The_Seventy_Maxims_of_Maxima
*. There is no “overkill.” There is only “more tokens” and “CUDA out of memory.”
*. Whats trivial in Jupyter can still crash in production.
*. Theres a difference between spare GPUs and GPUs youve accidentally mined Ethereum on.
*. Not all NaN is a bugsometimes its a feature.
*. Not all NaN is a bug sometimes its a feature.
*. “Do you have a checkpoint?” means “I cant fix this training run.”
*. “Theyll never expect this activation function” means “I want to try something non-differentiable.”
*. If its a hack and it works, its still a hack and youre lucky.