From 4edc137aa08664794aa0c9ab5c36d9a8f53d0e0e Mon Sep 17 00:00:00 2001 From: osmarks Date: Sun, 2 Mar 2025 09:56:11 +0000 Subject: [PATCH] =?UTF-8?q?Edit=20=E2=80=98the=5Fseventy=5Fmaxims=5Fof=5Fm?= =?UTF-8?q?aximally=5Feffective=5Fmachine=5Flearning=5Fengineers=E2=80=99?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- ...axims_of_maximally_effective_machine_learning_engineers.myco | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/the_seventy_maxims_of_maximally_effective_machine_learning_engineers.myco b/the_seventy_maxims_of_maximally_effective_machine_learning_engineers.myco index 8635735..81017ba 100644 --- a/the_seventy_maxims_of_maximally_effective_machine_learning_engineers.myco +++ b/the_seventy_maxims_of_maximally_effective_machine_learning_engineers.myco @@ -54,7 +54,7 @@ Based on [[https://schlockmercenary.fandom.com/wiki/The_Seventy_Maxims_of_Maxima *. The framework you’ve got is never the framework you want. *. The data you’ve got is never the data you want. *. It’s only too many layers if you can’t fit them in VRAM. -*. It’s only too many parameters if they’re multiplying NaNs. +*. It’s only too much compute if the power grid collapses. *. Data engineers exist to format tables for people with real GPUs. *. Reinforcement learning exists to burn through compute budgets on simulated environments. *. The whiteboard is mightiest when it sketches architectures for more transformers.