diff --git a/autogollark.myco b/autogollark.myco
index ff39603..55dc400 100644
--- a/autogollark.myco
+++ b/autogollark.myco
@@ -12,6 +12,7 @@ Autogollark is much [[safer]] than [[instruction-tuned]] systems optimized based
 * {Fix lowercasing issue.
 * Due to general personality stability. Need finetune or similar.
 * One proposal: use internal finetune to steer big model somehow. Possibly: use its likelihood (prefill-only) to evaluate goodness of big model output wrt. gollark personality, and if it is too bad then use finetune directly.
+* Is GCG code salvageable? NanoGCG, maybe.
 }
 * {Increased autonomy (wrt. responses).
 * Use cheap classifier to evaluate when to respond.