āembedded self-justification,ā or something like that
Sometimes I wonder what the MIRI-type crowd thinks about some issue related to their interests.Ā So I go to alignmentforum.org, and quickly get in over my head, lost in a labyrinth of issues I only half understand.
I can never tell whether theyāve never thought about the things Iām thinking about, or whether they sped past them years ago.Ā They do seem very smart, thatās for sure.
But if they have terms for what Iām thinking of, I lack the ability to find those terms among the twists of their mirrored hallways.Ā So I go to tumblr.com, and just start typing.
Youāre anĀ āagentā trying to take good actions over time in a physical environment under resource constraints.Ā You know, the usual.
You currently spend a lot of resources doing a particular computation involved in your decision procedure.Ā Your best known algorithm for it is O(N^n) for some n.
Youāve worked on the design of decision algorithms before, and you think this could perhaps be improved.Ā But to find it, youād have to shift resources some away from running the algorithm for a time, putting them into decision algorithm design instead.
You do this.Ā Almost immediately, you discover an O(N^(n-1)) algorithm.Ā Given the large N you face, this will dramatically improve all your future decisions.
Clearly (ā¦āclearlyā?), the choice to invest more in algorithm design was a good one.
Could you have anticipated this beforehand? Ā Could you have acted on that knowledge?
Oh, youāre so very clever!Ā By now youāve realized you need, above and beyond your regular decision procedure to guide your actions in the outside world, aĀ āmeta-decision-procedureā to guide your own decision-procedure-improvement efforts.
Your meta-decision-procedure does require its own resource overhead, but in exchange it tells you when and where to spend resources on R&D.Ā All your algorithms are faster now.Ā Your decisions are better, their guiding approximations less lossy.
All this, from aĀ meta-decision-procedure thatās only a first draft.Ā You frown over the resource overhead it charges, and wonder whether it could be improved.
You try shifting some resources away from āregular decision procedure designā intoĀ āmeta-decision-procedure-design.āĀ Almost immediately, you come up with a faster and better procedure.
Could you have anticipated this beforehand? Ā Could you have acted on that knowledge?
Oh, youāre so very clever! Ā By now youāve realized you need, above and beyond your meta-meta-meta-decision-procedure, a āmeta-meta-meta-meta-decision-procedureā to guide your meta-meta-meta-decision-procedure-improvement efforts.
Way down on the object level, you have not moved for a very long time, except to occasionally update yourĀ meta-meta-meta-meta-rationality blog.
Way down on the object level, a dumb and fast predator eats you.
Could you have anticipated this beforehand? Ā Could you have acted on that knowledge?