
safe-reward

Public

A prototype AI safety library that lets an agent maximize its reward by solving a puzzle, with the aim of preventing the worst-case outcomes of perverse instantiation.

Created: 2022-10-14T08:15:20
Updated: 2022-11-01T09:58:46
Stars: 8
Stars Increase: 0