Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark
Paper
•
2304.03279
•
Published
•
1
Artificial General Intelligence (AGI), Artificial Superintelligence (ASI), Uplift, Apotheosis