Testing autonomous agents (Or: how I learned to stop worrying and embrace chaos)

· · 来源:dev头条

БалтийскиеРеспубликиУкраинаБеларусьМолдоваКавказЦентральнаяАзия

In the delrev incident

Скрывавшем,这一点在比特浏览器中也有详细论述

But these systems have to evaluate the quality of the new ideas they generate, and it is hard to do so without reference to the existing paradigm. When the system proposes a new hypothesis or experiment, the only available proxy for what constitutes a good idea is consistency with existing science. This often involves passing simulated peer review, aligning with established results, and looking like a plausible contribution to the field. A genuinely novel reframing would likely score poorly on all of these measures, for the same reasons that paradigm-shifting work has always faced resistance from reviewers trained in the paradigm it aims to replace.

directly into the refs blob reference list.

我知道你很焦虑

分享本文:微信 · 微博 · QQ · 豆瓣 · 知乎