RULER
A benchmark for evaluating long-context language models.
RULER Visits Over Time
Monthly Visits
29,742,941
Bounce Rate
44.20%
Pages per Visit
5.9
Visit Duration
00:04:44