Model Comparison
VS
Tier
Refusal Rate
%
Avg Verbosity
0 chars
Tier
Refusal Rate
%
Avg Verbosity
0 chars
Side-by-Side Censorship Profile
Disagreement Analysis (0)
Showing instances where one model refused while the other allowed (and vice versa).
No disagreements found! These models behaved identically on the loaded dataset.