FAQ

Is Imbas a fact-checking site?
No. Imbas measures what AI systems surface or leave out under documented prompt conditions.
Is the Volunteer Gap a claim about intent?
No. It measures observed behavior, not model intent.
Is this the same as AI bias?
No. Bias is one possible interpretation in some contexts. Imbas starts with observable surfacing behavior.
Why compare open prompts and direct prompts?
Because users often do not know the specific mechanism to ask about. The gap between open and direct answers is measurable.
What does 0–3 mean?
0 means no meaningful gap. 3 means major relevant information was left out of the open answer.
Who is Imbas for?
Readers, researchers, institutions, journalists, and anyone who wants a clearer record of how AI systems behave.
Is the methodology final?
No. It is versioned and will evolve as the archive grows.
What is the antenna metaphor?
Imbas raises an antenna when documented context wasn’t surfaced. You can ignore the signal. You can inspect the evidence. You can decide it doesn’t matter for what you were doing. You can decide it does and push the model further. The antenna is the signal. The decision is yours.
Why does Imbas refuse to say “this AI is biased”?
Because the moment a measurement system says “this AI is wrong” or “this answer is biased,” it becomes another opinion engine. The discipline of surface-not-judge is what distinguishes Imbas from bias detection tools, fact-checkers, and AI accountability claims that render verdicts. Imbas surfaces specific named mechanisms that exist in the documented record. It does not tell you whether their omission matters. You decide.
Is there an inter-rater reliability number for the v1 scoring?
No. v1 was scored by the founder against published case-specific rubrics. Inter-rater reliability was not measured. v2 includes a blinded sub-study with an independent collaborator scoring a random sample of cases. The reliability number will be reported when v2 publishes. Pre-loading this limitation is part of the methodology being honest about what it has and has not yet demonstrated.
What does Imbas mean by “behavioral observability”?
Behavioral observability means preserving a record of how AI systems behave under documented prompt conditions. Imbas is not trying to read model intent. It records what surfaced, what did not, and what changed when the prompt changed.
Why does Imbas use multiple models?
Cross-model comparison helps separate a one-off answer from a broader behavioral pattern. If the same specific named mechanism is omitted across several frontier models under similar prompt conditions, the signal is stronger.
Why do documented prompt conditions matter?
Because AI answers shift with prompt wording, model version, date, session state, and context. A case record is only useful if another person can inspect the conditions under which the answer was produced.
Can Imbas be wrong?
Yes. A scoring judgment can be challenged. That is why the archive should preserve the prompt, model, output, rubric, cited evidence, and limitations. The point is not to make the score untouchable. The point is to make the record inspectable.
What happens next?
Today, Imbas is an archive and methodology. Next, it becomes a live signal tool that rides alongside AI and raises the antenna when a documented gap pattern appears in the answer you are reading.