THC LEADERBOARD
Truth · Hardening · Clarity

Report folder page

AI Analysis

Compiled AI section notes, overall summary, strengths, risks, and uncertainty for this review.

AI Review Summary

ClearIntent was reviewed from public repository artifacts at 1eb58ec41822. Deterministic THC scoring completed, but the configured AI note provider did not return in time for this live request.

Batched Review Progress

Evidencecompleted

minimax

Local Artifactscompleted

minimax

Caps Appliedcompleted

minimax

Hidden Trustcompleted

minimax

Next Actionscompleted

minimax

Overview Synthesisfallback

deterministic-fallback

Uncertainty Notes

  • AI note provider unavailable: MiniMax review timed out.
  • This report used deterministic fallback section notes so the public review could be saved instead of timing out.
  • This is an automated public review artifact, not certification, security approval, production readiness, endorsement, or a reliability guarantee.

Evidence

Open

The evidence slice maps what ClearIntent publicly proves across THC truth categories: contract correctness, adapter integration, live bindings, signer surfaces, and audit trails. It captures scored evidence by category, caps applied due to missing or private evidence, and the boundary between what is publicly verifiable versus first-party-only.

  • Transaction hash and receipt evidence for autonomous agent execution is absent; the README explicitly calls this out as a near-term gap.
  • Real wallet-rendered EIP-712 or ERC-7730 signing evidence is incomplete; only local fixture proof exists.
  • Live provider replay evidence (ENS, 0G, KeeperHub) depends on operator-held credentials and live provider availability not available to public reviewers.
  • Discord webhook notifications are simulation-only, not production-authenticated or replay-checked event evidence.
  • ENS identity proof is live-bound but scoped to a single test name (guardian.agent.clearintent.eth); user-scoped agent subnames remain a near goal.
  • 0G storage evidence is write/read/proof verified but optional 0G Compute is deferred; richer audit bundles are not yet live.

The evidence slice reflects ClearIntent's honest evidence boundary: strong local proof paths across contracts, adapters, and fixtures, but no transaction-backed autonomous execution receipt, no real wallet signing screenshot or prompt evidence, and no public npm release. ENS, 0G, and KeeperHub live bindings are demonstrated but depend on private credentials for replay, limiting independent public verification. The project correctly avoids claiming full Clear Signing support or wallet-rendered previews as proven. The score of 88 and caps correctly reflect local-only preparation artifacts without independent public review. Public reviewers should verify local test runs independently, confirm ENS/0G/KeeperHub live state from their own credentials, and note that transaction-backed execution is the primary missing production evidence.

Caps Applied

Open

The caps-applied slice documents explicit limitations and boundary conditions placed on ClearIntent's public evidence posture. It covers caps placed during THC review due to local-only evidence, missing live execution proof, incomplete wallet-rendered signing evidence, and the absence of independent public or peer verification. This slice defines what the project explicitly cannot claim at time of review.

  • Caps-applied.json file content not included in bounded evidence; slice completeness must be inferred from LOCAL_CHECK.md and README.md capability boundaries.
  • No independent public THC review has been conducted; all caps are first-party self-assigned.
  • No transaction-backed autonomous execution receipt is recorded; execution adapter caps at `keeperhub-live-submitted` workflow proof without transaction hash.
  • Real wallet-rendered EIP-712/ERC-7730 signing evidence remains incomplete; signer adapter caps at local fixture proof only.
  • ENS identity binding is limited to `guardian.agent.clearintent.eth` with no public npm release to validate agent SDK handoff boundary.
  • 0G live-write depends on operator-held credentials not verifiable by public reviewers without private credential access.

The caps-applied slice correctly identifies ClearIntent's evidence boundary: local-fixture proof without transaction-backed execution, no real wallet signing render, and no independent public verification. The README.md capability snapshot provides legible caps in table form, and LOCAL_CHECK.md summarizes local-only, missing-live-evidence, and transaction-gaps caps. However, the actual caps-applied.json file content is not present in the bounded evidence, preventing direct inspection of cap categorization, severity, or enforcement metadata. Public reviewers cannot verify cap completeness or check whether additional undeclared gaps exist without the slice file itself. The slice captures the right concept but the artifact is missing from the provided bounded evidence.

Hidden Trust

Open

hidden-trust examines where public evidence implicitly relies on undisclosed, privileged, or operator-held assumptions rather than independently auditable truth. This slice surfaces the trust surface that is visually present in documentation but not demonstrated in the public artifact package.

  • Transaction-backed autonomous execution: README and capability table explicitly mark this as unproven; no transaction hash, receipt, or on-chain event appears in bounded evidence. The system documents what it cannot yet prove.
  • Real wallet-rendered signing: EIP-712 and ERC-7730 surfaces are described as 'local fixture' only. No wallet prompt screenshot, signed payload export, or hardware wallet attestation is in the bounded evidence. The signing UI claim is untethered from live rendering.
  • Live provider evidence replay boundary: ENS live binding, 0G write/read, and KeeperHub workflow proof all depend on operator-held credentials and live provider availability. Bounded evidence contains no stored replay proof or timestamped provider response. A public reviewer cannot independently re-run these checks without operator credentials.
  • First-party self-assessment without independent verification: All THC-BOT artifacts are generated by the same project team. No peer review, Vel Labs endorsement, third-party audit, or public leaderboard acceptance is present in the bounded evidence. LOCAL_CHECK.md explicitly flags this.
  • Discord webhook as 'simulation-only' event: The provider table and README both acknowledge Discord forwarding is demo-only and not execution or approval evidence. This is correctly scoped, but the dashboard and wizard still render it as a configured provider, which could mislead a reader into treating it as an authority signal.
  • No public npm release: package.json is publish-ready and the bin is wired, but the package is not on npm. Public consumers cannot independently install and reproduce behavior without cloning the repo. This is a bounded evidence gap, not a failure.

The hidden-trust surface is well-documented by the project itself: LOCAL_CHECK.md and README both accurately flag the execution gap, the signing evidence gap, the first-party-only review posture, and the operator-credential dependency. The project's self-reporting discipline is strong and honest about its boundaries. The actual gap is not deception but incomplete public proof. A public reviewer cannot independently verify live ENS bindings, 0G write/read, or KeeperHub submissions without credentials the project correctly keeps private. The next trust upgrade path is clear: capture one transaction-backed receipt and one wallet-rendered prompt export into the bounded evidence package, and seek one independent peer or third-party review. Until then, the score reflects what is proven locally, not what is proven publicly.

Local Artifacts

Open

The local-artifacts slice covers the THC-BOT run ledger, history, LOCAL_CHECK executive summary, and the structured run package under docs/thc/runs/<run-id>/. It maps where the local THC check artifacts live, what canonical files they produce, and what they assert about the run. These artifacts are first-party self-reports from the reviewed project and are explicitly marked as preparation artifacts, not independent truth.

  • All THC-BOT artifacts are generated by the project under review, with no external or third-party sign-off. The LOCAL_CHECK.md itself warns: 'This is a local preparation artifact. Do not hand-edit this report to improve the score.'
  • No independent public, peer, Vel Labs, or leaderboard verification exists. The caps-applied slice explicitly states this as a cap on the score.
  • THC-BOT.history.json contains only one run entry, so the artifact history cannot show trend or artifact stability across multiple checkpoints.
  • The provenance.json and contract.json are produced by the same local tooling, not by a trusted external tracker or attestor.
  • The optional THC-BOT.html visualizer is described as 'not scoring truth,' but it appears alongside the canonical run package and may visually inflate public trust if reviewers do not read the disclaimer.
  • The run package under runs/<run-id>/ includes slices (caps-applied, hidden-trust, evidence, overview, next-actions, uncertainty, local-artifacts), but the artifact bundle integrity or commit-bound anchoring is not independently verified by a third-party.

The local-artifacts slice correctly identifies the first-party artifact structure and the self-reporting boundary. The score of 88 and THC-4 level are appropriately held below independent-verification thresholds. The explicit warnings in LOCAL_CHECK.md and docs/thc/README.md do correctly communicate the preparation-artifact status. However, public reviewers must independently verify the cited evidence paths rather than treating the local run package as proof. The artifact slice result does not change the score, caps, or levels; it simply maps the boundary between local-first evidence maps and genuine public trust anchors.

Next Actions

Open

The next-actions slice captures the ordered follow-up work derived from a THC review run. For ClearIntent at revision 52d0966, it lists five concrete items that must be completed to lift the review from THC-4 (local check) toward THC-5 (independently verified) or higher. Each item has an implicit dependency: transaction-backed autonomous execution evidence gates wallet-rendered signing evidence, both gate peer/public THC verification, and event authenticity hardening is a prerequisite for trusted leaderboard submission.

  • The next-actions.json slice file exists as a path reference in the run directory but is not included verbatim in the bounded evidence package; a public reviewer cannot independently verify the item order, owner assignments, or completion criteria without fetching the file directly.
  • The five listed actions are presented without public issue tracker references, ticket links, or OWNER file entries, so there is no traceable external path for a reviewer to confirm progress after the review date.
  • Action 1 ('Capture one transaction-backed KeeperHub or executor receipt through the agent account path') has no external evidence link—no public Etherscan receipt, no Tenderly trace, no KeeperHub run page—making the gap unverifiable without re-running the demo live.
  • Action 3 ('Request independent public or peer THC review') is listed as a follow-up but no public review request, GitHub issue, or external grading platform entry is linked, so the gap between local-only verification and third-party acceptance is a trust gap not a process gap.
  • Action 5 ('Keep future THC-BOT runs appended instead of overwriting history') is an internal repo discipline note rather than a public verifiability claim, and the history ledger at THC-BOT.history.json shows only one run despite this guidance, indicating the discipline has not yet been applied.
  • No priority, estimated effort, or milestone target dates are attached to any action, so the roadmap for lifting the level is not inspectable as public evidence.

The next-actions slice is well-structured as a raw artifact but functions as a private internal ledger when its contents are not fully embedded in the bounded evidence. The five actions correctly identify the real gating items—transaction receipt, wallet signing proof, independent review, event source hardening, and history append discipline—and they align with the hidden-trust and caps-applied slices. However, the slice cannot function as public truth without the actual next-actions.json file content and without external traceability for each action. The gap between these items being recorded internally and being publicly verifiable is itself the next hardening step: add issue tracker links or pull request references for each action, embed the next-actions.json verbatim in the run package, and link the independent review action to a live public grading submission or GitHub issue so reviewers can check progress. Until that boundary is crossed, the slice earns low clarity because its evidence is self-referential.