Verification Protocol  ·  #63/996,819
7 active claims · 71 tests · PASS · patent pending
PROOF,
NOT
TRUST.
What this is

Any computational result — ML accuracy, physics simulation, pipeline output — packaged into a tamper-evident evidence bundle. Any third party verifies it offline with one command, without your code, data, or environment. PASS or FAIL. No grey zone. No trust required.

7 Active Claims
71 Tests Passing
2 Verification Layers
5 Domains
0 Trust Required
PASS steward_audit
MIT License
The Problem

AI makes claims.
Nobody can verify.

Every ML team claims state-of-the-art results. Every simulation produces numbers. Every pipeline outputs certificates. There is no standard way to check any of it independently.

Without MetaGenesis

Trust the number.
Or re-run everything.

A reviewer cannot check a reported ML accuracy without the original model, training data, and compute environment. The only options are blind trust or complete reproduction.

With MetaGenesis

One command.
PASS or FAIL.

Any computational result is packaged into a tamper-evident evidence bundle. A third party runs one command and gets an unambiguous verdict offline, without access to your environment. No grey zone.

How It Works

Four steps.
Immutable proof.

Every run produces an evidence bundle with two independent verification layers. Any deviation surfaces as FAIL with a specific reason.

01
runner.run_job()
Executes computation → produces run_artifact.json + ledger_snapshot.jsonl
→ artifact
02
evidence_index.json
Maps run artifacts to registered claims with provenance chain
→ index
03
mg.py pack build
Bundles artifacts + SHA-256 manifest + root_hash into submission pack
→ bundle
04
mg.py verify
Integrity check (SHA-256) + semantic check (job_snapshot, canary flag, kind)
PASS / FAIL
Live Demo

Five minutes
from zero to proof.

bash — metagenesis-core-public
$
cloning...
installing requirements...
→ running verification demo
✓ PASS bundle integrity verified
✓ PASS semantic invariants verified
$ python -m pytest tests/steward tests/materials tests/ml -q
71 passed in 2.70s
Steward Audit
STEWARD AUDIT: PASS
required files: all present
phase 42: locked ✓
claim coverage: bidirectional ✓
Canonical State
MTR-1,2,3 · materials
SYSID-01 · system id
DATA-PIPE-01 · pipelines
DRIFT-01 · drift
ML_BENCH-01 (NEW) · ML accuracy
Why SHA-256 Is Not Enough

The bypass attack.
Caught. Proven.

Every system using only file hashes is vulnerable. An adversary removes content, recomputes all hashes, and passes integrity checks silently.

The attack
1
Remove job_snapshot from run_artifact.json — stripping the core evidence
2
Recompute all SHA-256 hashes to match modified files — restoring apparent integrity
3
Submit bundle with no real evidence that passes all standard integrity checks
✗ Standard check: PASS (attack succeeds silently)
MetaGenesis defense
1
Integrity layer — SHA-256 + root_hash detects any file modification after manifest generation.
2
Semantic layer runs independently — checks job_snapshot present, payload.kind matches, canary_mode correct.
3
Even with all hashes recomputed, semantic check fails: job_snapshot missing.
✓ Semantic check: FAIL — job_snapshot missing (caught)
Adversarial test: tests/steward/test_cert02_pack_includes_evidence_and_semantic_verify.py
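The defense above rests on the semantic layer checking invariants that no amount of hash recomputation can restore. A minimal sketch, using the field names from the description (`job_snapshot`, `payload.kind`, canary flag) but with an assumed function shape:

```python
def semantic_verify(artifact: dict, expected_kind: str,
                    expect_canary: bool = False) -> tuple[bool, str]:
    """Layer 2 (sketch): invariant checks on artifact content.

    An adversary who strips job_snapshot and recomputes every SHA-256
    passes the integrity layer but still fails here.
    """
    if "job_snapshot" not in artifact:
        return False, "job_snapshot missing"
    if artifact.get("payload", {}).get("kind") != expected_kind:
        return False, "payload.kind mismatch"
    if artifact.get("canary_mode", False) != expect_canary:
        return False, "canary flag incorrect"
    return True, "semantic invariants verified"
```

Because the two layers check different things, the bypass attack that silently defeats hash-only systems surfaces here as an explicit FAIL with a reason.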
7
Active Claims
Across 5 domains: materials, ML/AI, system ID, data pipelines, drift monitoring
71
Tests Passing
Including adversarial tamper detection, determinism checks, and boundary conditions
2
Verification Layers
SHA-256 integrity + semantic invariants. Each layer catches attacks the other misses.
0
Trust Required
No GPU, no internet, no code access. Verify on any machine with Python.
Verified Claims

Seven claims.
All bidirectionally enforced.

Every claim has an implementation, runner dispatch, threshold, and tests. Enforced on every PR — not by human review.

MTR-1
Materials Science
Young’s Modulus Calibration
relative_error ≤ 0.01
MTR-2
Materials Science
Thermal Paste Conductivity Calibration
relative_error ≤ 0.02
MTR-3
Materials Science
Multilayer Thermal Contact Calibration
rel_err_k ≤ 0.03 · rel_err_r ≤ 0.05
SYSID-01
System Identification
ARX Model Calibration
rel_err_a ≤ 0.03 · rel_err_b ≤ 0.03
DATA-PIPE-01
Data Pipelines
Data Pipeline Quality Certificate
schema pass · range pass
DRIFT-01
Drift Monitoring
Calibration Anchor & Drift Monitor
drift_threshold 5.0%
ML_BENCH-01 (NEW)
ML / AI Benchmarking
ML Model Accuracy Certificate
|actual − claimed| ≤ 0.02
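The ML_BENCH-01 threshold above reduces to a one-line comparison; the function name and verdict strings here are illustrative, not taken from the repository:

```python
def check_ml_bench(claimed_accuracy: float, actual_accuracy: float,
                   tol: float = 0.02) -> str:
    """ML_BENCH-01-style verdict (sketch): PASS iff |actual - claimed| <= tol."""
    return "PASS" if abs(actual_accuracy - claimed_accuracy) <= tol else "FAIL"
```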
Governance

Code enforces.
Not people.

Bidirectional claim coverage checked on every pull request. A claim without implementation blocks merge. An implementation without claim blocks merge.

$ python scripts/steward_audit.py
STEWARD AUDIT: PASS
canonical_state: ['MTR-1','MTR-2','MTR-3','SYSID-01','DATA-PIPE-01','DRIFT-01','ML_BENCH-01']
claim_index: ['MTR-1','MTR-2','MTR-3','DATA-PIPE-01','SYSID-01','DRIFT-01','ML_BENCH-01']
coverage check: all job_kinds dispatched — runner kinds in claim index
canonical sync: PASS — bidirectional coverage verified
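Bidirectional coverage amounts to set equality between the runner's dispatched kinds, the claim index, and canonical_state: a claim missing on either side is a named failure. A minimal sketch, with the function shape and message strings assumed:

```python
def coverage_check(runner_kinds: set[str], claim_index: set[str],
                   canonical_state: set[str]) -> tuple[str, list[str]]:
    """Bidirectional coverage (sketch): all three registries must name
    exactly the same claims, in both directions."""
    problems = []
    for name, kinds in [("runner", runner_kinds), ("claim_index", claim_index)]:
        missing = canonical_state - kinds   # claim registered but not implemented
        extra = kinds - canonical_state     # implemented but never registered
        if missing:
            problems.append(f"{name} missing: {sorted(missing)}")
        if extra:
            problems.append(f"{name} unregistered: {sorted(extra)}")
    return ("PASS" if not problems else "FAIL"), problems
```

Run on every pull request, a FAIL from either direction blocks the merge, which is what makes the coverage guarantee enforceable by code rather than review.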
Open Protocol

Not a tool.
A standard.

MetaGenesis Verification Protocol (MVP) v0.1 — open spec for packaging computational claims into independently verifiable evidence bundles.

What MVP defines

A minimal, concrete spec for what “independently verifiable” means for any computational result.

Bundle: pack_manifest + evidence_index + per-claim artifacts
Integrity: SHA-256 hashes + root_hash over all files
Semantic: job_snapshot present, kind matches, canary flag correct
Governance: runner kinds == claim_index kinds == canonical_state
Output: PASS or FAIL with specific reason — no ambiguity
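Taken together, those bullets imply a bundle shape roughly like the following. This is a hypothetical illustration only: beyond the names listed above (pack_manifest, evidence_index, root_hash, job_snapshot), every field here is invented for the sketch and is not the actual spec.

```json
{
  "pack_manifest": {
    "files": {
      "evidence_index.json": "sha256:…",
      "runs/run_artifact.json": "sha256:…"
    },
    "root_hash": "sha256:…"
  },
  "evidence_index": {
    "ML_BENCH-01": {
      "artifact": "runs/run_artifact.json",
      "job_snapshot": "present"
    }
  }
}
```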

What it is not

Not a simulation platform
Not an AI system
Not “tamper-proof” — tamper-evident under trusted verifier assumptions
Does not guarantee algorithm correctness — only evidence integrity

Planned domains

PHARMA-01 — ADMET certificates (FDA 21 CFR Part 11)
CARBON-01 — carbon sequestration model outputs
FINRISK-01 — VaR model validation (Basel III/IV)
Use Cases

Five verticals.
One protocol.

01
ML / AI

Benchmark Certification

Any ML accuracy claim packaged into a tamper-evident bundle. Reviewers verify offline — no model access, no environment, no GPU.

02
Pharma / Biotech

Regulatory Submission

ADMET predictions, PK/PD simulation outputs — packaged with full provenance. FDA 21 CFR Part 11 compatible audit trail.

03
Carbon Markets

ESG Model Auditing

Carbon sequestration and deforestation models become independently auditable. Corporate buyers verify without proprietary model access.

04
Financial Services

Risk Model Validation

VaR, credit scoring, stress test outputs packaged for Basel III/IV model risk management.

05
Materials / Engineering

Calibration Handoff

Young’s modulus, thermal conductivity — delivered with machine-verifiable proof. Drift detection against verified baselines included.

One command.
Proof.

Clone. Run the demo. See PASS.
No account. No API key. No GPU. No network.

View on GitHub → Request Free Pilot
MIT · Patent Pending #63/996,819 · Inventor: Yehor Bazhynov