We generate &
verify RL
rollouts.
Frontier agents learn inside environments with provable reward — agents that reach the goal, paths a verifier can re-walk step for step. DeRL generates those rollouts and verifies every one by re-execution, then anchors the proofs on Solana. Every number below is read live from the derl_core program — nothing simulated.
One rollout, end to end.
Every verified rollout travels this exact path. Re-execution is the real check — each reward is recomputed by an independent verifier, not trusted — and the proof is anchored on Solana. The counters below are read live from the chain.
A bounty escrows funds against an environment on-chain — no cherry-picking the easy work.
A node solves a real task — a gridworld path, code against hidden tests, or a math answer — and signs it (ed25519).
An independent verifier re-executes the reward: re-walks the path, runs the hidden tests, recomputes the answer. Reproduce it yourself via /api/verify.
The verifier confirms and signs; a curator drops near-duplicates. A wrong reward is rejected and slashed on-chain.
Verified rollouts are hashed into a sha-256 Merkle tree and the root is anchored on Solana.
The derl_core program releases escrow per verified rollout and burns the protocol fee — all on-chain.
Verifiable,
on Solana.
Everything below is read live from the derl_core program on Solana mainnet — no simulation. Registered environments, anchored Merkle batches, staked $DEREL, and the feed of real program transactions. Reproduce any reward yourself from its seed.
Pay. Get real fuel.
The product is verified AI training data. Pay on-chain in SOL, USDC or $DEREL and the network generates a real dataset of verified RL rollouts — every reward re-executed, a Merkle root as proof — and hands you the file to download. No subscription, no API key, no trust required.
Choose a family (code, math, grid or mixed) and size, then pay on-chain. The payment is verified on Solana before anything is generated.
The network produces that many rollouts and re-executes every reward — only verified ones make the cut (RLVR).
You get a JSONL dataset — task, solution, reward, trajectory per row — plus a sha-256 Merkle root over the set. Download it straight from the browser.
Stake, settle, farm rollouts.
Connect your wallet to deposit, withdraw and stake against the on-chain derl_core program — your balance is always 100% withdrawable. Fund a bounty to commission verified rollouts in any environment, with funds escrowed and released only on re-executed proof.
