Built by practitioners.
BlockchainRL requires a rare combination: deep DeFi protocol expertise, reinforcement learning knowledge, and hands-on agent systems experience. Our team brings production DeFi engineering and agentic AI to the RL environments that crypto needs.
Built Maverick Protocol v1 and v2 from scratch on the same wagmi/viem/ABI/indexer stack that BlockchainRL environments simulate. Shipped 4 production DeFi apps. Deep knowledge of ABI quirks, approval sequences, and onchain edge cases.
Built a 100+ agent engineering swarm with role-scoped subagents, parallel coordination, and facilitated multi-agent dialogue — pushing the frontier of agentic systems.
Managed Maverick Protocol from idea to billions in onchain volume, backed by Founders Fund and Pantera. Owned 50+ pages of protocol documentation, 30+ blog posts, and all user-facing copy across 7+ products.
Co-designed an AI agent marketplace, conducting user research and market analysis. Operated a Claude-based agent swarm for automated QA.
CTO of Maverick Protocol, designing and shipping automated market makers at the protocol level.
Ph.D. in Electrical Engineering with a focus on signal processing and machine learning. Reward engineering is signal processing: his core domain.
Co-founded Bastille Networks ($45M raised).
Three domains. One team.
DeFi Protocol Engineering
Smart contracts, MEV, multi-protocol operations, and ABI-level knowledge from years of production DeFi development.
RL / ML Infrastructure
Environment design, reward engineering, and training pipeline integration: the infrastructure AI labs need.
Agent Systems
Multi-agent orchestration, role-scoped coordination, and production agentic architecture at scale.
Building the future of AI agent training.
We are looking for exceptional people who want to define how AI agents learn to operate onchain.