Rafay Systems Empowers AI Factories to Monetize Token-Based Access to Models

New Token Factory capabilities enable GPU providers to offer token-metered AI services to enterprises and developers

Apr. 2, 2026 at 9:59pm

Rafay's Token Factory empowers GPU providers to monetize token-based access to AI models, transforming them into AI factories. (Sunnyvale Today)

Rafay Systems, a leader in infrastructure orchestration for AI and cloud-native workloads, has announced the general availability of Token Factory, a suite of capabilities in the Rafay Platform that delivers token-based access to AI models and services. Token Factory gives infrastructure operators the metering, pricing and access-control capabilities they need to turn GPU capacity on accelerated computing infrastructure into a token-based revenue stream.

Why it matters

Token-based access to models and other AI services has become a foundational requirement in the AI industry, allowing AI factory operators to differentiate themselves from commodity GPU providers. Token Factory empowers these operators to serve the growing demand for token-based AI consumption, which is being driven by agentic frameworks like OpenClaw and NVIDIA NemoClaw that consume significantly more tokens than conventional AI interactions.

The details

Token Factory extends the Rafay Platform with a purpose-built monetization and metering layer for AI services. It enables AI factory operators to expose AI models through token-metered API endpoints and to define pricing, access controls and quotas for them. Enterprises and retail users alike can then track token consumption and enforce policies in real time across users, applications and agentic workflows. Token Factory has been validated to work with OpenClaw and NVIDIA NemoClaw, which are driving the highest-velocity token consumption in the market today.
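To make the idea of a token-metered endpoint with pricing and quotas concrete, here is a minimal Python sketch. It is not Rafay's API or implementation: the `TokenMeter` class, its method names, the price and the quota are all hypothetical, chosen only to show how usage could be counted per tenant, checked against a quota before it is accepted, and converted into a charge.

```python
from dataclasses import dataclass, field


class QuotaExceeded(Exception):
    """Raised when a request would push a tenant past its token quota."""


@dataclass
class TokenMeter:
    # Hypothetical price in dollars per 1,000 tokens (not a Rafay figure).
    price_per_1k_tokens: float
    # Hypothetical per-tenant quotas, expressed in tokens.
    quotas: dict[str, int]
    # Running token totals per tenant.
    usage: dict[str, int] = field(default_factory=dict)

    def record(self, tenant: str, prompt_tokens: int, completion_tokens: int) -> int:
        """Meter one request and return the tenant's remaining quota."""
        total = prompt_tokens + completion_tokens
        used = self.usage.get(tenant, 0)
        quota = self.quotas.get(tenant, 0)  # tenants without a quota get no access
        if used + total > quota:
            # Reject before recording, so usage never exceeds the quota.
            raise QuotaExceeded(f"{tenant}: {used + total} tokens exceeds quota of {quota}")
        self.usage[tenant] = used + total
        return quota - self.usage[tenant]

    def bill(self, tenant: str) -> float:
        """Return the charge in dollars for the tenant's metered usage."""
        return self.usage.get(tenant, 0) / 1000 * self.price_per_1k_tokens


if __name__ == "__main__":
    meter = TokenMeter(price_per_1k_tokens=0.50, quotas={"acme": 10_000})
    remaining = meter.record("acme", prompt_tokens=1_200, completion_tokens=800)
    print(remaining, meter.bill("acme"))
```

In this sketch the quota check runs before usage is recorded, which is one simple way to enforce a policy at request time rather than discovering an overage at billing time.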

  • Rafay announced the general availability of Token Factory on April 2, 2026.
  • The GPU-as-a-Service market is projected to reach $7.36 billion in 2026 and grow to $26.43 billion by 2031, according to Research and Markets.
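
The two market figures above imply a compound annual growth rate that the projection does not state directly. The short Python calculation below derives it from the cited endpoints only, assuming five compounding years between 2026 and 2031; the function name `implied_cagr` is illustrative.

```python
def implied_cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate implied by two endpoint values."""
    return (end_value / start_value) ** (1 / years) - 1


# Endpoints cited from Research and Markets, in billions of dollars.
rate = implied_cagr(7.36, 26.43, years=2031 - 2026)
print(f"{rate:.1%}")  # prints 29.1%
```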

The players

Rafay Systems

A leader in infrastructure orchestration for AI and cloud-native workloads, providing a Platform-as-a-Service (PaaS) that enables organizations to operationalize compute infrastructure with self-service automation, governance and multi-tenancy.

OpenClaw

An open-source AI agent platform that executes multi-step workflows, calls external tools, and runs continuously to complete real tasks.

NVIDIA NemoClaw

An extension of the OpenClaw model that provides policy-based privacy and security guardrails for production and enterprise deployments of agentic AI workflows.

Haseeb Budhani

CEO and co-founder of Rafay Systems.

Jensen Huang

CEO of NVIDIA, who elevated the concept of "tokenomics" to a keynote theme at GTC 2026.


What they’re saying

“Token Factories are the new cellphone companies. Similar to how cellphone companies used to sell pre- and post-paid minute plans, AI factories are beginning to sell pre- and post-paid token plans.”

— Haseeb Budhani, CEO and co-founder of Rafay Systems

At GTC 2026, NVIDIA CEO Jensen Huang elevated the concept of "tokenomics" to a keynote theme, describing tokens as a new commodity and envisioning a future in which token-based access becomes the standard way enterprises and developers consume AI.

What’s next

Token Factory is available now as part of the Rafay Platform. The company says it looks forward to supporting a thousand AI factories around the world with the offering.

The takeaway

Token Factory enables infrastructure operators to turn their GPU capacity into a token-based revenue stream, transforming them from commodity GPU providers into AI factories that can monetize token-based access to AI models and services. This gives them a new competitive edge in the growing GPU-as-a-Service market.