
Cloudflare Workers AI

service · ai, cloudflare, workers, inference, server-side, primary


Role

Server-side AI inference inside the umbrella Hono Worker at api.oriz.in — the place to put any AI call where:

Browser-side AI stays on Puter.js — see the split decision at decisions/architecture/ai-puter-plus-cf-workers-ai.md.
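A server-side call of the kind described above might look roughly like the sketch below. It is a minimal illustration, not the actual code behind api.oriz.in: the binding name `AI`, the model id, the helper name `generateText`, and the route path in the trailing comment are all assumptions. The binding is typed structurally here so the snippet stands alone; a real Worker would more likely use the `Ai` type from `@cloudflare/workers-types`.

```typescript
// Minimal structural type for the Workers AI binding (assumption: a real
// project would normally import the `Ai` type from @cloudflare/workers-types).
interface AiBinding {
  run(model: string, inputs: Record<string, unknown>): Promise<unknown>;
}

interface Env {
  AI: AiBinding; // assumed binding name, declared in the Worker's wrangler config
}

// Assumed model id; any text-generation model in the Workers AI catalog would do.
const MODEL = "@cf/meta/llama-3.1-8b-instruct";

// Runs one text-generation call through the binding and returns the text.
// No API key is involved: the binding is injected by the Workers runtime.
async function generateText(env: Env, prompt: string): Promise<string> {
  const result = (await env.AI.run(MODEL, { prompt })) as { response?: string };
  if (typeof result.response !== "string") {
    throw new Error("Workers AI returned no text response");
  }
  return result.response;
}

// Inside the Hono app this could be wired up roughly as (hypothetical route):
//   app.post("/ai/generate", async (c) => {
//     const { prompt } = await c.req.json<{ prompt: string }>();
//     return c.json({ text: await generateText(c.env, prompt) });
//   });
```

Because the call goes through a binding rather than an HTTP client, the same handler can read from the database, enqueue a job, or write to R2 before or after the inference step without leaving the Worker.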

Free tier

Card / subscription required?

NO. Same account as the rest of the Cloudflare stack — the account stays no-card per rules/no-card-on-file.md.

Quota-headroom plan

Per rules/interaction/never-hit-quotas.md:

Why two AI services?

Different surfaces:

| Use case | Service |
| --- | --- |
| Browser AI (chat in oriz-me, on-page assistants) | Puter.js |
| Server AI (inside Hono Worker, chained with DB / Queue / R2) | Cloudflare Workers AI (this file) |
| Hosted Gemini if a feature truly needs Google's specific model | Firebase AI Logic (firebase-ai-logic-basics skill) |

Picked together so each surface has a no-card free tier already sitting on infra the family uses. See decisions/architecture/ai-puter-plus-cf-workers-ai.md.

Alternatives

Swap cost

Medium. Workers AI's binding API is Cloudflare-specific, so a swap means rewriting the AI helper module to call OpenAI, Anthropic, or Hugging Face over HTTP and adding a credentials surface. Keep every AI call behind apps/api/src/ai/ so the swap stays confined to that one module.
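One way to keep the swap that small is to have the rest of the API depend on a provider-neutral interface and build the Workers AI version behind it. The sketch below is illustrative only: `TextGenerator`, `workersAiGenerator`, `httpGenerator`, and `PostJson` are hypothetical names, and the HTTP variant takes an injected request function as a placeholder rather than modelling any real provider's API.

```typescript
// Provider-neutral contract the route handlers would depend on (hypothetical).
interface TextGenerator {
  generate(prompt: string): Promise<string>;
}

// Structural stand-in for the Workers AI binding, as exposed on `env.AI`.
interface AiBinding {
  run(model: string, inputs: Record<string, unknown>): Promise<unknown>;
}

// Current implementation: delegates to the Cloudflare-specific binding.
function workersAiGenerator(ai: AiBinding, model: string): TextGenerator {
  return {
    async generate(prompt: string): Promise<string> {
      const result = (await ai.run(model, { prompt })) as { response?: string };
      if (typeof result.response !== "string") {
        throw new Error("Workers AI returned no text response");
      }
      return result.response;
    },
  };
}

// Placeholder for a future HTTP-based provider. `PostJson` is an assumed shape:
// the real request format, URL, and credentials depend on the provider chosen.
type PostJson = (body: { prompt: string }) => Promise<{ text: string }>;

function httpGenerator(postJson: PostJson): TextGenerator {
  return {
    async generate(prompt: string): Promise<string> {
      const reply = await postJson({ prompt });
      return reply.text;
    },
  };
}
```

Handlers that accept a `TextGenerator` do not change when the provider does; only the factory that builds it, and the credentials it needs, would move.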

Why this is our pick

Cross-refs