● live/api.dalesai.com/serving inference now

I design, build, and operate private AI infrastructure.
This site runs on it.

DalesAI is a complete AI platform — six open models, authentication, billing, and usage dashboards — running on dedicated hardware I own in Gilbert, Arizona. It's not a product for sale. It's proof of what I can build for your business.

6 modelsserved from own hardware
Full stackauth / billing / dashboards
Private by designdata never leaves the box
Zero cloud APIs100% self-hosted

01 / The Platform

Everything a real AI product needs, running in production.

Most "AI consultants" resell someone else's API. I built the whole thing — so when I build for you, I know every layer because I've operated every layer.

Client request HTTPS Secure edge tunnel ZERO OPEN PORTS · TLS API gateway OPENAI-COMPATIBLE · AUTH · METERING Llama 3.1 70B + DEEPSEEK R1 Qwen / Gemma VISION ENABLED Mistral Small + MODEL WARMER Apple Silicon · 128GB unified memory

fig. 01 — request path, edge to silicon

Inference

Six open models, locally served

Llama 3.1 70B, DeepSeek R1, Qwen, Gemma, and Mistral on Apple Silicon — with vision support and a model warmer keeping responses fast.

API layer

OpenAI-compatible gateway

A unified API in front of every model — anything built for OpenAI works against private infrastructure with one URL change.

Accounts

Auth & usage dashboards

Email-code login, per-user token tracking, and live usage dashboards backed by a production database.

Billing

Stripe subscriptions & metering

Plan-based token limits, webhooks, and automated provisioning — the full path from checkout to working API key.

Security

Hardened, tunneled, monitored

Zero open ports to the public internet, locked-down CORS, security headers, secrets out of source.

Operations

Self-healing services

Every component runs as a managed service that survives reboots and restarts itself — built to run unattended.

02 / What I Build

AI systems for businesses that do real work.

Watch the demo — that's a missed call turning into a booked appointment with no human touching a phone.

/01

AI receptionists & lead follow-up

Missed-call text-back, appointment booking, and instant lead response for service businesses — so inquiries at 9 PM become customers instead of voicemails.

/02

Private AI assistants

Internal chat and document assistants running on dedicated hardware — for businesses that want AI without sending customer data to a third-party cloud.

/03

Automation & integration

Connecting AI to the tools you already use — CRMs, calendars, payment systems, email — with webhooks and APIs that just work.

/04

Full product builds

Auth, billing, dashboards, and deployment — the same production stack this platform runs on, built for your idea.

fig. 02 — missed-call text-back, as the customer sees it

03 / About

Built by one person. Operated every day.

I'm Dale — a builder based in Gilbert, Arizona. I'm not a classically trained engineer, and I think that's an advantage: I build with modern AI tools the way your business will actually use them, and I've shipped every piece of this platform myself — the inference servers, the security hardening, the billing, the deployments.

If you want slideware, hire a consultancy. If you want a working system from someone who runs his own, email me.

04 / Work With Me

Have a project? Let's scope it.

Tell me what you're trying to automate or build. I'll reply with an honest read on whether it's worth doing — and a fixed price if it is.

Replies within one business day