AI Agents · Agentic SEO · Agentic Websites · AI Marketing

Partnership proposal · Tavus × hashtag.org

You render the face. We make it close the deal.

Tavus built the most lifelike real-time conversational video on the planet. But a photoreal agent that can only talk doesn’t justify the GPU bill. We’ve already built the agentic harness that turns those conversations into captured leads, booked meetings, and completed on-page actions. It’s live today, on real businesses.

See it live on the map The harness

The cost problem

Photoreal real-time video is expensive, and a talking head alone doesn’t pay for it.

Tavus’s Phoenix-3 renders a full-face, micro-expression, lip-synced human via Gaussian diffusion at conversational latency. That demands datacenter GPUs held warm per call. The render is worth it only if the conversation drives a measurable business outcome.

What Tavus does brilliantly

Phoenix-3 (rendering), Raven-0 (perception), Sparrow-0 (~600 ms turn-taking). The face feels alive. This is the hard, defensible part, and it’s also the GPU-hungry part.

Why it costs so much

Real-time video diffusion can’t be cheaply batched or sharded the way an LLM can. Every active call burns premium GPU. That’s a structural cost, not a rounding error.

The gap

A beautiful agent that can only chat is a demo. To justify the spend it has to do things: capture the lead, book the meeting, move the visitor down the funnel.

What we already built

The agentic harness: abilities your video agents don’t ship with.

This is production code running today. Your models render and take turns; our harness wires the agent into the business so the conversation produces outcomes.

Capture leads

Agent opens a secure name/email/message form mid-conversation. The lead fans out to the business in real time across Slack, a HMAC-signed webhook, and the CRM.

Fill forms for the visitor

On an owner-allowlisted field, the agent types the value the visitor gives it (React-friendly native setter + input/change events). It NEVER submits. The human reviews and sends.

Navigate & act on the page

Allowlisted host-page routes, button clicks, and section scrolls. The agent can take a visitor to pricing, open a menu, or expand a booking widget on the live site.

Book meetings

Detects scheduling intent and opens the owner’s calendar (Cal.com / Calendly / Google) so the visitor picks a slot without leaving the conversation.

Knowledge-grounded answers

Full-text search over the business’s verified website, profile, and documents (Postgres FTS), so answers are grounded rather than hallucinated.

Lead fan-out & routing

Tier-gated CRM routing and specialist hand-off, with cross-session memory so a returning visitor is recognized.

Self-test & uptime

An admin "AI test circuit" runs a battery against every agent handler (markers, tools, keys, delivery) and tracks a rolling uptime score, so the harness is observably healthy.

Owner-allowlist safety

Every host-page action is defence-in-depth gated, validated at the API, the tool executor, the loader, and the effects client. No payment or password fields, ever.

Already wired to Tavus. We register these as real function tools on the Tavus persona (layers.llm.tools), decode the conversation.tool_call events, and apply the effect in the visitor’s browser. So a Tavus video agent calls open_lead_form, fill_host_field, or book_meeting and it just works.

Live proof, not slideware

Running on real businesses, on a spatial map of #portals.

Every business is a geographic #portal on hashtag.org. The same single embed code drops the agent onto the owner’s own website. The agent talks, qualifies, and acts, and the lead lands in the owner’s Slack before the call ends.

One embed, two surfaces

A single install powers both the on-site widget (talk to the business’s AI clone) and the #portal card on the spatial map. Backlinks + discovery come free.

Lead → Slack in real time

Visitor intent → open_lead_form → submitted lead fans out to Slack, webhook, and CRM. The business feels the agent working immediately.

Rich data layers

Each #portal carries a spatial data layer (local businesses, civic data, more) the agent can reason over. That’s context Tavus agents don’t otherwise have.

Sales playbook built in

The agent is prompted to qualify within 60 s, surface value, and close to the lead form or booking. It doesn’t wait to be asked.

Open the live map

Harness × Phoenix-3

Together you get an agent that earns its GPU cost.

Your video fusion makes the interaction feel human enough to trust. Our harness makes that trust convert. Neither half is as valuable alone.

Tavus alone

A lifelike face that chats

Stunning. But the ROI conversation is hard: high GPU cost, and no built-in path to a captured outcome.

Tavus × harness

A face that closes

Qualifies, fills forms, books, captures the lead, routes it to the CRM. Now the GPU spend maps directly to pipeline, an easy ROI story for every customer you sell to.

The cost flip

Browser-native render on the user’s own GPU, no app to install.

Our #space Chrome extension is already the distribution surface. For power users and enterprises with a capable GPU, a small native helper renders a distilled real-time avatar LOCALLY and streams frames straight into the extension. The per-call GPU cost goes to zero, right inside the browser.

Runs in the #space extension

The browser extension is the front-end (call UI, captions, the harness’ lead form). A one-time native helper does the CUDA render and talks to it over Chrome Native Messaging. You get browser distribution with native GPU power, no separate app to discover.

PC / RTX 4090 first

We’re validating real-time local inference on a 4090 (Windows/NVIDIA) first. Distilled avatar models now hit 16–32 FPS on a single modern GPU. macOS and other GPUs follow.

Zero marginal GPU cost

For these users the expensive render is free to us. Hosted Tavus stays the premium path for everyone without a capable GPU, and the SAME harness drives outcomes on both.

Honest scope: the heavy render can’t run in a tab directly (browsers can’t touch CUDA), so a tiny signed native helper does it and streams frames in. If a user won’t install it or lacks the GPU, the extension transparently falls back to hosted Tavus. It’s graceful, never broken. This is the answer to “what about customers who can’t absorb per-minute GPU cost?”

The ask

Make us a reference partner and use-case.

We chose Tavus. We can route real conversational-video volume to you, showcase the harness on live businesses, and prove the ROI story your sales team needs.

What we bring

A shipped agentic harness: leads, forms, navigation, booking, CRM routing.
Live distribution: businesses as #portals plus a one-line embed.
A model-agnostic provider layer (Tavus is our showcase backend).
A browser-native local-GPU tier (in the #space extension) that answers your cost objection.

What we’re asking

Partner / reference-use-case status and co-marketing.
Preferred pricing or volume terms as we drive call volume.
Early access to model + tool-calling roadmap so we build with you.
A design conversation on the local-GPU premium tier.

You make AI faces feel human. We make them close business.

Let’s put them together and make every call worth the GPU spend.

Watch a live #portal agent