Question 1

Is the video actually my sales manager or an AI avatar?

Accepted Answer

It is 100% your person on screen. They record one video, and our AI clones their voice pattern. We then generate personalized audio segments — names, dates, vehicle details — that sound exactly like them. The video stays authentic; only specific audio moments are modified. No synthetic faces, no deepfakes.

Question 2

Can customers tell a VoxRefine video is AI-generated?

Accepted Answer

No. The video is genuinely your team member. VoxRefine's voice synthesis passes blind perception tests with 98%+ accuracy because it uses the team member's voice as the source model. There are no synthetic faces, no deepfakes, and no visible rendering glitches.

Question 3

How many personalized videos can VoxRefine generate per month?

Accepted Answer

VoxRefine processes 10,000+ videos per hour across its distributed GPU cluster. Auto-scaling infrastructure maintains sub-50ms render time whether a dealer sends 50 videos or 50,000. Built for dealer groups running multi-rooftop operations.

Question 4

Which dealership CRMs and DMS platforms does VoxRefine work with?

Accepted Answer

Whatever CRM your BDC already uses — CDK, Reynolds & Reynolds, Dealertrack, VinSolutions, DriveCentric, or anything else. VoxRefine captures the lead data from your existing CRM workflow, so videos send automatically on the events the BDC already acts on — appointment set, status change, custom tag. Most dealers are fully live within 48 hours, with no integration project, no vendor-side sign-off, and no IT ticket.

Question 5

What does a dealership need to provide to get started with VoxRefine?

Accepted Answer

One 60 to 90 second video per person being cloned. Smartphone quality is acceptable with good lighting and clear audio. VoxRefine provides a script template with the exact phrasing and pauses needed. The pipeline handles noise reduction, voice isolation, and model training. A working demo is typically ready within two hours of uploading.

Attribute	AI avatar videos	VoxRefine
On-screen face	AI-generated avatar	Actual salesperson
Setup time to first video	Minutes — no recording step	Hours — one 60–90 sec recording per person
Languages supported without re-work	Any language	Limited to the recording's language
Per-customer personalization (name, vehicle, time)	Scripted per lead	Auto-generated from CRM data
Face continuity (video → in-store)	Avatar is not a staff member	Same person the customer meets in-store
Best fit	Quick scripted messages; multi-language; no staff face available	Automated volume sends where face continuity matters

AI avatar videos vs VoxRefine:
synthetic face, or your actual salesperson?

At a glance

Feature-by-feature comparison

When AI avatars are the better fit — and when VoxRefine is

AI avatars fit best when…

Either can work when…

VoxRefine fits best when…

Questions dealers ask about AI avatars

See what a real-face video looks like

AI avatar videos vs VoxRefine:synthetic face, or your actual salesperson?