Question 1

Is the video actually my sales manager or an AI avatar?

Accepted Answer

It is 100% your person on screen. They record one video, and our AI clones their voice pattern. We then generate personalized audio segments, names, dates, vehicle details, that sound exactly like them. The video stays authentic; only specific audio moments are modified. No synthetic faces, no deepfakes.

Question 2

Can customers tell a VoxRefine video is AI-generated?

Accepted Answer

No. The video is genuinely your team member. VoxRefine's voice synthesis passes blind perception tests with 98%+ accuracy because it uses the team member's voice as the source model. There are no synthetic faces, no deepfakes, and no visible rendering glitches.

Question 3

How many personalized videos can VoxRefine generate per month?

Accepted Answer

VoxRefine processes 10,000+ videos per hour across its distributed GPU cluster. Auto-scaling infrastructure maintains sub-50ms render time whether a dealer sends 50 videos or 50,000. Built for dealer groups running multi-rooftop operations.

Question 4

Which dealership CRMs and DMS platforms does VoxRefine work with?

Accepted Answer

Whatever CRM your BDC already uses: CDK, Reynolds & Reynolds, Dealertrack, VinSolutions, DriveCentric, or anything else. VoxRefine captures the lead data from your existing CRM workflow, so videos send automatically on the events the BDC already acts on: appointment set, status change, custom tag. Most dealers are fully live within 48 hours, with no integration project, no vendor-side sign-off, and no IT ticket.

Question 5

What does a dealership need to provide to get started with VoxRefine?

Accepted Answer

One 60 to 90 second video per person being cloned. Smartphone quality is acceptable with good lighting and clear audio. VoxRefine provides a script template with the exact phrasing and pauses needed. The pipeline handles noise reduction, voice isolation, and model training. A working demo is typically ready within two hours of uploading.

Feature	VoxRefine	Covideo (manual)	Covideo AI avatar
On-screen face	Actual salesperson	Actual salesperson	AI-generated avatar
Recording step required	One 60–90 sec recording per person, once	Yes, for every outbound video	No recording step
Per-customer personalization (name, vehicle, time)	Auto-generated from CRM data	Manual, mentioned by the salesperson	Scripted per lead
Throughput	10,000+ videos per hour	Limited by rep's recording time	High, generation in seconds
Face continuity (video → in-store)	Same person	Same person	Avatar is not a staff member
Best fit	Automated volume sends where face continuity matters	Ad-hoc messages from a specific rep to a specific lead	Quick scripted messages where face is less critical

Covideo uses AI avatars.
VoxRefine keeps your actual salesperson on screen.

At a glance

Side-by-side: what the customer sees

Feature-by-feature comparison

When Covideo is the better fit, and when VoxRefine is

Covideo fits best when…

Covideo AI fits when…

VoxRefine fits when…

Questions dealers evaluating both tools ask

See what a real-face video looks like

Covideo uses AI avatars.VoxRefine keeps your actual salesperson on screen.