Voice-cloned outbound video built for Houston dealerships at BDC scale. Real salesperson on screen. AI generates the audio that names each customer, each vehicle, and each appointment time. One recording per rep, thousands of personalized sends from there.
Written by Josh Duhon, Co-Founder of VoxRefine. We work with dealer groups across the country, including the rooftops Premier Automotive operates. Last reviewed May 11, 2026.
Houston is consistently one of the top three US auto markets by total new and used vehicle sales, with major dealer rows stretching along I-45 north toward Spring and south toward Galveston. The math at a metro that size is straightforward. Outbound lead volume per rooftop runs deep into the thousands every month, and no BDC team is recording a personal video for each one. Templates carry the load, customers recognize them, and the appointment that does get set sits inside the 67 percent no-show baseline that every BDC manager in the country knows by heart.
Hurricane season runs June through November and a single named storm can flatten BDC volume for a full week. Truck demand spikes after every major weather event as flood-damaged vehicles get replaced. The BDC has to keep producing through that seasonality. The reps cannot record their way out of it, the template stack is already maxed out, and the next-gen options on the market either ask for more rep time (manual-record video) or break the customer trust that the rooftop has spent years building (AI avatars).
Houston is a truck market. Full-size pickup share is well above the national average, and the typical BDC sees a heavy mix of fleet-adjacent buyers and trade-in-heavy retail. The drive between competing rooftops can be 45 minutes in traffic, so an appointment that actually shows up is more valuable here than in tighter metros.
The mechanic is simple. Your salesperson records one 60 to 90 second video. VoxRefine clones that voice. From then on, every outbound video to a lead in your CRM gets the same real footage of the same real person, with AI-generated audio segments naming the customer, the vehicle they engaged with, and the time of their appointment. The face on screen is the rep the customer will meet in the showroom. The voice is the same voice they will hear on the phone.
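As a rough illustration of the mechanic above, the sketch below pulls the three customer-specific details out of a lead record and returns the text for each audio segment. The `Lead` fields and the `personalized_segments` function are hypothetical names chosen for this example, not VoxRefine's actual data model or API, and the voice synthesis step itself is left out.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Lead:
    """Hypothetical CRM lead record; field names are illustrative only."""
    customer_name: str
    vehicle: str
    appointment: datetime
    rep_id: str


def personalized_segments(lead: Lead) -> dict[str, str]:
    """Return the text of each audio segment to voice in the rep's cloned voice.

    The source video is untouched; only these short segments vary per lead.
    """
    # %I is a zero-padded 12-hour clock, so strip the leading zero for natural speech.
    time_text = lead.appointment.strftime("%I:%M %p").lstrip("0")
    day_text = lead.appointment.strftime("%A")
    return {
        "customer_name": lead.customer_name,
        "vehicle": lead.vehicle,
        "appointment_time": f"{day_text} at {time_text}",
    }


# Example lead (made-up values) assigned to a hypothetical rep ID.
lead = Lead("Alex", "2024 Ford F-150", datetime(2026, 5, 12, 14, 30), "rep-maria")
segments = personalized_segments(lead)
```

The point of the sketch is that one recording plus three short text fields is all that changes from send to send.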
A Houston dealer running a 5-rep BDC sending 10,000 outbound touches a month is not going to record 10,000 videos. The math has never worked. The same dealer running VoxRefine records 5 source videos, once, and every lead in the pipeline gets a personalized send from the right rep, in the rep's actual voice, naming the actual vehicle, every time.
What changes for the BDC manager is the day-to-day work. The reps stop recording. The manager stops chasing template approvals. The outbound queue runs itself off the CRM events that the BDC already tracks. The BDC team shifts from production work to live calls and in-store follow-up, which is where the conversion lift actually shows up.
Dealership video splits four ways. Manual-record, where the rep records each video by hand. AI avatars, where a synthetic face reads a script. Generic stock video, where nothing is personalized at all. And real-face plus cloned-voice, which is the VoxRefine approach.
Manual-record is the most authentic and the least scalable. AI avatars scale but break face continuity, so the customer meets a different person in the showroom than the one they saw in the video. Generic stock video is the cheapest and converts the worst. Real-face plus cloned-voice is the only approach that keeps the actual rep on screen at volumes a BDC can actually deliver.
For a longer breakdown of where each category fits and how the tradeoffs play out at dealership scale, see our VoxRefine vs Covideo comparison and the AI avatar video for dealerships page. Both directly address the Houston-relevant tradeoffs.
The Houston market runs on a brand mix that includes Ford, Chevrolet, Toyota, Ram, GMC, Nissan, Honda, and Lexus. That breadth matters because a dealer group operating five rooftops across three franchises has five different sales floors, each with its own top performers and its own tone with customers. The video the customer gets from the Toyota store cannot sound like the video they get from the Lexus store. The reps are different people, and the customers know it.
VoxRefine's per-salesperson voice cloning is built for that reality. Each rep records once. Each rep's voice clone stays attached to leads routed to that rep. A group running four rooftops in metro Houston, with three reps per BDC, ends up with twelve voice clones running in parallel and no audio bleed between them. The customer assigned to Maria at the Honda store gets Maria's voice. The customer assigned to Dave at the Ford store gets Dave's.
That scales with the dealer group. Add a rooftop, add the reps, record the source videos, and the pipeline runs the same way it ran on the original four stores.
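The per-rep routing described above can be pictured as a simple lookup keyed by the rep a lead is assigned to. This is a minimal sketch under assumed names (`VoiceClone`, `CloneRegistry`, and the rep IDs are all hypothetical), not a description of how VoxRefine stores clones internally.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceClone:
    """Hypothetical handle for one rep's voice clone."""
    rep_id: str
    rooftop: str


class CloneRegistry:
    """Maps each rep to exactly one voice clone, so audio never crosses reps."""

    def __init__(self) -> None:
        self._by_rep: dict[str, VoiceClone] = {}

    def register(self, clone: VoiceClone) -> None:
        # Each rep records once; a second clone for the same rep is rejected.
        if clone.rep_id in self._by_rep:
            raise ValueError(f"rep {clone.rep_id!r} already has a voice clone")
        self._by_rep[clone.rep_id] = clone

    def clone_for_lead(self, assigned_rep_id: str) -> VoiceClone:
        # A lead only ever gets the clone of the rep it is routed to.
        try:
            return self._by_rep[assigned_rep_id]
        except KeyError:
            raise LookupError(
                f"no voice clone recorded for rep {assigned_rep_id!r}"
            ) from None

    def __len__(self) -> int:
        return len(self._by_rep)


# Four rooftops with three reps each, matching the example in the text.
registry = CloneRegistry()
for rooftop in ["Honda", "Ford", "Toyota", "Chevrolet"]:
    for n in range(1, 4):
        registry.register(VoiceClone(rep_id=f"{rooftop.lower()}-rep-{n}", rooftop=rooftop))
```

Adding a rooftop in this picture is just more `register` calls; the lookup for existing leads does not change.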
VoxRefine's product development partner is Premier Automotive, a dealer group co-owned by Josh Duhon. That partnership is where the pipeline gets tested under real BDC conditions, which is the reason we will not quote you a fabricated industry stat we cannot back up. We say what we see.
What we see at Premier and at the other groups we work with is consistent. Appointment confirmation video, sent the day before the appointment, lifts show rate by an amount we will quote you specifically on a call once we know your baseline. No-show follow-up, sent two to four hours after the missed window, recovers appointments that template SMS does not. Service-reminder video from the customer's actual service advisor, triggered by DMS milestones, retains customers who would otherwise drift to an independent shop.
In Houston, the same playbook applies. The market context above shifts which use case carries the most weight, but the workflow is the same workflow Premier runs on its rooftops today.
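The send timings described above can be expressed as two small date calculations. This is a sketch only: the function names are hypothetical, and treating "the day before" as exactly 24 hours ahead of the appointment is an assumption made for the example, not a documented VoxRefine setting.

```python
from datetime import datetime, timedelta


def confirmation_send_time(appointment: datetime) -> datetime:
    """Confirmation video goes out the day before the appointment.

    Assumption: "the day before" is modeled as exactly 24 hours earlier.
    """
    return appointment - timedelta(days=1)


def no_show_follow_up_window(missed_at: datetime) -> tuple[datetime, datetime]:
    """No-show follow-up goes out two to four hours after the missed window."""
    return missed_at + timedelta(hours=2), missed_at + timedelta(hours=4)


# Example with a made-up appointment time.
appointment = datetime(2026, 5, 12, 14, 30)
confirm_at = confirmation_send_time(appointment)
earliest, latest = no_show_follow_up_window(appointment)
```

In practice these times would be driven by the CRM events the BDC already tracks rather than computed by hand.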
Most Houston rooftops are live within 48 hours. Your salesperson records one 60 to 90 second source video. VoxRefine clones their voice, connects to whichever CRM your BDC already uses, and starts generating personalized outbound video tied to your existing lead routing. No native API project, no IT ticket, no vendor sign-off. The BDC keeps the workflow it already has.
Yes, VoxRefine works outside franchise car rooftops. The core mechanic is the same regardless of unit type. As long as you have a designated salesperson willing to record a source video and a CRM your BDC works in through the browser, VoxRefine generates personalized outbound video for the customers in that pipeline. Around Houston we have seen the same playbook port from franchise rooftops to powersports and RV operations under the same dealer-group umbrella.
No, the footage is not AI-generated. The video is unmodified footage of your actual salesperson. AI generates only the audio segments containing customer-specific details (name, vehicle, appointment time), using a clone of the salesperson's own voice. No synthetic face, no lip-sync manipulation. The person the customer sees on screen is the same person they meet in the showroom.
Pricing is per rooftop, typically based on monthly outbound video volume. We do not publish list pricing because rooftop counts and use-case mix vary too widely. A 5-minute call with our team produces a real quote you can take back to your GM and CFO.
Send us a short clip of one of your reps and we will return a personalized VoxRefine video featuring that rep, naming a real customer, a real vehicle, and a real appointment time. No commitment. Just proof.