Sources & Verification
This page combines adjacent U.S. production-market data, current buyer-demand data from a major voice marketplace, audiobook and e-learning demand signals, public labor and tax guidance, pricing-structure guidance, and AI-competition market data. Because there is no clean public standalone series for freelance voiceover services, the page also uses editorial judgment to connect these sources to a practical small-business version of the idea, with special weight given to how AI is changing demand quality and pricing power.
adjacent industry size
IBISWorld
Supports: Closest public U.S. production-market context
Key point: The U.S. Audio Production Studios industry is about $1.7 billion in 2026, with 22,009 businesses and high, steady competition.
View source →buyer demand
Voices 2025 Trends Report
Supports: Current buyer intent across voice project categories
Key point: Voices says 52% of buyers expected voice needs for branding and marketing in 2025, 45% for animation, and 35% each for internet ads and television-related work.
View source →audiobook demand
Audio Publishers Association
Supports: Narration-market demand signal
Key point: U.S. audiobook sales reached $2.22 billion in 2024, up 13% year over year, with digital audiobooks making up 99% of revenue.
View source →elearning demand
Grand View Research
Supports: Demand signal for training and instructional narration
Key point: The global e-learning services market was estimated at $299.67 billion in 2024 and is projected to reach $842.64 billion by 2030.
View source →pricing structure
Voices
Supports: How non-broadcast and broadcast jobs are commonly priced
Key point: Voices says non-broadcast jobs are generally priced per word or per finished minute, while broadcast work is priced according to usage.
View source →pricing structure
Voice123
Supports: Word count and purpose as core pricing drivers
Key point: Voice123 says its non-union, non-broadcast reference budgets are based on word count and finished duration, with final fees varying by purpose and scope.
View source →union rate context
SAG-AFTRA
Supports: Official union benchmark for usage-based commercial pricing
Key point: SAG-AFTRA publishes current commercial rate sheets, reinforcing that professional commercial pricing is structured around defined usage windows rather than simple reading time alone.
View source →income distribution
Full Time Voice Talent / NAVA 2024 Survey Summary
Supports: Income-distribution reality for the profession
Key point: A summary of the 2024 NAVA industry survey says 74.9% of voice actors earn $50,000 or less per year and 81.6% earn $75,000 or less.
View source →adjacent wage context
BLS
Supports: Public labor-market context for related voice-facing occupations
Key point: Broadcast announcers and radio disc jockeys had a median hourly wage of $21.96 in May 2024, and actors had a median hourly wage of $23.33.
View source →ai competition
Grand View Research
Supports: Scale and growth of AI-generated voice competition
Key point: The AI voice generators market was about $4.6 billion in 2024 and is projected to reach about $21.75 billion by 2030.
View source →tax context
IRS
Supports: Self-employment tax and estimated-tax context
Key point: The IRS says gig workers with net self-employment earnings of $400 or more must file a tax return and may need to pay estimated taxes.
View source →studio setup context
Backstage
Supports: Home-studio setup priorities
Key point: Backstage's voiceover setup guidance says the recording room is more important than the microphone and stresses acoustic control before gear spending.
View source →The parts covering adjacent production-market size, buyer demand by project type, audiobook growth, e-learning growth, pricing structure, union-rate context, public wage context, AI voice market growth, self-employment tax obligations, and home-studio room priority are grounded in public sources. The parts covering startup shape, repeat-client logic, operator fit, audition burden, niche selection, AI-resistant versus AI-vulnerable lane strategy, and growth sequencing are editorial conclusions built from those sources rather than direct single-source claims.
Whether this business is worth doing still depends heavily on your lane, your audio quality, your ability to price usage correctly, your tolerance for audition-heavy selling, and whether you want to stay a solo talent or become a more structured audio-service operator. The broad demand story is real, but AI has made weak positioning much easier to replace, so repeat clients and clear human value usually decide whether the business actually works.