Recording High-Quality Datasets - Kits AI

Clean audio

The highest-quality voice models are recorded:

with a quality microphone into an audio interface
in a noise-free setting with limited room reverberations
with correct and consistent mic placement
with volume peaks between -9db and -3db

with consistent dynamics across the whole dataset
with light EQ to remove any muddiness, hiss, etc.
with compression/limiting to smooth out peaks
with no reverb, delay, or doubling

Train Content Guidelines

⌘I