POST
/
api
/
kits
/
v1
/
voice-conversions
curl -X POST https://arpeggi.io/api/kits/v1/voice-conversions \
-H "Authorization: Bearer your_api_key" \
-F "voiceModelId=voice_model_id" \
-F "soundFile=@file_path"
{
  "id": 1,
  "createdAt": "2023-09-19 20:13:50.428000 +00:00",
  "type": "infer",
  "voiceModelId": 2,
  "status": "running",
  "jobStartTime": "2023-09-19 20:15:50.428000 +00:00",
  "jobEndTime": null
}
Creates a voice conversion inference job and adds it to the inference queue. All requests must be made as multipart form requests. The response will contain the job status.

Headers

Authorization*
String
Bearer <api-key>

Request Body

voiceModelId*
Number
ID of the voice model.
soundFile*
String
Supports wav, mp3, or flac. Max file size is 100MB.
conversionStrength
Number
Increases the amount of accent from the Voice Model. This may cause mispronunciation. Range from 0 to 1.
modelVolumeMix
Number
Increase to convert the volume of your input audio to the volume of the AI model. Decrease to hear dynamics from the input audio. High values may accentuate noise. Range from 0 to 1.
pitchShift
Number
The number of semitones to shift the input audio. Range from -24 to 24.
pre
Object
Preprocessing effects run on the input file before the conversion is processed.
post
Object
Postprocessing effects run on the output file after the conversion is processed.

Response

Inference Job
curl -X POST https://arpeggi.io/api/kits/v1/voice-conversions \
-H "Authorization: Bearer your_api_key" \
-F "voiceModelId=voice_model_id" \
-F "soundFile=@file_path"
{
  "id": 1,
  "createdAt": "2023-09-19 20:13:50.428000 +00:00",
  "type": "infer",
  "voiceModelId": 2,
  "status": "running",
  "jobStartTime": "2023-09-19 20:15:50.428000 +00:00",
  "jobEndTime": null
}