# Build your Dataset

apiIn this section, you will find a detailed list of tips and recommendations to ensure the quality and consistency of the audio used in your dataset. These guidelines will help you compile high quality recordings that will be used to train your voice model. Follow these recommendations to ensure accurate and improved results during the training of your model.

In this list we understand that you have already recorded your files and that you are going to start the per-processing process of the audios to have them ready. If you don't have these audios yet, you have a list of [recommendations in this link](/twintune/tips-and-recommendations/audio-recording.md). If you have files with instrumentals we are currently developing an API to separate instrumentals and vocals using UVR  or use their program directly([you can download in this link](https://github.com/Anjok07/ultimatevocalremovergui)).

* At least 15 minutes of dry (no effects) and monophonic (one note at a time) vocal recordings are required.
* It's best to have examples that cover your entire range. Chest, blend, falsetto; big and short intervals; high and clean notes; etc. The more variety, the better.
* clean EQd (subtractive) to reduce muddy or harsh frequencies in the recording
* subtly pitch corrected (slow attack, moderate strength) unless it's a key part of the vocal style
* De-essed to reduce any harsh sibilance
* compressed lightly to even out dynamic range/reduce peaks (\~4-5db of gain reduction at most)
* boosted (additive EQd) to fit the style of the vocal
* limited to a peak of -6db with overall levels between -6 and -12db.
* high/low passed to remove frequencies below 40hz–100hz and above 20khz
* phase re-balanced


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.twintune.ai/twintune/tips-and-recommendations/build-your-dataset.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
