What datasets and open source models/software are used by Kits? – Kits AI

All models created by the Kits.AI team are trained exclusively on:

fully licensed proprietary datasets collected by the Kits.AI team, with fair compensation to providers
open-license datasets

Open-license datasets used in training include:

Title: Emilia-YODAS dataset
Creator: Amphion (OpenMMLab)
License: Creative Commons Attribution 4.0 International (CC BY 4.0)
Source: https://huggingface.co/datasets/amphion/Emilia-Dataset
Modifications: none

Title: VCTK Corpus (Version 0.92)
Creator: University of Edinburgh, CSTR (Junichi Yamagishi et al.)
License: Creative Commons Attribution 4.0 International (CC BY 4.0)
Source: https://datashare.ed.ac.uk/handle/10283/3443
Modifications: none

Kits also uses the following open source models and software:

Title: Parakeet TDT 0.6B V2 (En)
Creator: NVIDIA
License: Creative Commons Attribution 4.0 International (CC BY 4.0)
Source: https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2
Modifications: none

Title: Audiobox Aesthetics
Creator: Meta AI (Facebook Research); Andros Tjandra et al.
License: Creative Commons Attribution 4.0 International (CC BY 4.0)
Source: https://github.com/facebookresearch/audiobox-aesthetics
Modifications: none

Title: LAME MP3 Encoder (used via lamejs)
Creator: LAME Developers
License: GNU Lesser General Public License (LGPL)
Source: https://lame.sourceforge.io/
Modifications: none

Title: lamejs (JavaScript port of LAME)
Creator: zhuker & contributors
License: GNU Lesser General Public License (LGPL)
Source: https://github.com/zhuker/lamejs
Modifications: none

Title: FFmpeg
Creator: The FFmpeg developers
License: GNU Lesser General Public License (LGPL), version 2.1 or later (applies to most of FFmpeg)
Source: https://ffmpeg.org/
Modifications: none

More information on our models can be found at kits.ai/research.
