Voice cloning NLP
noun phrase
Definition: The synthesis of speech that reproduces a target speaker’s vocal identity, as captured from reference audio and typically from only a short sample of that speaker’s voice. This formulation follows recent technical work on instant and zero-shot voice cloning [Qin et al. 2024].
Example in context: “We introduce OpenVoice, a versatile instant voice cloning approach that requires only a short audio clip from the reference speaker to replicate their voice and generate speech in multiple languages.” [Qin et al. 2024]
Related terms: speech synthesis, text-to-speech, speaker adaptation, synthetic voice, voice spoofing