Will AI Voice Cloning Change Podcasts & YouTube Forever?

  • I have been discussing this with a friend today about how AI can now replicate our voices by taking a sample and then recreate us speaking in a way that sounds incredibly realistic. This obviously opens up a lot of security concerns especially around scams, impersonation and fraud which is quite worrying.

    That said I can also see the other side of it. For podcasts and content creation, it seems like it could be an incredibly useful tool. Imagine being able to fix mistakes without re-recording, update old episodes, create content more efficiently or make AI videos while still keeping your own voice and style.

    Im curious where everyone stands on this. Do you think AI voice cloning is something creators should be embracing or do the risks outweigh the benefits?

    Would you actually use an AI version of your own voice for your own content?

  • Yeah that is true especially for interactive things eventually it slips up or isnt enough. If you look at AI customer service, a lot of the time you end up needing a human especially with the problem is niche and not run of the mill. I can see how it would be good if someone is unable to create with their own voice due to time, not being able to physically etc but I can imagine it not being as good as the real thing, same with a lot of things where theres a substitute haha

    I think AI is best used as an assistant when someone doesnt have one. Thats how I use it mainly in general for both work and personal use.

  • This is a tough one, it certainly makes me very uncomfortable, but if I really think about it, I'm not sure it's gonna have the detrimental impact that some people think it will. (is there a way to make this thing stop trying to guess what I'm about to say? It's driving me nuts) I mean the way I see it, no matter how good it gets, it will not be able to convincingly mimick the most unique people out there. Like I highly doubt it would ever be able to make a convincing Markiplier video, or even a Ted Nivison etc, because these people surprise us, they have a style but the style is spontaneous, they do weird voices and impressions and scream etc. I don't even think 10 years from now AI will be able to convincingly replicate THAT.

    I DO think it will (and maybe already can) convincingly be able to replicate lower energy "NPC" types, people who talk the same way every video, just deliver information, the more monotone among us etc. But here's the thing, if that's you? Why would someone want to use AI to steal that anyway? That's not unique enough for someone to WANT to steal. (Also, if you have that kind of delivery, I'm not saying that's bad or anything, I don't mean "NPC" negatively, a lot of people do research and do awesome informative videos with a monotone delivery). So I guess I"m just not sure who's really gonna suffer from this, if you try to trick us into thinking it's a truly animated unique person we all know? It probably will never be able to get the nuance right. If it tries to mimick the monotone? First off why bother what's the point, and secondly we'd probably STILL know because the type of info that person usually delivers will feel off you know?

    I'm sure it will be able to fool some, I just doubt it's gonna be able to fool everyone, and these things will probably get found out eventually. What seems more likely to me is them making short clips that make it look like someone is saying something offensive and people flying into a rage before vetting the video. THAT does worry me.

Participate now!

Don’t have an account yet? Sign up now to post your own questions and be a part of our creator community!