Here's an old post of mine, an edited version of a post I made earlier in the thread.
[Possibilities] The Bedroom Multimedia Studio
When I was about 9 or 10 (which wasn't that long ago, just 15 years), I actually thought we already had media generation and synthesizing algorithms. I very distinctly remember looking up on AOL how to download a program that would create a cartoon. I'd type in my useless and baffling 10-year-old descriptions and the computer would self-destruct trying to decipher what I was trying to say, but eventually I'd get a 30-minute episode that I could watch and show off to others. I was confused and disappointed when my words didn't magically turn into a cartoon.
Eventually, when I was 13, I found this Anime Studios program at Walmart and thought "Aw sweet, Imma go create my own show". And I even asked myself "Do they already have the voices I want on there?" at one point like a dumbass.
I think I still have that disc, too. Never touched since 2008, which should tell you whether or not I was able to create a cartoon that day. But I was a silly little kid who didn't really know much about technology. Computers and the internet seemed like magic and this was the era right around when blogs became common, so I just watched Cartoon Network and their contemporaries not understanding the sheer effort and manhours that went into creating even a single animated short, let alone an entire series. Cartoons just existed. I didn't know how they were created other than that some guy drew images over and over again and they somehow got put on TV, and I never thought to use the magical internet to research how cartoons are made. So surely, computers could just generate them, right?
Actually, I did find out the extensive process of animation around the same time of my second little gaffe because I genuinely did try to do something with it and was flabbergasted at how labor intensive it was just to create a single piss-poor 2-second loop of stick figures and that brought me to do some actual research.
How disappointing, right? That was over a decade ago.
It feels surreal and sorta vindicating to know that, by the late 2020s, it's possible someone who's a stupid 10-year-old that doesn't know better could think "Can I download something that makes a cartoon?" and the answer will be "Yes!"
If I were a bettin' man, I'd say that the sort of generative networks that we'll see during the next decade up to the latest years (2028 and 2029) will allow for you to pick and choose what sort of show you want to create.
Like, to keep with the cartoon example, imagine that for the early 2020s: GoAnimate, but vastly superior. There are many presets you can choose. You can type in detailed descriptions of someone or something and get a character or object designed, and then you can pick out what you want in a specific style. You don't even have to stick with any one style for all objects and characters— you could have a character from classic Disney a la Snow White or Beauty and the Beast with a modern overly-childlike moe anime character in a single work with a background that looks like it was drawn in the style of Ed Edd 'n Eddy. The animation will probably be either too awkward or too fluid, making it too obvious that it's something created with software. And the voices will also be realistic, but with poorly cadenced intonations and weird inflections. But a particularly skilled creator or a small team could create something with $200 that looks like it cost a thousand times more than that (which is the actual cost of a single episode of a typical Western cartoon, believe it or not).
I can see that as being probable by 2022-2023 or so. Mainly if deep learning + destructive brain scans takes off.
By the late 2020s, it will all likely be many times more refined, sort of like the difference between creating websites in 1998 vs. today. Animation quality/fluidity could be entirely a choice rather than a limitation and voice synthesization will be indistinguishable from reality. The OST can be anything you want it to be, however epic and sweeping or minimalist and unpretentious. What was rough and took a lot of your own effort in the early 2020s is literally as easy as typing in descriptions and then refining the results with the help of AI, then letting said AI run various scenes through and letting you choose which one was the best or was the closest so you could further tweak it. And the AI could also help you edit it if you doubt your hand. And if you were to release it to the world, you genuinely could since NLP/NLU would allow you to translate everything (speech and text) to all other languages. You could create an entire multimedia franchise in your bedroom.