This is currently the best text2video model, that i have seen. But it has some problems with objects morphing.
You can try it out here: https://replicate.com/anotherjesse/zeroscope-v2-xl
This sublemmy is a place for sharing news and discussions about artificial intelligence, core developments of humanity's technology and societal changes that come with them. Basically futurology sublemmy centered around ai but not limited to ai only.
[email protected] (Our community focuses on programming-oriented, hype-free discussion of Artificial Intelligence (AI) topics. We aim to curate content that truly contributes to the understanding and practical application of AI, making it, as the name suggests, “actually useful” for developers and enthusiasts alike.)
This is currently the best text2video model, that i have seen. But it has some problems with objects morphing.
You can try it out here: https://replicate.com/anotherjesse/zeroscope-v2-xl
Just tried it and the result was goofy but the frame consistency and smoothness of movement is insane! I can't wait to see how things develop.
The speed of progress that we are making with ai is crazyyy. I will be able to watch quality movies generated by ai sooner than I was expecting. I thought it will take 5 years or more at best case scenario but now I think that it will be less than 5 years from now.
At the current rate of progress, completely generated films are probably possible next year. The audio and video part is currently not good enough, but the quality will probably get to Midjourney5 level in the next half year. Scripts for a full movie can be written by GPT-4, currently it still needs a lot of help for a good result, but with better fine tuning that shouldn't be a problem. Then the audio and video parts can be combined by using ChatGPT code interpreter, which already works quite well.