- The Renaissance Times
- Posts
- OpenAI Just Changed the Game with Sora Text to Video
OpenAI Just Changed the Game with Sora Text to Video
Welcome to the AI Renaissance,
Today is a historic day in the world of AI.
Not only did OpenAI release a potentially game changing text to speech model, it is the first day of the AI Renaissance newsletter! This newsletter is for the artists, patrons, and builders of the AI Renaissance. We will provide you everything you need to know, the tools you have at your disposal, and the inspiration to get moving and participate in the coming Renaissance yourself!
Now… on to the other big news of the day:
After many months of anticipation we have been greeted to a new product announcement from OpenAI: Sora.
Sora is OpenAI’s text to video model, and it looks absolutely mind blowing.
This release has the potential to change the future of Hollywood, YouTube, and every other form of visual media we consume today. It’s that big.
Let’s dive in.
Hot Off The Press
OpenAI’s Text to Video Model, Sora, has arrived. It is available to Red Team users to discover vulnerabilities, with a research paper coming later today
Google releases Gemini 1.5, its next generation LLM. This model boasts an improved architecture and enhanced performance
Apple is getting deeper into the AI game, making a push to enhance its xCode and Spotlight search products with AI
Lambda, a startup that sells cloud services for training AI software, has raised $320 million and is now valued at $1.5 billion
The One Big Thing
OpenAI’s Text to Video Model Sora is Changing the Game…Again
Whatever you think of him, you have to admit Sam Altman is great at his job.
Just a few days after rumours he wanted to raise $7 trillion for chip manufacturing, and on the same day as Google’s Gemini release, he drops the most anticipated product in the entire industry.
Per their release, “Sora is an AI model that can create realistic and imaginative scenes from text instructions. Sora can generate videos up to a minute long while maintaining visual quality and adherence to the user’s prompt.”
Of course OpenAI would be the ones to nail this. Who else?
Text to video has been the holy grail for generative AI for all of 2023 (a lifetime in this industry), with most attempts thus far leading to massive disappointments.
Who can possibly forget the generative Will Smith eating a pasta video?
🚨text to video progress in one year🤯
2023: 2024:
— Sam Sheffer (@samsheffer)
7:03 PM • Feb 15, 2024
If you thought ChatGPT was a game changer, Sora is going to reshape the way people behave altogether.
Not only will this massively disrupt every creative industry from film-making to 3D modeling, it will also accelerate the deep fake problem to the extreme present. Although OpenAI and any other text to video models that gain relevancy will block much of the unsavory content you’re imagining right now, the issue will be extremely present in people’s minds.
This technology is also much more likely to pull forward regulation. Whereas the limit of written text on the internet is that the average social media user can not read (only half joking), video is right up in your face.
The moment there are deep fakes of both presidential candidates in the upcoming US Election (soon), expect things to really hit the fan.
Until then, start playing around with the coolest technology we may have experienced in quite a while. Figuratively speaking of course, as there is no waitlist out yet.
Now it makes sense why Sam needs all that money for chips.
Artists’ Gallery
Today is, deservedly, a Sora demo day
welcome to bling zoo! this is a single video generated by sora, shot changes and all.
— Bill Peebles (@billpeeb)
8:16 PM • Feb 15, 2024
I don’t even know what to say…
These clips generated by OpenAI’s Sora model have me speechless.
We knew good AI text-to-video would come, but this quickly? Unreal.
We’re stepping into a new world.
Buckle up.
— Mckay Wrigley (@mckaywrigley)
9:11 PM • Feb 15, 2024
SORA can animate images pretty amazingly.
Prompt: "In an ornate, historical hall, a massive tidal wave peaks and begins to crash. Two surfers, seizing the moment, skillfully navigate the face of the wave."
— AP (@angrypenguinPNG)
1:06 AM • Feb 16, 2024
If you think OpenAI Sora is a creative toy like DALLE, ... think again. Sora is a data-driven physics engine. It is a simulation of many worlds, real or fantastical. The simulator learns intricate rendering, "intuitive" physics, long-horizon reasoning, and semantic grounding, all… twitter.com/i/web/status/1…
— Jim Fan (@DrJimFan)
7:22 PM • Feb 15, 2024
Of the OpenAI Sora videos, this one blew my mind. Rendering this scene via a classic renderer is very hard. Sora doesn't model physics the way we do. It can definitely still get it wrong, but I wouldn't have predicted it'd be this convincing. openai.com/sora
— Evan Morikawa (@E0M)
7:21 PM • Feb 15, 2024
Tools
Must have tools for every Renaissance creator to add to their toolkit:
GetLazy, a platform to build full stack apps with prompts
Google Gemini 1.5, the largest ouput LLM in existence today
NVIDIA Chat RTX, a chatbot from the largest AI chipmaker
One API with 100+ AI Models, accessible 24/7: AI / ML API
Magiscan: A 3d scanner app on your mobile phone Powered by AI
Deep Tech
The newest and coolest in the research world that you need to know about:
Explainable AI for Safe and Trustworthy Autonomous Driving
Generative AI in the Construction Industry
Stable Cascade, a new text to image model from Stability AI
Aya : A Finetuned Open-Access Multilingual Language Model
Human-in-the-Loop AI for Quantitative Investment: Alpha-GPT 2.0
Proof of Humanhood
Imagine being in film school right now and seeing the Sora demo. What would you even be thinking?
Work With Us!
The AI Renaissance is coming and we are building the best community of the people making it happen.
Contact us to showcase your product or brand and reach the exact audience for your needs across our newsletter and podcast network.