Even as technology transforms our world, it’s the human touch and spirit that give it true meaning.
Let’s get into it!
Welcome to WednesdAI – Pixel Dreams’ weekly update with top stories from the rapidly evolving world of Artificial Intelligence.
This Week’s Episode
This Week’s News
Put Yourself Anywhere, Be Anything with Video to Video
Runway’s new Video-to-Video AI tool, part of its Gen-3 Alpha release, lets users take real-world video clips and apply AI-driven transformations.
First @runwayml vid2vid tests. I've been looking forward to testing input video stylization with an actually-good video model. The ability to input your own video instead of just relying on a text prompt is powerful.
Here's "a dancing flame, candle flame, fire" 🧵 #aivideo #vfx pic.twitter.com/5npz0NCRXV
— Nathan Shipley (@CitizenPlain) September 14, 2024
Gen-3 Alpha Video to Video is now available on web for all paid plans. Video to Video represents a new control mechanism for precise movement, expressiveness and intent within generations. To use Video to Video, simply upload your input video, prompt in any aesthetic direction… pic.twitter.com/ZjRwVPyqem
— Runway (@runwayml) September 13, 2024
This is a step up from Runway’s earlier text-to-video and image-to-video features: you keep the original motion of your footage but can completely change the look or setting. For instance, a clip of a kid running around the yard can be re-imagined underwater or on an alien planet, all with just a text prompt. For businesses, this is a shortcut to highly customized video content: less time, less effort, and no need for a Hollywood budget.
For more details, check Runway’s official release and the coverage on Tom’s Guide.
Adobe also announced its generative video platform, but we’ll cover it in depth when it releases later this year.
ChatGPT o1
OpenAI has released a preview of its new o1 reasoning model, designed to spend more time thinking through problems before it answers. The model excels at complex tasks in math, coding, and science: it scored 83% on a qualifying exam for the International Mathematics Olympiad, compared to GPT-4o’s 13%. It’s a stripped-down version, lacking some ChatGPT features like web browsing and image uploads, but it shines in reasoning and safety, outperforming GPT-4o in jailbreak resistance tests.
Watch YouTuber Kyle Kabasares get the ChatGPT o1 model to write, in under an hour, the PhD code that took him over a year to complete:
Official release from OpenAI.
Salesforce’s Hard Pivot to AI Agents
As part of a strategic pivot towards AI, Salesforce CEO Marc Benioff introduced Agentforce, a platform for AI agents designed to automate tasks and improve customer interactions. The move comes as Salesforce faces pressure to stay competitive in the AI space, with clients like Heathrow and OpenTable already testing the tool. Benioff promises more insights at the upcoming Dreamforce conference, where AI will dominate the conversation.
Read about it at Fortune.com.
The First Violin Performance in Space
This moment is beautiful and profound for the human touch it represents, but the technology we talk about is what made it possible.
Enjoy this image of a SpaceX astronaut making the farthest spacewalk from Earth in 50 years.
Read about the performance at The Strad.
Videos of the Week
RunwayML’s Gen:48 contest just ended and the entries are a wonder to behold!
Join us every Wednesday for WednesdAI – a PD production!
The images accompanying the news items in this article were generated in Midjourney using the following prompts: