Robots Are Having Their ChatGPT Moment

The Renaissance Times
March 12, 2024

Covariant’s Robots in Action

Dear Artisan,

Robots are about to have their ChatGPT moment.

While most of the AI advancements of the past 24 months have been focused on software and creating better ways of retrieving and organizing information, a new release from robotics startup Covariant could bring that same methodology to the physical world.

Let’s dive in.

Hot Off The Press

Covariant Releases RFM-1, a Robotics Foundation Model giving robots human-like reasoning capabilities
US government-commissioned assessment finds large catastrophic national security risks from AI
AI Models are Showing Incredible Bias towards Certain Groups
The Italian Data Protection Authority has opened an investigation into OpenAI’s Sora
Elon Musk’s xAI to Open Source its Grok Model this Week

The One Big Thing

Moving AI From Bits to Atoms

It’s been a crazy 24 hours in AI. Famed AI Twitter personality Beff Jezos announced his latest startup, Extropic, which is pursuing a new way of creating computing resources using thermodynamics. The US government commissioned a report on AI threats that turned out to be more fearmongering than you would expect, even from the USG.

But the news from Covariant trumps all of that by a landslide. Today, we have our first-ever Robotics Foundation Model: RFM-1.

The Basics:

RFM-1, a Robotics Foundation Model, has been developed to accurately interact with and manipulate the physical world, utilizing both internet data and extensive real-world robotics interactions.
The model is trained on a diverse and multimodal dataset, collected from Covariant's deployed robotic systems in various real-world environments, to understand complex physical interactions and achieve high performance.
RFM-1's capabilities extend to physics simulation, language understanding for intuitive robot programming, and predictive world modeling, aiming to make robots more autonomous and efficient in complex tasks.
Despite its groundbreaking advancements, RFM-1 faces limitations such as resolution and frame rate constraints, reliance on traditional programming for orchestration, and the need for further scaling to enhance its understanding and operational capabilities.

Covariant's unveiling of RFM-1 isn't just another incremental step; it's akin to planting a flag on previously uncharted territory. It’s a shift towards giving machines a level of understanding of, and interaction with, the physical world that mirrors human intuition.

The potential future applications of this innovation are what make it most compelling. By blending vast datasets from the digital and physical realms, RFM-1 is essentially being taught to 'think' and 'act' in a world filled with unpredictable variables. This isn't just about making robots smarter; it's about redefining the boundaries of machine capability.

Btw, this is a sampling of what our Covariant robots look like in action:
— Pieter Abbeel (@pabbeel)
5:14 PM • Mar 11, 2024

RFM-1 is meant to be a foundation model that robots of many kinds can build on. The stated goal is for it to be used by “billions” of robots in the coming years. Its multimodal approach lets robots understand natural language while building 3D models of the world they inhabit, opening the door to human-machine interactions that were previously impossible.

Rather than learning to program a robot, engineers and even factory workers will be able to give their robot assistants instructions like “move the bolt 2.2 inches to the right and 1 inch higher.”

Now, you don’t need to be a creative genius to see where this could go awry. Combining bits and atoms has proven difficult throughout the history of technology, and this will be no different. I expect many months of testing, and perhaps regulation, to play out before we see commercially available robots using RFM-1.

This is, however, a new paradigm we are entering in the age of AI. You can just feel the acceleration.

The Gallery

Zuck, Snoop and Leo as Vintage Samurai

Vintage samurai photos.
— Bojan Tunguz (@tunguz)
1:38 AM • Mar 12, 2024

Trailer for the future of humanity

I made a trailer for the future of humanity (e/acc)
— @levelsio (@levelsio)
6:08 PM • Mar 8, 2024

Maybe Claude 3 Is Not AGI After All

Is this your god
— well hello (@quantizor)
4:07 PM • Mar 10, 2024

Tools

Must-have tools for every artisan to add to their toolkit:

AutoMerger: Tool to automatically merge models on Hugging Face
Crazy Fast RAG with Ollama, Nomic Embedding Model, Groq API
Microsoft AICI: Sending mini WASM programs instead of prompts to LLM providers
Vid2Persona: Talk to a person from a video clip
Midjourney "Character Reference": Create Consistent Characters

Deep Tech

The newest and coolest in the research world that you need to know about:

Extropic (created by Beff Jezos) announced its new mission and a new computing paradigm: thermodynamic computing
Command-R: a model focused on scalability, RAG, and Tool Use
Deepseek VL 7B: New model using OCR, PDFs, web code, interleaved image-text, captions, tables, etc.
CRM: Single-image-to-3D model that integrates geometric relationships directly into its design
Tencent presents ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
BELLE: Whisper large v3 model fine-tuned on Chinese dataset

Closing Thought

❝

Cannot wait to yell at my robot dentist in the 2040s

DFM-1 (Dental Foundation Model 1) Thoughts

Work With Us!

The AI Renaissance is coming and we are building the best community of the people making it happen.

Contact us to sponsor your product or brand and reach the exact audience for your needs across our newsletter and podcast network.