Wednesday, 6 May 2026
Subscribe
logo logo
  • Global
  • Technology
  • Business
  • AI
  • Cloud
  • Edge Computing
  • Security
  • Investment
  • More
    • Sustainability
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
  • 🔥
  • data
  • revolutionizing
  • Stock
  • Investment
  • Future
  • Secures
  • Growth
  • Top
  • Funding
  • Power
  • Center
  • technology
Font ResizerAa
Silicon FlashSilicon Flash
Search
  • Global
  • Technology
  • Business
  • AI
  • Cloud
  • Edge Computing
  • Security
  • Investment
  • More
    • Sustainability
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Silicon Flash > Blog > AI > Revolutionizing Robotics AI with 3D Thinking: A Challenge to Nvidia and Google
AI

Revolutionizing Robotics AI with 3D Thinking: A Challenge to Nvidia and Google

Published August 13, 2025 By Juwan Chacko
Share
5 Min Read
Revolutionizing Robotics AI with 3D Thinking: A Challenge to Nvidia and Google
SHARE

Summary:

  1. Ai2 introduces MolmoAct 7B, an open-source model that allows robots to reason in 3D space, challenging Nvidia and Google in physical AI.
  2. MolmoAct can understand the physical world, plan interactions, and adapt to different embodiments with minimal fine-tuning.
  3. Benchmark testing shows MolmoAct 7B outperformed models from Google, Microsoft, and Nvidia, marking a step forward in physical AI development.

    Article:

    In the realm of physical AI, a new player has emerged to challenge industry giants like Nvidia and Google. The Allen Institute for AI (Ai2) has unveiled MolmoAct 7B, an open-source model designed to revolutionize how robots reason in 3D space. With a focus on action reasoning within a physical environment, MolmoAct sets itself apart from traditional vision-language-action models by enabling robots to "think" and plan their interactions in a spatial context.

    What makes MolmoAct unique is its ability to understand the physical world and make informed decisions on how to navigate and interact with its surroundings. By outputting spatially grounded perception tokens and encoding geometric structures, the model can estimate distances between objects, predict movement waypoints, and execute specific actions with precision. Ai2’s benchmark testing showcased MolmoAct 7B’s impressive task success rate of 72.1%, surpassing models from industry leaders like Google, Microsoft, and Nvidia.

    Experts in the field, such as Alan Fern from Oregon State University, view Ai2’s research as a significant step forward in enhancing vision-language models for robotics and physical reasoning. While acknowledging the improvements made by MolmoAct, Fern emphasizes the need for further advancements to capture real-world complexities. Meanwhile, Daniel Maturana from Gather AI applauds Ai2’s data openness, recognizing the model as a valuable foundation for future development and refinement by academic labs and hobbyists alike.

    As interest in physical AI continues to grow, companies and researchers are exploring innovative ways to enhance robot capabilities. From Google’s SayCan for task reasoning to Meta and NYU’s OK-Robot for movement planning, the integration of large language models is reshaping the landscape of robotics development. With initiatives like Hugging Face’s affordable desktop robot and Nvidia’s Cosmos-Transfer1 model, the democratization of robotics technology is on the rise. Despite limited demos, the future of physical AI looks promising as advancements in model development and training pave the way for more intelligent and spatially aware robots. Summary:

  4. Achieving general physical intelligence for robots is becoming easier, eliminating the need for individually programming actions.
  5. Large physical intelligence models are still in early stages, offering opportunities for rapid advancements.
  6. The landscape is challenging but exciting for advancements in physical intelligence for robots.

    Article:

    Heading (H1):
    Advancements in Achieving General Physical Intelligence for Robots

    Heading (H2):
    Easier Path to General Physical Intelligence and its Exciting Potential

    Robots have long been a fascination for humans, with the idea of machines possessing general physical intelligence becoming more attainable. The need for individually programming actions for robots is diminishing, making way for a more efficient and advanced approach to their functionality. This shift in focus towards achieving general physical intelligence is paving the way for groundbreaking advancements in the field of robotics.

    Heading (H2):
    The Challenges and Opportunities in Large Physical Intelligence Models

    While the landscape may present challenges in the quest for general physical intelligence, there is still plenty of room for growth and innovation. Large physical intelligence models are still in their early stages, offering a ripe opportunity for rapid advancements. This exciting space in robotics is attracting researchers and developers alike, eager to explore the vast potential that lies ahead.

    Heading (H2):
    Navigating the Future of Robotics with General Physical Intelligence

    As we continue to push the boundaries of what is possible in the realm of robotics, the concept of general physical intelligence holds immense promise. The journey towards achieving this goal may be challenging, but the rewards are well worth the effort. With advancements in large physical intelligence models on the horizon, the future of robotics is set to be nothing short of revolutionary. Embracing this exciting space will undoubtedly lead to groundbreaking developments that will shape the future of technology as we know it.

See also  Revolutionizing Image Editing: Qwen-Image Edit's AI-Powered Text-to-Image Technology Takes on Photoshop
TAGGED: challenge, Google, Nvidia, revolutionizing, Robotics, thinking
Share This Article
Facebook LinkedIn Email Copy Link Print
Previous Article BCS Diversifies Offerings with Expanded Utilities Division BCS Diversifies Offerings with Expanded Utilities Division
Next Article Shielding Small Businesses: A Comprehensive Guide to Cyber Insurance Readiness Shielding Small Businesses: A Comprehensive Guide to Cyber Insurance Readiness
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
LinkedInFollow

Popular Posts

Is BigBear.ai Stock (BBAI) a Bargain Buy After 28% Drop?

Summary: BigBear.ai is a smaller AI company in the government space, with its stock down…

August 29, 2025

Unleashing the Power: The UK’s Most Advanced AI Supercomputer Activated

National Grid Electricity Distribution (NGED) recently powered up the UK’s most advanced AI supercomputer in…

July 23, 2025

Crypto.com Secures EU Approval for Launch of Crypto Financial Derivatives

Crypto.com has recently obtained a MiFID license, allowing it to offer regulated crypto derivatives in…

May 26, 2025

Alt Mobility Secures Funding from Beyond Capital Ventures

Alt Mobility: Revolutionizing Electric Vehicle Leasing in India Alt Mobility, a leading full-stack electric vehicle…

May 9, 2025

Restrictions on Essential Senior Services in New Medicare Advantage Rule

Summary: 1. Medicare Advantage plans are facing new coverage restrictions, affecting services that retirees may…

January 11, 2026

You Might Also Like

Revolutionizing Entertainment: OpenAI and Reliance Collaborate to Enhance JioHotstar with AI-Powered Search
Business

Revolutionizing Entertainment: OpenAI and Reliance Collaborate to Enhance JioHotstar with AI-Powered Search

Juwan Chacko
Revolutionizing Enterprise Treasury Management with AI Advancements
AI

Revolutionizing Enterprise Treasury Management with AI Advancements

Juwan Chacko
Revolutionizing Network Testing with Spirent Luma’s Agentic AI: A Game-Changer in Triage Time Reduction
Global Market

Revolutionizing Network Testing with Spirent Luma’s Agentic AI: A Game-Changer in Triage Time Reduction

Juwan Chacko
Unveiling the Top Holdings of the Vanguard ETF: Nvidia, Apple, Microsoft, and Alphabet
Investments

Unveiling the Top Holdings of the Vanguard ETF: Nvidia, Apple, Microsoft, and Alphabet

Juwan Chacko
logo logo
Facebook Linkedin Rss

About US

Silicon Flash: Stay informed with the latest Tech News, Innovations, Gadgets, AI, Data Center, and Industry trends from around the world—all in one place.

Top Categories
  • Technology
  • Business
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2025 – siliconflash.com – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?