xAI Grok 1.5 Vision Leverages Tesla FSD to Understand the Real World

Grok-1.5V is competitive with existing frontier multimodal models across a number of domains, ranging from multi-disciplinary reasoning to understanding documents, science diagrams, charts, screenshots, and photographs. Grok can also understand our physical world: it outperforms its peers on xAI's new RealWorldQA benchmark, which measures real-world spatial understanding. For all of the benchmark datasets, xAI evaluated Grok in a zero-shot setting without chain-of-thought prompting. This leading real-world understanding builds on Tesla data and the work on FSD (Full Self-Driving).

Recent reports indicate that Tesla has 100,000 Nvidia H100 chips or more, which would amount to roughly 400 exaflops of FP8 compute. Elon Musk has said that xAI will need 100,000 Nvidia H100 chips to train Grok 3. It is clear that xAI will push future versions of Grok toward real-world AI leadership in understanding video and audio.
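As a rough sanity check on that 400-exaflop figure, assuming Nvidia's headline number of about 4 petaflops of sparse FP8 throughput per H100 (dense FP8 is roughly half that), the arithmetic works out as follows:

```python
# Back-of-the-envelope aggregate FP8 compute for a 100,000-GPU H100 fleet.
# Assumes ~4 PFLOPS sparse FP8 per H100 (Nvidia's headline spec).
NUM_GPUS = 100_000
FP8_PFLOPS_PER_GPU = 4  # petaflops per H100, sparse FP8 (assumption)

total_exaflops = NUM_GPUS * FP8_PFLOPS_PER_GPU / 1_000  # 1 exaflop = 1,000 petaflops
print(f"{total_exaflops:.0f} exaflops")  # -> 400 exaflops
```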

Writing Code from a Hand-Drawn Flowchart
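One of xAI's demos shows Grok-1.5V turning a photo of a hand-drawn flowchart into working code. As an illustrative sketch only (the number-guessing flowchart and this Python are assumptions, not xAI's exact input or output), the generated program might look like:

```python
import random

# Hypothetical output for a hand-drawn flowchart of a number-guessing game:
# pick a secret number, then loop, hinting "higher"/"lower" until it is found.
def guessing_game(low: int = 1, high: int = 10) -> None:
    secret = random.randint(low, high)
    while True:
        guess = int(input(f"Guess a number between {low} and {high}: "))
        if guess < secret:
            print("Higher!")
        elif guess > secret:
            print("Lower!")
        else:
            print("Correct!")
            break

if __name__ == "__main__":
    guessing_game()
```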

Calories from Food Labels
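In the food-label demo, Grok reads nutrition facts off a photographed label and does the serving math. A minimal sketch of that arithmetic (the calorie and serving values here are hypothetical, not from xAI's example):

```python
# Hypothetical values read off a nutrition label; the model's real work is
# extracting these numbers from the photo before multiplying.
calories_per_serving = 105
servings_per_container = 5

total_calories = calories_per_serving * servings_per_container
print(f"Whole container: {total_calories} calories")  # -> 525 calories
```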

3 thoughts on “xAI Grok 1.5 Vision Leverages Tesla FSD to Understand the Real World”

  1. Grok has been spun up crazy fast. And there are so many competing models that it makes me think there is no secret sauce behind OpenAI, just a lot of data and compute. If that’s true, these models are trending toward becoming commodities.

  2. These multimodal things are rather impressive. They’re the closest thing to HAL9000 we’ve got.

    When the actual year 2000 arrived and neither Tycho lunar bases nor Discovery missions were available, I expected computers would actually follow the path set in 2001: A Space Odyssey and give us talking systems with human-like understanding.

    Well, in part they did, growing exponentially in capability; but no, they remained profoundly dumb machines. Until now.
