Imagination and MulticoreWare get 50 GFLOPS of extra compute from Texas Instruments TDA4VM processor

Imagination Technologies and MulticoreWare have enabled GPU compute on the Texas Instruments TDA4VM processor, yielding large performance improvements in workloads used for autonomous driving and ADAS. By running the StereoBM algorithm on the GPU instead of the CPU, they report over 100× performance gains on a high-resolution (3200×2000) image. This allows automotive customers to get more performance per watt in sensor processing workloads. The collaboration demonstrates the potential of GPUs for compute and AI algorithms in automotive applications.

What do we think? Imagination has a long history in automotive, going back over two decades, and a solid customer base among the traditional auto semiconductor suppliers such as Texas Instruments. With the long design times and slow cadence of change in the auto segment, being able to deliver new compute performance on devices that have been in the market for a long while—like the Jacinto 7-based TI TDA4VM, which debuted in 2019—is a good thing.

Compute using IMG BXS GPU on the TI TDA4VM processor delivers 100× performance gains, MulticoreWare says

MulticoreWare Inc. and Imagination Technologies announced at CES 2024 that they have enabled GPU compute on the Texas Instruments TDA4VM processor, unleashing approximately 50 GFLOPS of extra compute and demonstrating a massive improvement in the performance of common workloads used for autonomous driving and advanced driver assistance systems (ADAS).

The partners claim over 100× performance gains when running a stereo block matching (StereoBM) algorithm on a high-resolution (3200×2000) image on the GPU rather than on the CPU. This enables automotive customers to use IMG BXS GPUs with OpenCL to achieve higher performance per watt on TDA4 SoCs for sensor processing workloads (camera, radar, and lidar), working alongside the other compute accelerators already on the platform.

At CES, Imagination Chief Product Officer James Chapman told us: “It supports our overall thrust of GPUs being a very, very good place to do some of these algorithms for compute and AI. So, we ported, as an example, a stereo block matching application where you’ve got two cameras, and you want to identify where are the objects. It’s a common thing that happens in ADAS systems, and it’s a good example of something that you can get to run fast on a GPU.”
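For readers unfamiliar with the technique Chapman describes, the following is a minimal sketch of stereo block matching in plain Python. It is not MulticoreWare's code, not OpenCV's StereoBM, and not OpenCL; the function names (`sad`, `block_match`, `make_shifted_pair`, `naive_comparisons`), the synthetic test image, and the parameter values are all assumptions made for illustration. For each pixel in the left image, it compares a small block against blocks in the right image at every candidate horizontal shift (disparity) and keeps the shift with the lowest sum of absolute differences.

```python
def sad(left, right, y, x, d, half):
    """Sum of absolute differences between the block centered at (y, x)
    in the left image and the block centered at (y, x - d) in the right."""
    total = 0
    for dy in range(-half, half + 1):
        for dx in range(-half, half + 1):
            total += abs(left[y + dy][x + dx] - right[y + dy][x + dx - d])
    return total


def block_match(left, right, max_disp, half=1):
    """Return a disparity map; border pixels that cannot hold a full
    block for every candidate disparity are left at 0."""
    height, width = len(left), len(left[0])
    disparity = [[0] * width for _ in range(height)]
    for y in range(half, height - half):
        for x in range(half + max_disp, width - half):
            costs = [sad(left, right, y, x, d, half) for d in range(max_disp + 1)]
            disparity[y][x] = costs.index(min(costs))  # lowest-cost shift wins
    return disparity


def make_shifted_pair(width, height, shift):
    """Hypothetical test data: a textured right image and a left image
    that is the same scene moved `shift` pixels to the right."""
    right = [[x * x + y for x in range(width)] for y in range(height)]
    left = [[right[y][max(x - shift, 0)] for x in range(width)] for y in range(height)]
    return left, right


def naive_comparisons(width, height, num_disp, block):
    """Pixel comparisons for a brute-force search with no reuse between
    neighboring blocks (real implementations reuse partial sums)."""
    return width * height * num_disp * block * block


if __name__ == "__main__":
    left, right = make_shifted_pair(width=16, height=6, shift=3)
    disp = block_match(left, right, max_disp=4, half=1)
    print(disp[2])
    # Assumed search parameters (64 disparities, 15x15 block) on a 3200x2000 frame.
    print(naive_comparisons(3200, 2000, 64, 15))
```

Every output pixel is computed independently of the others, which is why this kind of workload maps naturally onto a GPU's many parallel lanes. The comparison count at the end uses assumed search parameters, not figures from the announcement, and only hints at why a brute-force version on a full-resolution frame can saturate a CPU.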

MulticoreWare optimized the StereoBM algorithm to leverage the Imagination GPU cores more efficiently. The IMG BXS-4-64 GPU intrinsics (modeled camera data) and adaptive memory handling helped achieve optimal performance on higher-resolution camera data.

Chapman continued: “MulticoreWare realized we can run a lot of this from the GPU now. These GPUs are very capable, but I’m not sure the wider market thinks in those terms yet. They see that a GPU is needed for the surround view. The CPU is needed for this. The DSP is needed for that. And people get into that way of thinking about it. What this effort shows is we took something that was absolutely hammering the chip, loading up all the compute resources, and we got it to run with huge speedups from 20 to maybe even 100 times faster. The CPU workload was down at about 10%. Suddenly, you went from a device that was freaking out and almost falling over to hang on a minute, this device I’ve got in my system is now highly capable—orders of magnitude more.”


