Llama 2 to run locally for you with Snapdragon

Qualcomm plans to offer Llama 2-based AI implementations on flagship smartphones and PCs starting in 2024. This will enable developers to create generative AI applications using Snapdragon platforms. On-device AI implementation provides benefits such as increased user privacy, improved application reliability, personalized experiences, and lower costs compared to cloud-based AI services.

(Source: Meta)

Qualcomm Technologies and Meta say they have collaborated to optimize the performance of Meta’s Llama 2 large language models for use on devices locally, eliminating the need for reliance on cloud services. This advancement enables the execution of generative AI models, like Llama 2, on various devices such as smartphones, PCs, VR/AR headsets, and vehicles. This approach offers several advantages, including cost savings on cloud services, enhanced privacy, improved reliability, and personalized experiences for users.

With the aim of facilitating the creation of AI applications, Qualcomm plans to provide on-device Llama 2-based AI implementations. This will allow developers to create various use cases, including intelligent virtual assistants, productivity tools, content creation applications, entertainment, and more. These on-device AI experiences, powered by Snapdragon, can, says Qualcomm, even function in areas with no Internet connectivity or airplane mode.

Meta and Qualcomm have a history of collaboration, most notably in VR HMDs. Their current joint efforts support the Llama ecosystem, which includes research and product engineering efforts. Qualcomm says their presence in on-device AI uniquely positions it to support the Llama ecosystem, given its vast footprint at the edge with billions of smartphones, vehicles, XR headsets and glasses, PCs, IoT devices, and more powered by its AI hardware and software solutions, enabling the scaling of generative AI.

The availability of Llama 2-based AI implementation on Snapdragon-powered devices is set to begin in 2024. Developers can start optimizing applications for on-device AI using the Q u alcomm AI Stack, a dedicated set of tools that enhance AI processing efficiency on Snapdragon, making on-device AI feasible even in small, thin, and light devices. Those interested can subscribe to their monthly developer newsletter for updates.

Press Releases

Q1’22 saw a decline in GPU and PC shipments quarter-to-quarter

The PC GPU market shipments decreased by -6.2% sequentially from last quarter and decreased by -19% year-to-year.

TSMC’s 20-angstrom process kicks off the sub-atomic era+

If Moore’s Law is dead, someone forgot to tell TSMC

AIAppleGenerative AIMicrosoftOpenAI

Apple launches Apple Intelligence

Generative models coming to iPhone, iPad, and Mac.

2024 Worldwide CAD Report

March 20, 2024

The Worldwide CAD Report JPR’s CAD market report has been published since 2005. As a result, it comes with a strong historical perspective as well as current data on the rapidly changing CAD industry. The 2024 report provides information on market segments, individual company market shares, new workflows, and new players.

Worldwide CAD Report Executive Summary
Worldwide CAD Report Table of Contents

learn more

The Arm IPO—background and possibilities – Predictions, potentials, and pitfalls

September 9, 2023

The Arm IPO—background and possibilities - Predictions, potentials, and pitfalls In the first part of this report, we look at the buildup to the Arm IPO. We look at where the company is on the starting blocks, pose questions about valuation, and the edge IoT and data center opportunities. The second section covers the competitive landscape, Arm product pricing models, why Arm GPU isn’t in the desktop and data center yet, and the relationship with Arm China. The third section covers the failed sale to Nvidia—and the reported new strategy for Arm growth revealed through the dispute with Qualcomm, including new licensing and royalty models and the chipset business. The report is full of insights, predictions, and some never-before-revealed aspects of Arm China and the IP industry.

Table of Contents

learn more

TV Gaming Hardware market study – advanced financial modeling of the global TV Gaming Hardware market

April 19, 2023

TV Gaming Hardware market study - Bi-annual, advanced financial modeling of the global TV Gaming Hardware market TV and Cloud Gaming market study – advanced financial modeling of the global PC Gaming Hardware market. Jon Peddie Research’s TV and Cloud Gaming market study is a supply-side series, it establishes the TV Gaming Hardware market size by value, platform, and unit shipments. TV and Cloud Gaming market study subscription consists of two issuances per year and gives one year of history, a current year estimate, and a three-year forecast. For a subscription that includes models of the market released bi-annually please click here. Contact us now if you would like to receive a sample of the report.

learn more

Llama 2 to run locally for you with Snapdragon

Related posts

Q1’22 saw a decline in GPU and PC shipments quarter-to-quarter

TSMC’s 20-angstrom process kicks off the sub-atomic era+

Apple launches Apple Intelligence

Recent products