Google releases Gemma 2 open source AI model with 9/27 billion parameters: performance is better than its peers and can be run on a single A100/H100 GPU

Google issued a press release yesterday, releasing the Gemma 2 large language model to researchers and developers around the world . It comes in two sizes: 9 billion parameters (9B) and 27 billion parameters (27B).

Compared with the first generation, the Gemma 2 large language model has higher reasoning performance and efficiency, and has made significant progress in security.

Google said in a press release that the performance of the Gemma 2-27B model is comparable to mainstream models twice its size, and this performance can be achieved with just one NVIDIA H100 ensor Core GPU or TPU host, greatly reducing deployment costs.

The Gemma 2-9B model outperforms Llama 3 8B and other open source models of similar size. Google also plans to release a Gemma 2 model with 2.6 billion parameters in the coming months, which is more suitable for AI application scenarios on smartphones.

Google said it has redesigned the overall architecture for Gemma 2 to achieve excellent performance and reasoning efficiency. IT Home attached the main features of Gemma 2 as follows:

Excellent performance:

The 27B version is the best performer in its size class, even more competitive than models twice its size. The 9B version is also the best performer in its class, outperforming the Llama 3 8B and other open models of the same size.

Efficiency and cost:

The 27B Gemma 2 model can efficiently run inference at full precision on a single Google Cloud TPU host, NVIDIA A100 80GB Tensor Core GPU, or NVIDIA H100 Tensor Core GPU, maintaining high performance while significantly reducing costs. This makes AI deployment more accessible and budget-friendly.

Fast inference across hardware

Gemma 2 is optimised to run at blazing speeds on a wide range of hardware, from powerful gaming laptops and high-end desktops to cloud-based setups.

Try Gemma 2 at full precision in Google AI Studio, unlock native performance on your CPU using the quantised version of Gemma.cpp , or try it on your home PC with NVIDIA RTX or GeForce RTX with Hugging Face Transformers.

Tags: Google News World News

Google releases Gemma 2 open source AI model with 9/27 billion parameters: performance is better than its peers and can be run on a single A100/H100 GPU

Related News

7 watch faces leak ahead of Samsung Galaxy Watch 8 launch

Visual Open Source AI Reasoning Library YOLOv11 Infested With Supply Chain

Microsoft pushes 25H2 updates to Windows 11 24H2 devices

“Call of Duty 21: Black Ops 6” to be released on PS4, Xbox One

Blackberry’s first-quarter loss of $42 million smaller than expected as road to profitability gradually showing signs of light

Sony’s Sword Stars first-day patch removes potentially racist gameplay footage

Latest News

Poland defends new border checks as German officials warn of fallout

JAMB releases 2025 mop-up UTME results, records only 12% turnout

Olubadan of Ibadanland, Oba Owolabi Olakulehin, dies at 90

Cross River Chief of Staff reaffirms loyalty to Governor Otu, donates to church

Calls intensify for NOUN study centre in Ogoja

Mbappé’s bicycle kick seals Real Madrid’s 3-2 win over Dortmund

About Us

Social Media

Coverage

Special Pages

Google releases Gemma 2 open source AI model with 9/27 billion parameters: performance is better than its peers and can be run on a single A100/H100 GPU

Excellent performance:

Efficiency and cost:

READ ALSO:

Fast inference across hardware

Related News

Latest News

About Us

Social Media

Coverage

Special Pages