Close Menu
  • Tech Insights
  • Laptops
  • Mobiles
  • Gaming
  • Apps
  • Money
  • Latest in Tech
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
TechzLab – Tech News, Gadgets, Mobile & IT UpdatesTechzLab – Tech News, Gadgets, Mobile & IT Updates
  • Tech Insights
  • Laptops
  • Mobiles
  • Gaming
  • Apps
  • Money
  • Latest in Tech
TechzLab – Tech News, Gadgets, Mobile & IT UpdatesTechzLab – Tech News, Gadgets, Mobile & IT Updates
Home » Gemini 1.5 Flash-8B With Lowest Token Cost Among Gemini Family Now Available
Tech Insights

Gemini 1.5 Flash-8B With Lowest Token Cost Among Gemini Family Now Available

adminBy adminOctober 4, 2024No Comments2 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email

Gemini 1.5 Flash-8B, the latest entrant in the Gemini family of artificial intelligence (AI) models, is now generally available for production use. On Thursday, Google announced the general availability of the model, highlighting that it was a smaller and faster version of the Gemini 1.5 Flash which was introduced at Google I/O. Due to being fast, it has a low latency inference and more efficient output generation. More importantly, the tech giant stated that the Flash-8B AI model is the “lowest cost per intelligence of any Gemini model”.

Gemini 1.5 Flash-8B Now Generally Available

In a developer blog post, the Mountain View-based tech giant detailed the new AI model. The Gemini 1.5 Flash-8B was distilled from the Gemini 1.5 Flash AI model, which was focused on faster processing and more efficient output generation. The company now claims that Google DeepMind developed this even smaller and faster version of the AI model in the last few months.

Despite being a smaller model, the tech giant claims that it “nearly matches” the performance of the 1.5 Flash model across multiple benchmarks. Some of these include chat, transcription, and long context language translation.

One major benefit of the AI model is its price effectiveness. Google said that the Gemini 1.5 Flash-8B will offer the lowest token pricing in the Gemini family. Developers will have to pay $0.15 (roughly Rs. 12.5) per one million output tokens, $0.0375 (roughly Rs. 3) per one million input tokens, and $0.01 (roughly Rs. 0.8) per one million tokens on cached prompts.

Additionally, Google is doubling the rate limits of the 1.5 Flash-8B AI model. Now, developers can send up to 4,000 requests per minute (RPM) while using this model. Explaining the decision, the tech giant stated that the model is suited for simple, high-volume tasks. Developers who wish to try out the model can do so via Google AI Studio and the Gemini API free of charge.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
admin
  • Website

Related Posts

The Surface Laptop Is $400 Off

January 28, 2026

Amazon UK slashes the price of Blink, Ring, Fire TV, Kindle and Echo devices — here are the 12 best deals I’d buy from £13.99 to upgrade my home

January 27, 2026

IndiGo flights disrupted by new FDTL rules: What it is and how to check your flight live status

January 26, 2026

Comments are closed.

Latest
  • SpaceX is coming to the public markets, and secondaries are already on fire | TechCrunch January 28, 2026
  • New Super Mario Bros. Wonder amiibo are up for preorder at Amazon, and you won’t believe how incredible they look January 28, 2026
  • Here’s our first look at Samsung’s upcoming 25W wireless charger January 28, 2026
  • ‘Windows 11 25H2 edges ahead of Windows 10 in gaming performance’: testing proves newer OS is faster — but there’s an elephant-sized BSOD in the room January 28, 2026
  • Lenovo’s new concept laptop has a rotating screen that’s perfect for doomscrolling – The Verge January 28, 2026
We are social
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo

Subscribe to Updates

Get the latest creative news from Techzlab.

Tags
AI Anthropic Apple Apps artificial intelligence ces 2026 ChatGPT cybersecurity data centers Donald Trump electric vehicles Elon Musk evergreens EVs Exclusive gemini Google Grok In Brief iPhone Meta Microsoft Netflix nvidia Openai Perplexity Pinterest robotics social media Softbank Solar Power SpaceX Spotify streaming TechCrunch Disrupt TechCrunch Disrupt 2025 techcrunch mobility Tesla Tiktok Trump Administration Uber Waymo Wind power X YouTube
Archives
Quick Link
  • Apps (385)
  • From the Editor (4)
  • Gaming (424)
  • Laptops (427)
  • Latest in Tech (423)
  • Mobiles (430)
  • Money (257)
  • Tech Insights (404)
Don't miss

The Surface Laptop Is $400 Off

January 28, 2026

Amazon UK slashes the price of Blink, Ring, Fire TV, Kindle and Echo devices — here are the 12 best deals I’d buy from £13.99 to upgrade my home

January 27, 2026

IndiGo flights disrupted by new FDTL rules: What it is and how to check your flight live status

January 26, 2026
Follow us
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
© 2026 Techzlab.com Designed and Developed by WebExpert.
  • Home
  • From the Editor
  • Money
  • Privacy Policy
  • Contact

Type above and press Enter to search. Press Esc to cancel.