Product

Gemini 3.0 Flash + Mino API: When Reasoning Meets Real Execution

Sky Zhang·Dec 17, 2025·4 min read

Google Gemini 3.0 Flash is out now. Gemini models power Mino's infrastructure, so we integrated 3.0 Flash within hours and have been running extensive accuracy tests. The results are clear: 3.0 Flash delivers meaningfully better accuracy than its predecessor, and that improvement compounds dramatically when you're operating at scale.

A Test That Tells the Story

We gave three different systems the same task: "Go to Eventbrite, navigate to Mountain View California, and show me all jazz music events happening in the next 10 days."

This isn't a theoretical benchmark. The task requires multi-step navigation, location filtering, date-based queries, and dynamic content loading: everything a developer has to get right to build reliable applications at scale. The results were stark.

The improvement with Gemini 3.0 Flash was immediately visible. While 2.5 Flash completed the task in more than 5 minutes after working through the navigation flow and auto-reconfiguring CAPTCHA, 3.0 Flash executed the same workflow in about 4 minutes with noticeably more precise navigation decisions.

Mino + Gemini 2.5 Flash

<div style="padding:75% 0 0 0;position:relative;"><iframe src="https://player.vimeo.com/video/1147509373?badge=0&amp;autopause=0&amp;player_id=0&amp;app_id=58479" frameborder="0" allow="autoplay; fullscreen; picture-in-picture; clipboard-write; encrypted-media; web-share" referrerpolicy="strict-origin-when-cross-origin" style="position:absolute;top:0;left:0;width:100%;height:100%;" title="Mino + Gemini 2.5 Flash"></iframe></div><script src="https://player.vimeo.com/api/player.js"></script>

Mino + Gemini 3 Flash

<div style="padding:75% 0 0 0;position:relative;"><iframe src="https://player.vimeo.com/video/1147509766?badge=0&amp;autopause=0&amp;player_id=0&amp;app_id=58479" frameborder="0" allow="autoplay; fullscreen; picture-in-picture; clipboard-write; encrypted-media; web-share" referrerpolicy="strict-origin-when-cross-origin" style="position:absolute;top:0;left:0;width:100%;height:100%;" title="Mino + Gemini 3 Flash"></iframe></div><script src="https://player.vimeo.com/api/player.js"></script>

The real difference showed in execution quality: 3.0 Flash navigated directly to the jazz music category and selected Mountain View accurately on the first attempt. More importantly, the extracted results were better: more complete event details, cleaner structured data, and fewer inconsistencies that would require validation or cleanup downstream.

This isn't just about speed. It's about understanding intent more accurately, executing navigation with higher confidence, and producing cleaner output that you can immediately build systems on top of.

Other browser agents either didn't complete the task or took over 20 minutes before falling back to search-based approaches.

Why Speed and Accuracy Compound at Scale

Here's the critical insight about production web agents: a 5-percentage-point accuracy improvement on a single task becomes transformative when you're running thousands of operations daily.

Consider a realistic scenario: monitoring 100 competitor websites for pricing changes, twice daily. That's 200 operations per day, or 6,000 in a 30-day month. At 85% accuracy, you're getting 5,100 successful extractions and 900 failures that need manual review or retry. At 90% accuracy with Gemini 3.0 Flash, you're getting 5,400 successes and 600 failures.

That's 300 fewer failures per month, which means fewer retry costs, less manual intervention, and more reliable data pipelines. Multiply this across enterprise workflows tracking inventory across 500 supplier portals, or aggregating appointment availability from 200 service providers, and accuracy improvements translate directly to operational reliability and cost savings.
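To make the arithmetic easy to rerun with your own numbers, here is a minimal Python sketch of the scenario above. The function name `monthly_outcomes` and the 30-day month are our own assumptions for illustration, and the 85% and 90% rates are the example figures from this post, not measured benchmarks.

```python
def monthly_outcomes(sites: int, checks_per_day: int, days: int, accuracy: float) -> tuple[int, int]:
    """Return (successes, failures) for a month of operations at a given success rate."""
    operations = sites * checks_per_day * days
    # Round to the nearest whole operation to avoid floating-point drift.
    successes = round(operations * accuracy)
    return successes, operations - successes


# 100 sites, checked twice daily, over an assumed 30-day month.
baseline = monthly_outcomes(100, 2, 30, 0.85)
improved = monthly_outcomes(100, 2, 30, 0.90)
failures_avoided = baseline[1] - improved[1]

print(baseline, improved, failures_avoided)  # (5100, 900) (5400, 600) 300
```

Swap in 500 supplier portals or 200 service providers to see how the gap in failures grows with volume.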

We've been testing 3.0 Flash across workflows that require understanding nuanced website behaviors: identifying the right navigation path when multiple options exist, distinguishing between similar-looking elements with different functions, handling unexpected popups or modal dialogs that interrupt workflows. The model's ability to reason through these ambiguities more accurately means workflows that previously needed multiple retry attempts now succeed on first execution.

What This Means for Developers

If you tried consumer browser agents and found them too inconsistent for production use cases, Gemini 3.0 Flash + Mino changes what's possible. Better reasoning about website structures, better decision-making in navigation flows, better handling of edge cases that break traditional scrapers.

You get higher success rates on complex workflows, and parallel execution across multiple sites with each operation benefiting from improved model reasoning. You also get built-in handling for dynamic UIs, JavaScript-heavy content, and anti-bot protection, all executing with the accuracy improvements that 3.0 Flash brings.

Results come back as structured JSON, not screenshots you need to parse. And critically, the workflows work reliably enough to build production systems on top of.
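As a rough illustration of why structured output matters, the sketch below filters a JSON result for the Eventbrite task using only Python's standard library. The payload shape, the field names (`events`, `title`, `city`, `date`), and the sample events are all hypothetical, made up for this example; they are not Mino's documented response schema.

```python
import json
from datetime import date, timedelta

# Hypothetical payload: invented sample data, not real Mino output.
raw = """
{"events": [
  {"title": "Jazz Night", "city": "Mountain View", "date": "2025-12-20"},
  {"title": "Late Set", "city": "Mountain View", "date": "2026-01-15"},
  {"title": "", "city": "Mountain View", "date": "2025-12-22"}
]}
"""


def events_in_window(payload: str, today: date, days: int) -> list[dict]:
    """Keep events with a non-empty title dated between today and today + days, inclusive."""
    cutoff = today + timedelta(days=days)
    kept = []
    for event in json.loads(payload)["events"]:
        when = date.fromisoformat(event["date"])
        if event["title"] and today <= when <= cutoff:
            kept.append(event)
    return kept


upcoming = events_in_window(raw, today=date(2025, 12, 17), days=10)
print([event["title"] for event in upcoming])  # ['Jazz Night']
```

With screenshots, the same "next 10 days" check would need OCR or a second model call; with JSON it is a few lines of ordinary code.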

Try Mino Now

Consumer browser agents are not infrastructure for developers who need to execute workflows repeatedly, reliably, and economically. That's not a criticism; they're solving different problems for different users.

Mino with Gemini 3.0 Flash solves the production problem. One API endpoint. Natural language instructions. Deterministic execution that improves with use. Infrastructure that actually scales when you need to access the 95% of the web that doesn't have APIs.

We're launching Gemini 3.0 Flash in production today. If you're building anything that needs reliable web automation at scale, this is the stack that makes it economically viable.

https://mino.ai/
