Rendered at 17:29:41 GMT+0000 (Coordinated Universal Time) with Cloudflare Workers.
gregpr07 13 minutes ago [-]
Love it! From first principles: this kinda answers the "do we really even need CDP" I always have in my head building browser use...
theredsix 5 minutes ago [-]
Totally, I feel that CDP was designed for a different category of automations.
giancarlostoro 2 hours ago [-]
Interesting, I wonder if this would help with other projects too, one project that comes to mind is archivebox, I don't know if they still have the issue I'm thinking of, but archivebox eventually had the Chrome instances (as the meme goes) basically consume all available RAM. If by freezing execution this could stop that, it could be useful for more than just AI agents.
Retr0id 1 hours ago [-]
> As proof, ABP with opus 4.6 as the driver scores 90.5% on the Online Mind2Web benchmark
And what does opus score with "regular" browser harnesses?
9wzYQbTYsAIc 24 minutes ago [-]
90% easy or 90% average?
theredsix 4 minutes ago [-]
90% average!
9wzYQbTYsAIc 2 minutes ago [-]
Nice! Will take a look at this for my homelab - was debating using crawl.cloudflare.com to try it out, as browser rendering was my next stretch goal.
I tweeted at the OSUNLP and they're backed up on eval validation. In the meantime, here's the benchmark repo with the saved runs and also instructions on how to run it locally. https://github.com/theredsix/abp-online-mind2web-results