Ai2 introduced MolmoWeb, an open-source web agent designed to automate browser tasks using visual understanding. Unlike traditional systems that rely on HTML, it interprets screenshots and performs actions like clicking, typing, and navigation.
The model is available in 4B and 8B parameter sizes and can run locally or in the cloud. A key feature is its full transparency, with open access to model weights, training data, and evaluation tools. This includes a large dataset of human and synthetic web interactions.
MolmoWeb aims to provide developers and researchers with a reproducible, customizable alternative to closed AI agents from major tech companies.





