Honor’s New AI Agent Can Read and Understand Your Screen

Honor’s New AI Agent Can Read and Understand Your Screen Leave a comment

It selected a restaurant, however then could not full the method because the spot it selected required a bank card to verify a reservation, at which level the person needed to take over. You might be versatile in your question—in one other instance, asking it to e book a “extremely rated” restaurant meant it will have a look at evaluations with excessive scores, although the agent does not do any extra analysis than that. It is not cross-referencing OpenTable evaluations with information from different components of the net, particularly since all of this information is processed on machine and is not despatched to the cloud.

This sort of agentic synthetic intelligence is the present buzzword within the tech sphere. My colleague Will Knight just lately examined an AI assistant that might browse the net and carry out duties on-line. Google late final 12 months unveiled its Gemini 2 AI mannequin educated to take actions in your behalf. It additionally renews the thought of a generative person interface for smartphones—at MWC 2024, we noticed a couple of corporations engaged on methods to work together with apps with out utilizing apps in any respect, as a substitute leaning on AI assistants to generate a person interface as you issued a command.

Honor’s strategy feels considerably like what Rabbit—of the notorious Rabbit R1—is doing with Educate Mode, the place you practice its assistant manually to finish a job. There is not any must entry an app’s Software Programming Interface (API), which is the normal manner apps or companies talk with one another. The agent memorizes the method, permitting you to then situation the command and have it execute the duty.

However Honor says its self-reliant AI execution mannequin is not educated to comply with strict steps—it is able to multimodal display screen context recognition to carry out duties autonomously. As a substitute of getting to coach the assistant to be taught each single a part of the OpenTable app, it’s able to understanding the semantic parts of the person interface and can follow-through with a multi-step course of to execute your request. Honor highlighted that this course of was more economical: “In contrast to opponents reminiscent of Apple, Samsung, and Google, which depend on exterior APIs—leading to larger operational prices—Honor’s AI Agent independently manages a variety of duties.”

{Photograph}: Julian Chokkattu

Leave a Reply

Your email address will not be published. Required fields are marked *